Author |
Josep Llados; Horst Bunke; Enric Marti |

Title |
Finding rotational symmetries by cyclic string matching |
Type |
Journal Article |
Year |
1997 |
Publication |
Pattern recognition letters |
Abbreviated Journal |
Volume  |
18 |
Issue |
14 |
Pages |
1435-1442 |
Keywords |
Rotational symmetry; Reflectional symmetry; String matching |
Abstract |
Symmetry is an important shape feature. In this paper, a simple and fast method to detect perfect and distorted rotational symmetries of 2D objects is described. The boundary of a shape is polygonally approximated and represented as a string. Rotational symmetries are found by cyclic string matching between two identical copies of the shape string. The set of minimum cost edit sequences that transform the shape string to a cyclically shifted version of itself defines the rotational symmetry and its order. Finally, a modification of the algorithm is proposed to detect reflectional symmetries. Some experimental results are presented to show the reliability of the proposed algorithm. |
Address |
Corporate Author |
Thesis |
Publisher |
Elsevier |
Place of Publication |
Editor |
Language |
Summary Language |
Original Title |
Series Editor |
Series Title |
Abbreviated Series Title |
Series Volume |
Series Issue |
Edition |
Medium |
Area |
Expedition |
Conference |
Notes |
Approved |
no |
Call Number |
IAM @ iam @ LBM1997a |
Serial |
1562 |
Author |
Lluis Pere de las Heras; Oriol Ramos Terrades; Sergi Robles; Gemma Sanchez |

Title |
CVC-FP and SGT: a new database for structural floor plan analysis and its groundtruthing tool |
Type |
Journal Article |
Year |
2015 |
Publication |
International Journal on Document Analysis and Recognition |
Abbreviated Journal |
Volume  |
18 |
Issue |
1 |
Pages |
15-30 |
Keywords |
Abstract |
Recent results on structured learning methods have shown the impact of structural information in a wide range of pattern recognition tasks. In the field of document image analysis, there is long experience with structural methods for the analysis and information extraction of multiple types of documents. Yet, the lack of conveniently annotated and freely accessible databases has hindered progress in some areas such as technical drawing understanding. In this paper, we present a floor plan database, named CVC-FP, that is annotated for the architectural objects and their structural relations. To construct this database, we have implemented a groundtruthing tool, the SGT tool, that allows this sort of information to be specified in a natural manner. This tool has been made for general purpose groundtruthing: it allows users to define their own object classes and properties, offers multiple labeling options, supports cooperative work, and provides user and version control. Finally, we have collected some of the recent work on floor plan interpretation and present a quantitative benchmark for this database. Both the CVC-FP database and the SGT tool are freely released to the research community to ease comparisons between methods and boost reproducible research. |
Address |
Corporate Author |
Thesis |
Publisher |
Springer Berlin Heidelberg |
Place of Publication |
Editor |
Language |
Summary Language |
Original Title |
Series Editor |
Series Title |
Abbreviated Series Title |
Series Volume |
Series Issue |
Edition |
ISSN |
1433-2833 |
Medium |
Area |
Expedition |
Conference |
Notes |
DAG; ADAS; 600.061; 600.076; 600.077 |
Approved |
no |
Call Number |
Admin @ si @ HRR2015 |
Serial |
2567 |
Author |
Christophe Rigaud; Clement Guerin; Dimosthenis Karatzas; Jean-Christophe Burie; Jean-Marc Ogier |

Title |
Knowledge-driven understanding of images in comic books |
Type |
Journal Article |
Year |
2015 |
Publication |
International Journal on Document Analysis and Recognition |
Abbreviated Journal |
Volume  |
18 |
Issue |
3 |
Pages |
199-221 |
Keywords |
Document Understanding; comics analysis; expert system |
Abstract |
Document analysis is an active field of research, which can attain a complete understanding of the semantics of a given document. One example of the document understanding process is enabling a computer to identify the key elements of a comic book story and arrange them according to predefined domain knowledge. In this study, we propose a knowledge-driven system that can interact with bottom-up and top-down information to progressively understand the content of a document. We model knowledge of both the comic book domain and the image processing domain for information consistency analysis. In addition, different image processing methods are improved or developed to extract panels, balloons, tails, texts, comic characters and their semantic relations in an unsupervised way. |
Address |
Corporate Author |
Thesis |
Publisher |
Springer Berlin Heidelberg |
Place of Publication |
Editor |
Language |
Summary Language |
Original Title |
Series Editor |
Series Title |
Abbreviated Series Title |
Series Volume |
Series Issue |
Edition |
ISSN |
1433-2833 |
Medium |
Area |
Expedition |
Conference |
Notes |
DAG; 600.056; 600.077 |
Approved |
no |
Call Number |
RGK2015 |
Serial |
2595 |
Author |
David Aldavert; Marçal Rusiñol; Ricardo Toledo; Josep Llados |

Title |
A Study of Bag-of-Visual-Words Representations for Handwritten Keyword Spotting |
Type |
Journal Article |
Year |
2015 |
Publication |
International Journal on Document Analysis and Recognition |
Abbreviated Journal |
Volume  |
18 |
Issue |
3 |
Pages |
223-234 |
Keywords |
Bag-of-Visual-Words; Keyword spotting; Handwritten documents; Performance evaluation |
Abstract |
The Bag-of-Visual-Words (BoVW) framework has gained popularity among the document image analysis community, specifically as a representation of handwritten words for recognition or spotting purposes. Although in the computer vision field the BoVW method has been greatly improved, most of the approaches in the document image analysis domain still rely on the basic implementation of the BoVW method, disregarding these latest refinements. In this paper, we present a review of those improvements and their application to the keyword spotting task. We thoroughly evaluate their impact against a baseline system on the well-known George Washington dataset and compare the obtained results against nine state-of-the-art keyword spotting methods. In addition, we also compare both the baseline and improved systems with the methods presented at the Handwritten Keyword Spotting Competition 2014. |
Address |
Corporate Author |
Thesis |
Publisher |
Springer Berlin Heidelberg |
Place of Publication |
Editor |
Language |
Summary Language |
Original Title |
Series Editor |
Series Title |
Abbreviated Series Title |
Series Volume |
Series Issue |
Edition |
ISSN |
1433-2833 |
Medium |
Area |
Expedition |
Conference |
Notes |
DAG; ADAS; 600.055; 600.061; 601.223; 600.077; 600.097 |
Approved |
no |
Call Number |
Admin @ si @ ART2015 |
Serial |
2679 |
Author |
Lluis Pere de las Heras; Ahmed Sheraz; Marcus Liwicki; Ernest Valveny; Gemma Sanchez |

Title |
Statistical Segmentation and Structural Recognition for Floor Plan Interpretation |
Type |
Journal Article |
Year |
2014 |
Publication |
International Journal on Document Analysis and Recognition |
Abbreviated Journal |
Volume  |
17 |
Issue |
3 |
Pages |
221-237 |
Keywords |
Abstract |
A generic method for floor plan analysis and interpretation is presented in this article. The method, which is mainly inspired by the way engineers draw and interpret floor plans, applies two recognition steps in a bottom-up manner. First, basic building blocks, i.e., walls, doors, and windows, are detected using a statistical patch-based segmentation approach. Second, a graph is generated, and structural pattern recognition techniques are applied to further locate the main entities, i.e., rooms of the building. The proposed approach is able to analyze any type of floor plan regardless of the notation used. We have evaluated our method on different publicly available datasets of real architectural floor plans with different notations. The overall detection and recognition accuracy is about 95%, which is significantly better than any other state-of-the-art method. Our approach is generic enough that it could be easily adapted to the recognition and interpretation of any other printed machine-generated structured documents. |
Address |
Corporate Author |
Thesis |
Publisher |
Springer Berlin Heidelberg |
Place of Publication |
Editor |
Language |
Summary Language |
Original Title |
Series Editor |
Series Title |
Abbreviated Series Title |
Series Volume |
Series Issue |
Edition |
ISSN |
1433-2833 |
Medium |
Area |
Expedition |
Conference |
Notes |
DAG; ADAS; 600.076; 600.077 |
Approved |
no |
Call Number |
HSL2014 |
Serial |
2370 |
Author |
Marçal Rusiñol; Lluis Pere de las Heras; Oriol Ramos Terrades |

Title |
Flowchart Recognition for Non-Textual Information Retrieval in Patent Search |
Type |
Journal Article |
Year |
2014 |
Publication |
Information Retrieval |
Abbreviated Journal |
IR |
Volume  |
17 |
Issue |
5-6 |
Pages |
545-562 |
Keywords |
Flowchart recognition; Patent documents; Text/graphics separation; Raster-to-vector conversion; Symbol recognition |
Abstract |
Relatively little research has been done on the topic of patent image retrieval and, in general, most approaches perform retrieval in terms of a similarity measure between the query image and the images in the corpus. However, systems aimed at overcoming the semantic gap between the visual description of patent images and their conveyed concepts would be very helpful for patent professionals. In this paper we present a flowchart recognition method aimed at achieving a structured representation of flowchart images that can be further queried semantically. The proposed method was submitted to the CLEF-IP 2012 flowchart recognition task. We report the obtained results on this dataset. |
Address |
Corporate Author |
Thesis |
Publisher |
Place of Publication |
Editor |
Language |
Summary Language |
Original Title |
Series Editor |
Series Title |
Abbreviated Series Title |
Series Volume |
Series Issue |
Edition |
ISSN |
1386-4564 |
Medium |
Area |
Expedition |
Conference |
Notes |
DAG; 600.077 |
Approved |
no |
Call Number |
Admin @ si @ RHR2013 |
Serial |
2342 |
Author |
David Fernandez; Josep Llados; Alicia Fornes |

Title |
A graph-based approach for segmenting touching lines in historical handwritten documents |
Type |
Journal Article |
Year |
2014 |
Publication |
International Journal on Document Analysis and Recognition |
Abbreviated Journal |
Volume  |
17 |
Issue |
3 |
Pages |
293-312 |
Keywords |
Text line segmentation; Handwritten documents; Document image processing; Historical document analysis |
Abstract |
Text line segmentation in handwritten documents is an important task in the recognition of historical documents. Handwritten document images contain text lines with multiple orientations, touching and overlapping characters between consecutive text lines and different document structures, making line segmentation a difficult task. In this paper, we present a new approach for handwritten text line segmentation solving the problems of touching components, curvilinear text lines and horizontally overlapping components. The proposed algorithm formulates line segmentation as finding the central path in the area between two consecutive lines. This is solved as a graph traversal problem. A graph is constructed using the skeleton of the image. Then, a path-finding algorithm is used to find the optimum path between text lines. The proposed algorithm has been evaluated on a comprehensive dataset consisting of five databases: ICDAR2009, ICDAR2013, UMD, the George Washington and the Barcelona Marriages Database. The proposed method outperforms the state-of-the-art considering the different types and difficulties of the benchmarking data. |
Address |
Corporate Author |
Thesis |
Publisher |
Springer Berlin Heidelberg |
Place of Publication |
Editor |
Language |
Summary Language |
Original Title |
Series Editor |
Series Title |
Abbreviated Series Title |
Series Volume |
Series Issue |
Edition |
ISSN |
1433-2833 |
Medium |
Area |
Expedition |
Conference |
Notes |
DAG; 600.056; 600.061; 602.006; 600.077 |
Approved |
no |
Call Number |
Admin @ si @ FLF2014 |
Serial |
2459 |
Author |
Marçal Rusiñol; Volkmar Frinken; Dimosthenis Karatzas; Andrew Bagdanov; Josep Llados |

Title |
Multimodal page classification in administrative document image streams |
Type |
Journal Article |
Year |
2014 |
Publication |
International Journal on Document Analysis and Recognition |
Abbreviated Journal |
Volume  |
17 |
Issue |
4 |
Pages |
331-341 |
Keywords |
Digital mail room; Multimodal page classification; Visual and textual document description |
Abstract |
In this paper, we present a page classification application in a banking workflow. The proposed architecture represents administrative document images by merging visual and textual descriptions. The visual description is based on a hierarchical representation of the pixel intensity distribution. The textual description uses latent semantic analysis to represent document content as a mixture of topics. Several off-the-shelf classifiers and different strategies for combining visual and textual cues have been evaluated. A final step uses an n-gram model of the page stream allowing a finer-grained classification of pages. The proposed method has been tested in a real large-scale environment and we report results on a dataset of 70,000 pages. |
Address |
Corporate Author |
Thesis |
Publisher |
Springer Berlin Heidelberg |
Place of Publication |
Editor |
Language |
Summary Language |
Original Title |
Series Editor |
Series Title |
Abbreviated Series Title |
Series Volume |
Series Issue |
Edition |
ISSN |
1433-2833 |
Medium |
Area |
Expedition |
Conference |
Notes |
DAG; LAMP; 600.056; 600.061; 601.240; 601.223; 600.077; 600.079 |
Approved |
no |
Call Number |
Admin @ si @ RFK2014 |
Serial |
2523 |
Author |
Partha Pratim Roy; Umapada Pal; Josep Llados |

Title |
Recognition of Multi-oriented Touching Characters in Graphical Documents |
Type |
Conference Article |
Year |
2008 |
Publication |
Sixth Indian Conference on Computer Vision, Graphics & Image Processing, 2008 |
Abbreviated Journal |
Volume  |
16 |
Issue |
Pages |
297–304 |
Keywords |
Abstract |
Address |
Corporate Author |
Thesis |
Publisher |
Place of Publication |
Editor |
Language |
Summary Language |
Original Title |
Series Editor |
Series Title |
Abbreviated Series Title |
Series Volume |
Series Issue |
Edition |
Medium |
Area |
Expedition |
Conference |
ICVGIP ’08 |
Notes |
Approved |
no |
Call Number |
DAG @ dag @ RPL2008c |
Serial |
1080 |
Author |
Alicia Fornes; Anjan Dutta; Albert Gordo; Josep Llados |

Title |
CVC-MUSCIMA: A Ground-Truth of Handwritten Music Score Images for Writer Identification and Staff Removal |
Type |
Journal Article |
Year |
2012 |
Publication |
International Journal on Document Analysis and Recognition |
Abbreviated Journal |
Volume  |
15 |
Issue |
3 |
Pages |
243-251 |
Keywords |
Music scores; Handwritten documents; Writer identification; Staff removal; Performance evaluation; Graphics recognition; Ground truths |
Abstract |
The analysis of music scores has been an active research field in the last decades. However, there are no publicly available databases of handwritten music scores for the research community. In this paper we present the CVC-MUSCIMA database and ground-truth of handwritten music score images. The dataset consists of 1,000 music sheets written by 50 different musicians. It has been especially designed for writer identification and staff removal tasks. In addition to the description of the dataset, ground-truth, partitioning and evaluation metrics, we also provide some baseline results to ease the comparison between different approaches. |
Address |
Corporate Author |
Thesis |
Publisher |
Place of Publication |
Editor |
Language |
Summary Language |
Original Title |
Series Editor |
Series Title |
Abbreviated Series Title |
Series Volume |
Series Issue |
Edition |
ISSN |
1433-2833 |
Medium |
Area |
Expedition |
Conference |
Notes |
Approved |
no |
Call Number |
Admin @ si @ FDG2012 |
Serial |
2129 |