|
Gemma Sanchez and Josep Llados. 2001. A Graph Grammar to Recognize Textured Symbols..
|
|
|
David Fernandez, Josep Llados and Alicia Fornes. 2014. A graph-based approach for segmenting touching lines in historical handwritten documents. IJDAR, 17(3), 293–312.
Abstract: Text line segmentation in handwritten documents is an important task in the recognition of historical documents. Handwritten document images contain text lines with multiple orientations, touching and overlapping characters between consecutive text lines and different document structures, making line segmentation a difficult task. In this paper, we present a new approach for handwritten text line segmentation solving the problems of touching components, curvilinear text lines and horizontally overlapping components. The proposed algorithm formulates line segmentation as finding the central path in the area between two consecutive lines. This is solved as a graph traversal problem. A graph is constructed using the skeleton of the image. Then, a path-finding algorithm is used to find the optimum path between text lines. The proposed algorithm has been evaluated on a comprehensive dataset consisting of five databases: ICDAR2009, ICDAR2013, UMD, the George Washington and the Barcelona Marriages Database. The proposed method outperforms the state-of-the-art considering the different types and difficulties of the benchmarking data.
Keywords: Text line segmentation; Handwritten documents; Document image processing; Historical document analysis
|
|
|
Josep Llados and Enric Marti. 1999. A graph-edit algorithm for hand-drawn graphical document recognition and their automatic introduction into CAD systems. Machine Graphics & Vision, 8, 195–211.
|
|
|
Josep Llados and Enric Marti. 1999. A graph-edit algorithm for hand-drawn graphical document recognition and their automatic introduction into CAD systems..
|
|
|
Josep Llados, Enric Marti and Jaime Lopez-Krahe. 1999. A Hough-based method for hatched pattern detection in maps and diagrams. Proceeding of the Fifth Int. Conf. Document Analysis and Recognition ICDAR ’99.479–482.
Abstract: A hatched area is characterized by a set of parallel straight lines placed at regular intervals. In this paper, a Hough-based schema is introduced to recognize hatched areas in technical documents from attributed graph structures representing the document once it has been vectorized. Defining a Hough-based transform from a graph instead of the raster image allows to drastically reduce the processing time and, second, to obtain more reliable results because straight lines have already been detected in the vectorization step. A second advantage of the proposed method is that no assumptions must be made a priori about the slope and frequency of hatching patterns, but they are computed in run time for each hatched area.
|
|
|
Josep Llados, J. Lopez-Krahe and Enric Marti. 1999. A Hough-based method for hatched pattern detection in maps and diagrams..
|
|
|
Albert Gordo, Jaume Gibert, Ernest Valveny and Marçal Rusiñol. 2010. A Kernel-based Approach to Document Retrieval. 9th IAPR International Workshop on Document Analysis Systems.377–384.
Abstract: In this paper we tackle the problem of document image retrieval by combining a similarity measure between documents and the probability that a given document belongs to a certain class. The membership probability to a specific class is computed using Support Vector Machines in conjunction with similarity measure based kernel applied to structural document representations. In the presented experiments, we use different document representations, both visual and structural, and we apply them to a database of historical documents. We show how our method based on similarity kernels outperforms the usual distance-based retrieval.
|
|
|
Alicia Fornes, Volkmar Frinken, Andreas Fischer, Jon Almazan, G. Jackson and Horst Bunke. 2011. A Keyword Spotting Approach Using Blurred Shape Model-Based Descriptors. Proceedings of the 2011 Workshop on Historical Document Imaging and Processing. ACM, 83–90.
Abstract: The automatic processing of handwritten historical documents is considered a hard problem in pattern recognition. In addition to the challenges given by modern handwritten data, a lack of training data as well as effects caused by the degradation of documents can be observed. In this scenario, keyword spotting arises to be a viable solution to make documents amenable for searching and browsing. For this task we propose the adaptation of shape descriptors used in symbol recognition. By treating each word image as a shape, it can be represented using the Blurred Shape Model and the De-formable Blurred Shape Model. Experiments on the George Washington database demonstrate that this approach is able to outperform the commonly used Dynamic Time Warping approach.
|
|
|
Fernando Vilariño and Dimosthenis Karatzas. 2016. A Living Lab approach for Citizen Science in Libraries. 1st International ECSA Conference.
|
|
|
Gemma Sanchez, Josep Llados and K. Tombre. 2002. A mean string algorithm to compute the average among a set of 2D shapes. PRL, 23(1-3), 203–214.
|
|