|
Oriol Ramos Terrades. 2006. Linear Combination of Multiresolution Descriptors: Application to Graphics Recognition. (Ph.D. thesis, .)
|
|
|
Oriol Ramos Terrades. 2003. Descripcio i classificacio de simbols tecnics usant la transformada de crestetes.
|
|
|
Umapada Pal, Partha Pratim Roy, N. Tripathya and Josep Llados. 2010. Multi-oriented Bangla and Devnagari text recognition. PR, 43(12), 4124–4136.
Abstract: There are printed complex documents where text lines of a single page may have different orientations or the text lines may be curved in shape. As a result, it is difficult to detect the skew of such documents and hence character segmentation and recognition of such documents are a complex task. In this paper, using background and foreground information we propose a novel scheme towards the recognition of Indian complex documents of Bangla and Devnagari script. In Bangla and Devnagari documents usually characters in a word touch and they form cavity regions. To take care of these cavity regions, background information of such documents is used. Convex hull and water reservoir principle have been applied for this purpose. Here, at first, the characters are segmented from the documents using the background information of the text. Next, individual characters are recognized using rotation invariant features obtained from the foreground part of the characters.
For character segmentation, at first, writing mode of a touching component (word) is detected using water reservoir principle based features. Next, depending on writing mode and the reservoir base-region of the touching component, a set of candidate envelope points is then selected from the contour points of the component. Based on these candidate points, the touching component is finally segmented into individual characters. For recognition of multi-sized/multi-oriented characters the features are computed from different angular information obtained from the external and internal contour pixels of the characters. These angular information are computed in such a way that they do not depend on the size and rotation of the characters. Circular and convex hull rings have been used to divide a character into smaller zones to get zone-wise features for higher recognition results. We combine circular and convex hull features to improve the results and these features are fed to support vector machines (SVM) for recognition. From our experiment we obtained recognition results of 99.18% (98.86%) accuracy when tested on 7515 (7874) Devnagari (Bangla) characters.
|
|
|
Farshad Nourbakhsh, Dimosthenis Karatzas and Ernest Valveny. 2010. A polar-based logo representation based on topological and colour features. 9th IAPR International Workshop on Document Analysis Systems.341–348.
Abstract: In this paper, we propose a novel rotation and scale invariant method for colour logo retrieval and classification, which involves performing a simple colour segmentation and subsequently describing each of the resultant colour components based on a set of topological and colour features. A polar representation is used to represent the logo and the subsequent logo matching is based on Cyclic Dynamic Time Warping (CDTW). We also show how combining information about the global distribution of the logo components and their local neighbourhood using the Delaunay triangulation allows to improve the results. All experiments are performed on a dataset of 2500 instances of 100 colour logo images in different rotations and scales.
|
|
|
Joan Mas, Gemma Sanchez and Josep Llados. 2009. SSP: Sketching slide Presentations, a Syntactic Approach. 8th IAPR International Workshop on Graphics Recognition.
Abstract: The design of a slide presentation is a creative process. In this process first, humans visualize in their minds what they want to explain. Then, they have to be able to represent this knowledge in an understandable way. There exists a lot of commercial software that allows to create our own slide presentations but the creativity of the user is rather limited. In this article we present an application that allows the user to create and visualize a slide presentation from a sketch. A slide may be seen as a graphical document or a diagram where its elements are placed in a particular spatial arrangement. To describe and recognize slides a syntactic approach is proposed. This approach is based on an Adjacency Grammar and a parsing methodology to cope with this kind of grammars. The experimental evaluation shows the performance of our methodology from a qualitative and a quantitative point of view. Six different slides containing different number of symbols, from 4 to 7, have been given to the users and they have drawn them without restrictions in the order of the elements. The quantitative results give an idea on how suitable is our methodology to describe and recognize the different elements in a slide.
|
|
|
Joan Mas, Gemma Sanchez, Josep Llados and B. Lamiroy. 2007. An Incremental On-line Parsing Algorithm for Recognizing Sketching Diagrams. 9th IEEE International Conference on Document Analysis and Recognition.452–456.
|
|
|
Joan Mas, Gemma Sanchez and Josep Llados. 2006. An Incremental Parser to Recognize Diagram Symbols and Gestures represented by Adjacency Grammars.
|
|
|
Joan Mas, Gemma Sanchez and Josep Llados. 2005. An Incremental Parser to Recognize Diagram Symbols and Gestures represented by Adjacency Grammars.
|
|
|
Joan Mas, Gemma Sanchez and Josep Llados. 2005. An Adjacency Grammar to Recognize Symbols and Gestures in a Digital Pen Framework. Pattern Recognition and Image Analysis (IbPRIA 2005), LNCS 3523: 115–122.
|
|
|
Joan Mas, Jose Antonio Rodriguez, Dimosthenis Karatzas, Gemma Sanchez and Josep Llados. 2008. HistoSketch: A Semi-Automatic Annotation Tool for Archival Documents. Proceedings of the 8th International Workshop on Document Analysis Systems,.517–524.
|
|