|
Anton Cervantes, Gemma Sanchez, Josep Llados, Agnes Borras and A. Rodriguez. 2005. Biometric Recognition Based on Line Shape Descriptors. Sixth IAPR International Workshop on Graphics Recognition (GREC 2005).335–344.
|
|
|
Joan Mas, Gemma Sanchez and Josep Llados. 2005. An Incremental Parser to Recognize Diagram Symbols and Gestures represented by Adjacency Grammars.
|
|
|
N. Zakaria, Jean-Marc Ogier and Josep Llados. 2005. On-line Graphics Recognition based on Invariant Spatio-Sequential Descriptor: Fuzzy Matrix.
|
|
|
W. Liu and Josep Llados. 2006. Graphics Recognition. Ten Years Review and Future Perspectives. (LNCS.)
|
|
|
Alicia Fornes, Josep Llados and Gemma Sanchez. 2005. Primitive Segmentation in Old Handwritten Music Scores.
|
|
|
Lluis Gomez, Y. Patel, Marçal Rusiñol, C.V. Jawahar and Dimosthenis Karatzas. 2017. Self‐supervised learning of visual features through embedding images into text topic spaces. 30th IEEE Conference on Computer Vision and Pattern Recognition.
Abstract: End-to-end training from scratch of current deep architectures for new computer vision problems would require Imagenet-scale datasets, and this is not always possible. In this paper we present a method that is able to take advantage of freely available multi-modal content to train computer vision algorithms without human supervision. We put forward the idea of performing self-supervised learning of visual features by mining a large scale corpus of multi-modal (text and image) documents. We show that discriminative visual features can be learnt efficiently by training a CNN to predict the semantic context in which a particular image is more probable to appear as an illustration. For this we leverage the hidden semantic structures discovered in the text corpus with a well-known topic modeling technique. Our experiments demonstrate state of the art performance in image classification, object detection, and multi-modal retrieval compared to recent self-supervised or natural-supervised approaches.
|
|
|
Utkarsh Porwal, Alicia Fornes and Faisal Shafait, eds. 2022. Frontiers in Handwriting Recognition. International Conference on Frontiers in Handwriting Recognition. 18th International Conference, ICFHR 2022. Springer. (LNCS.)
|
|
|
Dimosthenis Karatzas and Ch. Lioutas. 1998. Software Package Development for Electron Diffraction Image Analysis. Proceedings of the XIV Solid State Physics National Conference.
|
|
|
Anjan Dutta, Umapada Pal, Alicia Fornes and Josep Llados. 2010. An Efficient Staff Removal Technique from Printed Musical Documents. 20th International Conference on Pattern Recognition.1965–1968.
Abstract: Staff removal is an important preprocessing step of the Optical Music Recognition (OMR). The process aims to remove the stafflines from a musical document and retain only the musical symbols, later these symbols are used effectively to identify the music information. This paper proposes a simple but robust method to remove stafflines from printed musical scores. In the proposed methodology we have considered a staffline segment as a horizontal linkage of vertical black runs with uniform height. We have used the neighbouring properties of a staffline segment to validate it as a true segment. We have considered the dataset along with the deformations described in for evaluation purpose. From experimentation we have got encouraging results.
|
|
|
Alicia Fornes, Sergio Escalera, Josep Llados and Ernest Valveny. 2010. Symbol Classification using Dynamic Aligned Shape Descriptor. 20th International Conference on Pattern Recognition.1957–1960.
Abstract: Shape representation is a difficult task because of several symbol distortions, such as occlusions, elastic deformations, gaps or noise. In this paper, we propose a new descriptor and distance computation for coping with the problem of symbol recognition in the domain of Graphical Document Image Analysis. The proposed D-Shape descriptor encodes the arrangement information of object parts in a circular structure, allowing different levels of distortion. The classification is performed using a cyclic Dynamic Time Warping based method, allowing distortions and rotation. The methodology has been validated on different data sets, showing very high recognition rates.
|
|