|
Alicia Fornes and Gemma Sanchez. 2014. Analysis and Recognition of Music Scores. In D. Doermann and K. Tombre, eds. Handbook of Document Image Processing and Recognition. Springer London, 749–774.
Abstract: The analysis and recognition of music scores has attracted the interest of researchers for decades. Optical Music Recognition (OMR) is a classical research field of Document Image Analysis and Recognition (DIAR), whose aim is to extract information from music scores. Music scores contain both graphical and textual information, and for this reason, techniques are closely related to graphics recognition and text recognition. Since music scores use a particular diagrammatic notation that follow the rules of music theory, many approaches make use of context information to guide the recognition and solve ambiguities. This chapter overviews the main Optical Music Recognition (OMR) approaches. Firstly, the different methods are grouped according to the OMR stages, namely, staff removal, music symbol recognition, and syntactical analysis. Secondly, specific approaches for old and handwritten music scores are reviewed. Finally, online approaches and commercial systems are also commented.
|
|
|
Salvatore Tabbone and Oriol Ramos Terrades. 2014. An Overview of Symbol Recognition. In D. Doermann and K. Tombre, eds. Handbook of Document Image Processing and Recognition. Springer London, 523–551.
Abstract: According to the Cambridge Dictionaries Online, a symbol is a sign, shape, or object that is used to represent something else. Symbol recognition is a subfield of general pattern recognition problems that focuses on identifying, detecting, and recognizing symbols in technical drawings, maps, or miscellaneous documents such as logos and musical scores. This chapter aims at providing the reader an overview of the different existing ways of describing and recognizing symbols and how the field has evolved to attain a certain degree of maturity.
Keywords: Pattern recognition; Shape descriptors; Structural descriptors; Symbolrecognition; Symbol spotting
|
|
|
Dimosthenis Karatzas, Sergi Robles and Lluis Gomez. 2014. An on-line platform for ground truthing and performance evaluation of text extraction systems. 11th IAPR International Workshop on Document Analysis and Systems.242–246.
Abstract: This paper presents a set of on-line software tools for creating ground truth and calculating performance evaluation metrics for text extraction tasks such as localization, segmentation and recognition. The platform supports the definition of comprehensive ground truth information at different text representation levels while it offers centralised management and quality control of the ground truthing effort. It implements a range of state of the art performance evaluation algorithms and offers functionality for the definition of evaluation scenarios, on-line calculation of various performance metrics and visualisation of the results. The
presented platform, which comprises the backbone of the ICDAR 2011 (challenge 1) and 2013 (challenges 1 and 2) Robust Reading competitions, is now made available for public use.
|
|
|
Joan Mas, Alicia Fornes and Josep Llados. 2016. An Interactive Transcription System of Census Records using Word-Spotting based Information Transfer. 12th IAPR Workshop on Document Analysis Systems.54–59.
Abstract: This paper presents a system to assist in the transcription of historical handwritten census records in a crowdsourcing platform. Census records have a tabular structured layout. They consist in a sequence of rows with information of homes ordered by street address. For each household snippet in the page, the list of family members is reported. The censuses are recorded in intervals of a few years and the information of individuals in each household is quite stable from a point in time to the next one. This redundancy is used to assist the transcriber, so the redundant information is transferred from the census already transcribed to the next one. Household records are aligned from one year to the next one using the knowledge of the ordering by street address. Given an already transcribed census, a query by string word spotting is applied. Thus, names from the census in time t are used as queries in the corresponding home record in time t+1. Since the search is constrained, the obtained precision-recall values are very high, with an important reduction in the transcription time. The proposed system has been tested in a real citizen-science experience where non expert users transcribe the census data of their home town.
|
|
|
Hongxing Gao, Marçal Rusiñol, Dimosthenis Karatzas, Apostolos Antonacopoulos and Josep Llados. 2013. An interactive appearance-based document retrieval system for historical newspapers. Proceedings of the International Conference on Computer Vision Theory and Applications.84–87.
Abstract: In this paper we present a retrieval-based application aimed at assisting a user to semi-automatically segment an incoming flow of historical newspaper images by automatically detecting a particular type of pages based on their appearance. A visual descriptor is used to assess page similarity while a relevance feedback process allow refining the results iteratively. The application is tested on a large dataset of digitised historic newspapers.
|
|
|
Joan Mas, Gemma Sanchez and Josep Llados. 2005. An Incremental Parser to Recognize Diagram Symbols and Gestures represented by Adjacency Grammars.
|
|
|
Joan Mas, Gemma Sanchez and Josep Llados. 2006. An Incremental Parser to Recognize Diagram Symbols and Gestures represented by Adjacency Grammars.
|
|
|
Joan Mas, Gemma Sanchez, Josep Llados and B. Lamiroy. 2007. An Incremental On-line Parsing Algorithm for Recognizing Sketching Diagrams. 9th IEEE International Conference on Document Analysis and Recognition.452–456.
|
|
|
Fernando Vilariño, Dimosthenis Karatzas, Marcos Catalan and Alberto Valcarcel. 2015. An horizon for the Public Library as a place for innovation and creativity. The Library Living Lab in Volpelleres. The White Book on Public Library Network from Diputació de Barcelona.
|
|
|
Mohamed Ali Souibgui, Pau Torras, Jialuo Chen and Alicia Fornes. 2023. An Evaluation of Handwritten Text Recognition Methods for Historical Ciphered Manuscripts. 7th International Workshop on Historical Document Imaging and Processing.7–12.
Abstract: This paper investigates the effectiveness of different deep learning HTR families, including LSTM, Seq2Seq, and transformer-based approaches with self-supervised pretraining, in recognizing ciphered manuscripts from different historical periods and cultures. The goal is to identify the most suitable method or training techniques for recognizing ciphered manuscripts and to provide insights into the challenges and opportunities in this field of research. We evaluate the performance of these models on several datasets of ciphered manuscripts and discuss their results. This study contributes to the development of more accurate and efficient methods for recognizing historical manuscripts for the preservation and dissemination of our cultural heritage.
|
|