|
Andreas Fischer, Volkmar Frinken, Alicia Fornes and Horst Bunke. 2011. Transcription Alignment of Latin Manuscripts Using Hidden Markov Models. Proceedings of the 2011 Workshop on Historical Document Imaging and Processing. ACM, 29–36.
Abstract: Transcriptions of historical documents are a valuable source for extracting labeled handwriting images that can be used for training recognition systems. In this paper, we introduce the Saint Gall database that includes images as well as the transcription of a Latin manuscript from the 9th century written in Carolingian script. Although the available transcription is of high quality for a human reader, the spelling of the words is not accurate when compared with the handwriting image. Hence, the transcription poses several challenges for alignment regarding, e.g., line breaks, abbreviations, and capitalization. We propose an alignment system based on character Hidden Markov Models that can cope with these challenges and efficiently aligns complete document pages. On the Saint Gall database, we demonstrate that a considerable alignment accuracy can be achieved, even with weakly trained character models.
|
|
|
Mohamed Ali Souibgui, Asma Bensalah, Jialuo Chen, Alicia Fornes and Michelle Waldispühl. 2023. A User Perspective on HTR methods for the Automatic Transcription of Rare Scripts: The Case of Codex Runicus Just Accepted. JOCCH, 15(4), 1–18.
Abstract: Recent breakthroughs in Artificial Intelligence, Deep Learning and Document Image Analysis and Recognition have significantly eased the creation of digital libraries and the transcription of historical documents. However, for documents in rare scripts with few labelled training data available, current Handwritten Text Recognition (HTR) systems are too constraint. Moreover, research on HTR often focuses on technical aspects only, and rarely puts emphasis on implementing software tools for scholars in Humanities. In this article, we describe, compare and analyse different transcription methods for rare scripts. We evaluate their performance in a real use case of a medieval manuscript written in the runic script (Codex Runicus) and discuss advantages and disadvantages of each method from the user perspective. From this exhaustive analysis and comparison with a fully manual transcription, we raise conclusions and provide recommendations to scholars interested in using automatic transcription tools.
|
|
|
Joana Maria Pujadas-Mora, Alicia Fornes, Josep Llados and Anna Cabre. 2016. Bridging the gap between historical demography and computing: tools for computer-assisted transcription and the analysis of demographic sources. In K.Matthijs, S.Hin, H.Matsuo and J.Kok, eds. The future of historical demography. Upside down and inside out. Acco Publishers, 127–131.
|
|
|
Josep Llados, J. Lopez-Krahe and Enric Marti. 1999. A Hough-based method for hatched pattern detection in maps and diagrams..
|
|
|
Josep Llados, Felipe Lumbreras and X. Varona. 1999. A multidocument platform for automatic reading of identity cards..
|
|
|
A. Pujol and 6 others. 1999. Real time pharmaceutical product recognition using color and shape indexing. Proceedings of the 2nd International Workshop on European Scientific and Industrial Collaboration (WESIC´99), Promotoring Advanced Technologies in Manufacturing..
|
|
|
Josep Llados, Gemma Sanchez and Enric Marti. 1997. A String-Based Method to Recognize Symbols and Structural Textures in Architectural Plans..
|
|
|
Jordi Vitria and 6 others. 1999. Real time recognition of pharmaceutical products by subspace methods.
|
|
|
V. Chapaprieta and Ernest Valveny. 2001. Handwritten Digit Recognition Using Point Distribution Models..
|
|
|
Josep Llados. 1996. Interpretacio de dibuixos linials fets a ma alçada mitjançant isomorfisme entre subgrafs i transformacio de Hough.
|
|