Publicacions CVC -- Query Results

Search & Display Options

Select All Deselect All

<< 1 2 3 4 5 6 7 8 9 10 >> [11–20]

|

Citations

|

	Debora Gil, Jordi Gonzalez and Gemma Sanchez, eds. 2007. Computer Vision: Advances in Research and Development. Bellaterra (Spain), UAB. (2.) Permanent link \| Save citation: RTF PDF LaTeX \| Export record: BibTeX Endnote ISI RIS Atom XML MODS XML ODF XML Word XML
	Alicia Fornes, Volkmar Frinken, Andreas Fischer, Jon Almazan, G. Jackson and Horst Bunke. 2011. A Keyword Spotting Approach Using Blurred Shape Model-Based Descriptors. Proceedings of the 2011 Workshop on Historical Document Imaging and Processing. ACM, 83–90. Abstract: The automatic processing of handwritten historical documents is considered a hard problem in pattern recognition. In addition to the challenges given by modern handwritten data, a lack of training data as well as effects caused by the degradation of documents can be observed. In this scenario, keyword spotting arises to be a viable solution to make documents amenable for searching and browsing. For this task we propose the adaptation of shape descriptors used in symbol recognition. By treating each word image as a shape, it can be represented using the Blurred Shape Model and the De-formable Blurred Shape Model. Experiments on the George Washington database demonstrate that this approach is able to outperform the commonly used Dynamic Time Warping approach. Permanent link \| Save citation: RTF PDF LaTeX \| Export record: BibTeX Endnote ISI RIS Atom XML MODS XML ODF XML Word XML
	Andreas Fischer, Volkmar Frinken, Alicia Fornes and Horst Bunke. 2011. Transcription Alignment of Latin Manuscripts Using Hidden Markov Models. Proceedings of the 2011 Workshop on Historical Document Imaging and Processing. ACM, 29–36. Abstract: Transcriptions of historical documents are a valuable source for extracting labeled handwriting images that can be used for training recognition systems. In this paper, we introduce the Saint Gall database that includes images as well as the transcription of a Latin manuscript from the 9th century written in Carolingian script. Although the available transcription is of high quality for a human reader, the spelling of the words is not accurate when compared with the handwriting image. Hence, the transcription poses several challenges for alignment regarding, e.g., line breaks, abbreviations, and capitalization. We propose an alignment system based on character Hidden Markov Models that can cope with these challenges and efficiently aligns complete document pages. On the Saint Gall database, we demonstrate that a considerable alignment accuracy can be achieved, even with weakly trained character models. Permanent link \| Save citation: RTF PDF LaTeX \| Export record: BibTeX Endnote ISI RIS Atom XML MODS XML ODF XML Word XML
	Josep Llados, Jaime Lopez-Krahe and Enric Marti. 1996. Hand drawn document understanding using the straight line Hough transform and graph matching. Proceedings of the 13th International Pattern Recognition Conference (ICPR’96). Vienna , Austria, 497–501. Abstract: This paper presents a system to understand hand drawn architectural drawings in a CAD environment. The procedure is to identify in a floor plan the building elements, stored in a library of patterns, and their spatial relationships. The vectorized input document and the patterns to recognize are represented by attributed graphs. To recognize the patterns as such, we apply a structural approach based on subgraph isomorphism techniques. In spite of their value, graph matching techniques do not recognize adequately those building elements characterized by hatching patterns, i.e. walls. Here we focus on the recognition of hatching patterns and develop a straight line Hough transform based method in order to detect the regions filled in with parallel straight fines. This allows not only to recognize filling patterns, but it actually reduces the computational load associated with the subgraph isomorphism computation. The result is that the document can be redrawn by editing all the patterns recognized Permanent link \| Save citation: RTF PDF LaTeX \| Export record: BibTeX Endnote ISI RIS Atom XML MODS XML ODF XML Word XML
	Josep Llados, Ernest Valveny, Gemma Sanchez and Enric Marti. 2003. A Case Study of Pattern Recognition: Symbol Recognition in Graphic Documentsa. Proceedings of Pattern Recognition in Information Systems. ICEIS Press, 1–13. Permanent link \| Save citation: RTF PDF LaTeX \| Export record: BibTeX Endnote ISI RIS Atom XML MODS XML ODF XML Word XML
	Gemma Sanchez and 6 others. 2003. A system for virtual prototyping of architectural projects. Proceedings of Fifth IAPR International Workshop on Pattern Recognition.65–74. Permanent link \| Save citation: RTF PDF LaTeX \| Export record: BibTeX Endnote ISI RIS Atom XML MODS XML ODF XML Word XML
	Gemma Sanchez and Josep Llados. 2003. Syntactic models to represent perceptually regular repetitive patterns in graphic documents. Permanent link \| Save citation: RTF PDF LaTeX \| Export record: BibTeX Endnote ISI RIS Atom XML MODS XML ODF XML Word XML
	Philippe Dosch and Josep Llados. 2003. Vectorial Signatures for Symbol Discrimination. Permanent link \| Save citation: RTF PDF LaTeX \| Export record: BibTeX Endnote ISI RIS Atom XML MODS XML ODF XML Word XML
	Josep Llados and Gemma Sanchez. 2003. Symbol Recognition Using Graphs. Permanent link \| Save citation: RTF PDF LaTeX \| Export record: BibTeX Endnote ISI RIS Atom XML MODS XML ODF XML Word XML
	Emanuele Vivoli, Ali Furkan Biten, Andres Mafla, Dimosthenis Karatzas and Lluis Gomez. 2022. MUST-VQA: MUltilingual Scene-text VQA. Proceedings European Conference on Computer Vision Workshops.345–358. (LNCS.) Abstract: In this paper, we present a framework for Multilingual Scene Text Visual Question Answering that deals with new languages in a zero-shot fashion. Specifically, we consider the task of Scene Text Visual Question Answering (STVQA) in which the question can be asked in different languages and it is not necessarily aligned to the scene text language. Thus, we first introduce a natural step towards a more generalized version of STVQA: MUST-VQA. Accounting for this, we discuss two evaluation scenarios in the constrained setting, namely IID and zero-shot and we demonstrate that the models can perform on a par on a zero-shot setting. We further provide extensive experimentation and show the effectiveness of adapting multilingual language models into STVQA tasks. Keywords: Visual question answering; Scene text; Translation robustness; Multilingual models; Zero-shot transfer; Power of language models Permanent link \| Save citation: RTF PDF LaTeX \| Export record: BibTeX Endnote ISI RIS Atom XML MODS XML ODF XML Word XML

Select All Deselect All

<< 1 2 3 4 5 6 7 8 9 10 >> [11–20]

|

Citations

|

Cite, Group & Export Options