Publicacions CVC -- Query Results

[11–20] << 21 22 23 24 25 26 27 28 29 30 >> [31–40]

Details

	Records
	Author	Jose Antonio Rodriguez; Gemma Sanchez; Josep Llados
	Title	Rejection strategies involving classifier combination for handwriting recognition			Type	Book Chapter
	Year	2007	Publication	3rd Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2007), J. Marti et al. (Eds.) LNCS 4478:97–104	Abbreviated Journal
	Volume		Issue		Pages
	Keywords
	Abstract
	Address	Girona (Spain)
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference
	Notes	DAG			Approved	no
	Call Number	DAG @ dag @ RSL2007a			Serial	777
Permanent link to this record



	Author	Carlos David Martinez Hinarejos; Josep Llados; Alicia Fornes; Francisco Casacuberta; Lluis de Las Heras; Joan Mas; Moises Pastor; Oriol Ramos Terrades; Joan Andreu Sanchez; Enrique Vidal; Fernando Vilariño
	Title	Context, multimodality, and user collaboration in handwritten text processing: the CoMUN-HaT project			Type	Conference Article
	Year	2016	Publication	3rd IberSPEECH	Abbreviated Journal
	Volume		Issue		Pages
	Keywords
	Abstract	Processing of handwritten documents is a task that is of wide interest for many purposes, such as those related to preserve cultural heritage. Handwritten text recognition techniques have been successfully applied during the last decade to obtain transcriptions of handwritten documents, and keyword spotting techniques have been applied for searching specific terms in image collections of handwritten documents. However, results on transcription and indexing are far from perfect. In this framework, the use of new data sources arises as a new paradigm that will allow for a better transcription and indexing of handwritten documents. Three main different data sources could be considered: context of the document (style, writer, historical time, topics,. . . ), multimodal data (representations of the document in a different modality, such as the speech signal of the dictation of the text), and user feedback (corrections, amendments,. . . ). The CoMUN-HaT project aims at the integration of these different data sources into the transcription and indexing task for handwritten documents: the use of context derived from the analysis of the documents, how multimodality can aid the recognition process to obtain more accurate transcriptions (including transcription in a modern version of the language), and integration into a userin-the-loop assisted text transcription framework. This will be reflected in the construction of a transcription and indexing platform that can be used by both professional and nonprofessional users, contributing to crowd-sourcing activities to preserve cultural heritage and to obtain an accessible version of the involved corpus.
	Address	Lisboa; Portugal; November 2016
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	IberSPEECH
	Notes	DAG; MV; 600.097;SIAI			Approved	no
	Call Number	Admin @ si @MLF2016			Serial	2813
Permanent link to this record



	Author	Agnes Borras; Josep Llados
	Title	A Multi-Scale Layout Descriptor Based on Delaunay Triangulation for Image Retrieval			Type	Conference Article
	Year	2008	Publication	3rd International Conference on Computer Vision Theory and Applications VISAPP (2) 2008	Abbreviated Journal
	Volume	2	Issue		Pages	139-144
	Keywords
	Abstract
	Address	Funchal, Madeira (Portugal)
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference
	Notes	DAG			Approved	no
	Call Number	DAG @ dag @ BoL2008			Serial	981
Permanent link to this record



	Author	Arnau Baro; Jialuo Chen; Alicia Fornes; Beata Megyesi
	Title	Towards a generic unsupervised method for transcription of encoded manuscripts			Type	Conference Article
	Year	2019	Publication	3rd International Conference on Digital Access to Textual Cultural Heritage	Abbreviated Journal
	Volume		Issue		Pages	73-78
	Keywords	A. Baró, J. Chen, A. Fornés, B. Megyesi.
	Abstract	Historical ciphers, a special type of manuscripts, contain encrypted information, important for the interpretation of our history. The first step towards decipherment is to transcribe the images, either manually or by automatic image processing techniques. Despite the improvements in handwritten text recognition (HTR) thanks to deep learning methodologies, the need of labelled data to train is an important limitation. Given that ciphers often use symbol sets across various alphabets and unique symbols without any transcription scheme available, these supervised HTR techniques are not suitable to transcribe ciphers. In this paper we propose an un-supervised method for transcribing encrypted manuscripts based on clustering and label propagation, which has been successfully applied to community detection in networks. We analyze the performance on ciphers with various symbol sets, and discuss the advantages and drawbacks compared to supervised HTR methods.
	Address	Brussels; May 2019
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	DATeCH
	Notes	DAG; 600.097; 600.140; 600.121			Approved	no
	Call Number	Admin @ si @ BCF2019			Serial	3276
Permanent link to this record



	Author	Jialuo Chen; M.A.Souibgui; Alicia Fornes; Beata Megyesi
	Title	A Web-based Interactive Transcription Tool for Encrypted Manuscripts			Type	Conference Article
	Year	2020	Publication	3rd International Conference on Historical Cryptology	Abbreviated Journal
	Volume		Issue		Pages	52-59
	Keywords
	Abstract	Manual transcription of handwritten text is a time consuming task. In the case of encrypted manuscripts, the recognition is even more complex due to the huge variety of alphabets and symbol sets. To speed up and ease this process, we present a web-based tool aimed to (semi)-automatically transcribe the encrypted sources. The user uploads one or several images of the desired encrypted document(s) as input, and the system returns the transcription(s). This process is carried out in an interactive fashion with the user to obtain more accurate results. For discovering and testing, the developed web tool is freely available.
	Address	Virtual; June 2020
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	HistoCrypt
	Notes	DAG; 600.140; 602.230; 600.121			Approved	no
	Call Number	Admin @ si @ CSF2020			Serial	3447
Permanent link to this record



	Author	Arnau Baro; Carles Badal; Pau Torras; Alicia Fornes
	Title	Handwritten Historical Music Recognition through Sequence-to-Sequence with Attention Mechanism			Type	Conference Article
	Year	2022	Publication	3rd International Workshop on Reading Music Systems (WoRMS2021)	Abbreviated Journal
	Volume		Issue		Pages	55-59
	Keywords	Optical Music Recognition; Digits; Image Classification
	Abstract	Despite decades of research in Optical Music Recognition (OMR), the recognition of old handwritten music scores remains a challenge because of the variabilities in the handwriting styles, paper degradation, lack of standard notation, etc. Therefore, the research in OMR systems adapted to the particularities of old manuscripts is crucial to accelerate the conversion of music scores existing in archives into digital libraries, fostering the dissemination and preservation of our music heritage. In this paper we explore the adaptation of sequence-to-sequence models with attention mechanism (used in translation and handwritten text recognition) and the generation of specific synthetic data for recognizing old music scores. The experimental validation demonstrates that our approach is promising, especially when compared with long short-term memory neural networks.
	Address	July 23, 2021, Alicante (Spain)
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	WoRMS
	Notes	DAG; 600.121; 600.162; 602.230; 600.140			Approved	no
	Call Number	Admin @ si @ BBT2022			Serial	3734
Permanent link to this record



	Author	Ali Furkan Biten; Ruben Tito; Andres Mafla; Lluis Gomez; Marçal Rusiñol; M. Mathew; C.V. Jawahar; Ernest Valveny; Dimosthenis Karatzas
	Title	ICDAR 2019 Competition on Scene Text Visual Question Answering			Type	Conference Article
	Year	2019	Publication	3rd Workshop on Closing the Loop Between Vision and Language, in conjunction with ICCV2019	Abbreviated Journal
	Volume		Issue		Pages
	Keywords
	Abstract	This paper presents final results of ICDAR 2019 Scene Text Visual Question Answering competition (ST-VQA). ST-VQA introduces an important aspect that is not addressed by any Visual Question Answering system up to date, namely the incorporation of scene text to answer questions asked about an image. The competition introduces a new dataset comprising 23, 038 images annotated with 31, 791 question / answer pairs where the answer is always grounded on text instances present in the image. The images are taken from 7 different public computer vision datasets, covering a wide range of scenarios. The competition was structured in three tasks of increasing difficulty, that require reading the text in a scene and understanding it in the context of the scene, to correctly answer a given question. A novel evaluation metric is presented, which elegantly assesses both key capabilities expected from an optimal model: text recognition and image understanding. A detailed analysis of results from different participants is showcased, which provides insight into the current capabilities of VQA systems that can read. We firmly believe the dataset proposed in this challenge will be an important milestone to consider towards a path of more robust and general models that can exploit scene text to achieve holistic image understanding.
	Address	Sydney; Australia; September 2019
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	CLVL
	Notes	DAG; 600.129; 601.338; 600.135; 600.121			Approved	no
	Call Number	Admin @ si @ BTM2019a			Serial	3284
Permanent link to this record



	Author	Josep Llados
	Title	The 5G of Document Intelligence			Type	Conference Article
	Year	2021	Publication	3rd Workshop on Future of Document Analysis and Recognition	Abbreviated Journal
	Volume		Issue		Pages
	Keywords
	Abstract
	Address	Lausanne; Suissa; September 2021
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	FDAR
	Notes	DAG			Approved	no
	Call Number	Admin @ si @			Serial	3677
Permanent link to this record



	Author	Lei Kang; Juan Ignacio Toledo; Pau Riba; Mauricio Villegas; Alicia Fornes; Marçal Rusiñol
	Title	Convolve, Attend and Spell: An Attention-based Sequence-to-Sequence Model for Handwritten Word Recognition			Type	Conference Article
	Year	2018	Publication	40th German Conference on Pattern Recognition	Abbreviated Journal
	Volume		Issue		Pages	459-472
	Keywords
	Abstract	This paper proposes Convolve, Attend and Spell, an attention based sequence-to-sequence model for handwritten word recognition. The proposed architecture has three main parts: an encoder, consisting of a CNN and a bi-directional GRU, an attention mechanism devoted to focus on the pertinent features and a decoder formed by a one-directional GRU, able to spell the corresponding word, character by character. Compared with the recent state-of-the-art, our model achieves competitive results on the IAM dataset without needing any pre-processing step, predefined lexicon nor language model. Code and additional results are available in https://github.com/omni-us/research-seq2seq-HTR.
	Address	Stuttgart; Germany; October 2018
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	GCPR
	Notes	DAG; 600.097; 603.057; 302.065; 601.302; 600.084; 600.121; 600.129			Approved	no
	Call Number	Admin @ si @ KTR2018			Serial	3167
Permanent link to this record



	Author	Miquel Ferrer; Ernest Valveny; F. Serratosa
	Title	Median Graph Computation by means of a Genetic Approach Based on Minimum Common Supergraph and Maximum Common Subraph			Type	Conference Article
	Year	2009	Publication	4th Iberian Conference on Pattern Recognition and Image Analysis	Abbreviated Journal
	Volume	5524	Issue		Pages	346–353
	Keywords
	Abstract	Given a set of graphs, the median graph has been theoretically presented as a useful concept to infer a representative of the set. However, the computation of the median graph is a highly complex task and its practical application has been very limited up to now. In this work we present a new genetic algorithm for the median graph computation. A set of experiments on real data, where none of the existing algorithms for the median graph computation could be applied up to now due to their computational complexity, show that we obtain good approximations of the median graph. Finally, we use the median graph in a real nearest neighbour classification showing that it leaves the box of the only-theoretical concepts and demonstrating, from a practical point of view, that can be a useful tool to represent a set of graphs.
	Address	Póvoa de Varzim, Portugal
	Corporate Author				Thesis
	Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title	LNCS
	Series Volume		Series Issue		Edition
	ISSN	0302-9743	ISBN	978-3-642-02171-8	Medium
	Area		Expedition		Conference	IbPRIA
	Notes	DAG			Approved	no
	Call Number	DAG @ dag @ FVS2009c			Serial	1174
Permanent link to this record

Select All Deselect All

[11–20] << 21 22 23 24 25 26 27 28 29 30 >> [31–40]

List View

Citations

Details

All Found Records Selected Records:

Save Citations: Format:

Export Records: Format: