Publicacions CVC -- Query Results

[51–60] << 61 62 63 64 65 66 67 68 69 70 >> [71–74]

Details

	Records
	Author	Josep Llados; Ernest Valveny; Gemma Sanchez; Enric Marti
	Title	A Case Study of Pattern Recognition: Symbol Recognition in Graphic Documentsa			Type	Conference Article
	Year	2003	Publication	Proceedings of Pattern Recognition in Information Systems	Abbreviated Journal
	Volume		Issue		Pages	1-13
	Keywords
	Abstract
	Address	Angers, France
	Corporate Author				Thesis
	Publisher	ICEIS Press	Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN	972-98816-3-4	Medium
	Area		Expedition		Conference	PRIS'03
	Notes	DAG;IAM;			Approved	no
	Call Number	IAM @ iam @ LVS2003			Serial	1576
Permanent link to this record



	Author	Josep Llados; Jaime Lopez-Krahe; Enric Marti
	Title	Hand drawn document understanding using the straight line Hough transform and graph matching			Type	Conference Article
	Year	1996	Publication	Proceedings of the 13th International Pattern Recognition Conference (ICPR’96)	Abbreviated Journal
	Volume	2	Issue		Pages	497-501
	Keywords
	Abstract	This paper presents a system to understand hand drawn architectural drawings in a CAD environment. The procedure is to identify in a floor plan the building elements, stored in a library of patterns, and their spatial relationships. The vectorized input document and the patterns to recognize are represented by attributed graphs. To recognize the patterns as such, we apply a structural approach based on subgraph isomorphism techniques. In spite of their value, graph matching techniques do not recognize adequately those building elements characterized by hatching patterns, i.e. walls. Here we focus on the recognition of hatching patterns and develop a straight line Hough transform based method in order to detect the regions filled in with parallel straight fines. This allows not only to recognize filling patterns, but it actually reduces the computational load associated with the subgraph isomorphism computation. The result is that the document can be redrawn by editing all the patterns recognized
	Address
	Corporate Author				Thesis
	Publisher		Place of Publication	Vienna , Austria	Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference
	Notes	DAG;IAM;			Approved	no
	Call Number	IAM @ iam @ LLM1996			Serial	1579
Permanent link to this record



	Author	Alicia Fornes; Volkmar Frinken; Andreas Fischer; Jon Almazan; G. Jackson; Horst Bunke
	Title	A Keyword Spotting Approach Using Blurred Shape Model-Based Descriptors			Type	Conference Article
	Year	2011	Publication	Proceedings of the 2011 Workshop on Historical Document Imaging and Processing	Abbreviated Journal
	Volume		Issue		Pages	83-90
	Keywords
	Abstract	The automatic processing of handwritten historical documents is considered a hard problem in pattern recognition. In addition to the challenges given by modern handwritten data, a lack of training data as well as effects caused by the degradation of documents can be observed. In this scenario, keyword spotting arises to be a viable solution to make documents amenable for searching and browsing. For this task we propose the adaptation of shape descriptors used in symbol recognition. By treating each word image as a shape, it can be represented using the Blurred Shape Model and the De-formable Blurred Shape Model. Experiments on the George Washington database demonstrate that this approach is able to outperform the commonly used Dynamic Time Warping approach.
	Address
	Corporate Author				Thesis
	Publisher	ACM	Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN	978-1-4503-0916-5	Medium
	Area		Expedition		Conference	HIP
	Notes	DAG			Approved	no
	Call Number	Admin @ si @ FFF2011a			Serial	1823
Permanent link to this record



	Author	Andreas Fischer; Volkmar Frinken; Alicia Fornes; Horst Bunke
	Title	Transcription Alignment of Latin Manuscripts Using Hidden Markov Models			Type	Conference Article
	Year	2011	Publication	Proceedings of the 2011 Workshop on Historical Document Imaging and Processing	Abbreviated Journal
	Volume		Issue		Pages	29-36
	Keywords
	Abstract	Transcriptions of historical documents are a valuable source for extracting labeled handwriting images that can be used for training recognition systems. In this paper, we introduce the Saint Gall database that includes images as well as the transcription of a Latin manuscript from the 9th century written in Carolingian script. Although the available transcription is of high quality for a human reader, the spelling of the words is not accurate when compared with the handwriting image. Hence, the transcription poses several challenges for alignment regarding, e.g., line breaks, abbreviations, and capitalization. We propose an alignment system based on character Hidden Markov Models that can cope with these challenges and efficiently aligns complete document pages. On the Saint Gall database, we demonstrate that a considerable alignment accuracy can be achieved, even with weakly trained character models.
	Address
	Corporate Author				Thesis
	Publisher	ACM	Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	HIP
	Notes	DAG			Approved	no
	Call Number	Admin @ si @ FFF2011b			Serial	1824
Permanent link to this record



	Author	Debora Gil; Jordi Gonzalez; Gemma Sanchez (eds)
	Title	Computer Vision: Advances in Research and Development			Type	Book Whole
	Year	2007	Publication	Proceedings of the 2nd CVC International Workshop	Abbreviated Journal
	Volume		Issue		Pages
	Keywords
	Abstract
	Address
	Corporate Author				Thesis
	Publisher	UAB	Place of Publication	Bellaterra (Spain)	Editor	Debora Gil; Jordi Gonzalez; Gemma Sanchez
	Language		Summary Language		Original Title
	Series Editor		Series Title	2	Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN	978-84-935251-4-9	Medium
	Area		Expedition		Conference
	Notes	IAM; ISE; DAG			Approved	no
	Call Number	IAM @ iam @ GGS2007			Serial	1493
Permanent link to this record



	Author	A. Pujol; Jordi Vitria; Petia Radeva; Xavier Binefa; Robert Benavente; Ernest Valveny; Craig Von Land
	Title	Real time pharmaceutical product recognition using color and shape indexing.			Type	Conference Article
	Year	1999	Publication	Proceedings of the 2nd International Workshop on European Scientific and Industrial Collaboration (WESIC´99), Promotoring Advanced Technologies in Manufacturing.	Abbreviated Journal
	Volume		Issue		Pages
	Keywords
	Abstract
	Address	Wales
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference
	Notes	OR;MILAB;DAG;CIC;MV			Approved	no
	Call Number	BCNPCL @ bcnpcl @ PVR1999			Serial	24
Permanent link to this record



	Author	Mohamed Ali Souibgui; Sanket Biswas; Andres Mafla; Ali Furkan Biten; Alicia Fornes; Yousri Kessentini; Josep Llados; Lluis Gomez; Dimosthenis Karatzas
	Title	Text-DIAE: a self-supervised degradation invariant autoencoder for text recognition and document enhancement			Type	Conference Article
	Year	2023	Publication	Proceedings of the 37th AAAI Conference on Artificial Intelligence	Abbreviated Journal
	Volume	37	Issue	2	Pages
	Keywords	Representation Learning for Vision; CV Applications; CV Language and Vision; ML Unsupervised; Self-Supervised Learning
	Abstract	In this paper, we propose a Text-Degradation Invariant Auto Encoder (Text-DIAE), a self-supervised model designed to tackle two tasks, text recognition (handwritten or scene-text) and document image enhancement. We start by employing a transformer-based architecture that incorporates three pretext tasks as learning objectives to be optimized during pre-training without the usage of labelled data. Each of the pretext objectives is specifically tailored for the final downstream tasks. We conduct several ablation experiments that confirm the design choice of the selected pretext tasks. Importantly, the proposed model does not exhibit limitations of previous state-of-the-art methods based on contrastive losses, while at the same time requiring substantially fewer data samples to converge. Finally, we demonstrate that our method surpasses the state-of-the-art in existing supervised and self-supervised settings in handwritten and scene text recognition and document image enhancement. Our code and trained models will be made publicly available at https://github.com/dali92002/SSL-OCR
	Address
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	AAAI
	Notes	DAG			Approved	no
	Call Number	Admin @ si @ SBM2023			Serial	3848
Permanent link to this record



	Author	Khanh Nguyen; Ali Furkan Biten; Andres Mafla; Lluis Gomez; Dimosthenis Karatzas
	Title	Show, Interpret and Tell: Entity-Aware Contextualised Image Captioning in Wikipedia			Type	Conference Article
	Year	2023	Publication	Proceedings of the 37th AAAI Conference on Artificial Intelligence	Abbreviated Journal
	Volume	37	Issue	2	Pages	1940-1948
	Keywords
	Abstract	Humans exploit prior knowledge to describe images, and are able to adapt their explanation to specific contextual information given, even to the extent of inventing plausible explanations when contextual information and images do not match. In this work, we propose the novel task of captioning Wikipedia images by integrating contextual knowledge. Specifically, we produce models that jointly reason over Wikipedia articles, Wikimedia images and their associated descriptions to produce contextualized captions. The same Wikimedia image can be used to illustrate different articles, and the produced caption needs to be adapted to the specific context allowing us to explore the limits of the model to adjust captions to different contextual information. Dealing with out-of-dictionary words and Named Entities is a challenging task in this domain. To address this, we propose a pre-training objective, Masked Named Entity Modeling (MNEM), and show that this pretext task results to significantly improved models. Furthermore, we verify that a model pre-trained in Wikipedia generalizes well to News Captioning datasets. We further define two different test splits according to the difficulty of the captioning task. We offer insights on the role and the importance of each modality and highlight the limitations of our model.
	Address	Washington; USA; February 2023
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	AAAI
	Notes	DAG			Approved	no
	Call Number	Admin @ si @ NBM2023			Serial	3860
Permanent link to this record



	Author	Partha Pratim Roy; Umapada Pal; Josep Llados
	Title	Multi-oriented English Text Line Extraction using Background and Foreground Information			Type	Conference Article
	Year	2008	Publication	Proceedings of the 8th IAPR International Workshop on Document Analysis Systems,	Abbreviated Journal
	Volume		Issue		Pages	315–322
	Keywords
	Abstract
	Address	Nara (Japo)
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	DAS
	Notes	DAG			Approved	no
	Call Number	DAG @ dag @ RPL2008b			Serial	1047
Permanent link to this record



	Author	Marçal Rusiñol; Josep Llados
	Title	Word and Symbol Spotting using Spatial Organization of Local Descriptors			Type	Conference Article
	Year	2008	Publication	Proceedings of the 8th IAPR International Workshop on Document Analysis Systems,	Abbreviated Journal
	Volume		Issue		Pages	489–496
	Keywords
	Abstract
	Address	Nara (Japan)
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	DAS
	Notes	DAG			Approved	no
	Call Number	DAG @ dag @ RuL2008b			Serial	1059
Permanent link to this record

Select All Deselect All

[51–60] << 61 62 63 64 65 66 67 68 69 70 >> [71–74]

List View

Citations

Details

All Found Records Selected Records:

Save Citations: Format:

Export Records: Format: