Publicacions CVC -- Query Results

[21–30] << 31 32 33 34 35 36 37 38 39 40 >> [41–50]

Details

	Records
	Author	David Fernandez; Simone Marinai; Josep Llados; Alicia Fornes
	Title	Contextual Word Spotting in Historical Manuscripts using Markov Logic Networks			Type	Conference Article
	Year	2013	Publication	2nd International Workshop on Historical Document Imaging and Processing	Abbreviated Journal
	Volume		Issue		Pages	36-43
	Keywords
	Abstract	Natural languages can often be modelled by suitable grammars whose knowledge can improve the word spotting results. The implicit contextual information is even more useful when dealing with information that is intrinsically described as one collection of records. In this paper, we present one approach to word spotting which uses the contextual information of records to improve the results. The method relies on Markov Logic Networks to probabilistically model the relational organization of handwritten records. The performance has been evaluated on the Barcelona Marriages Dataset that contains structured handwritten records that summarize marriage information.
	Address	washington; USA; August 2013
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN	978-1-4503-2115-0	Medium
	Area		Expedition		Conference	HIP
	Notes	DAG; 600.056; 600.045; 600.061; 602.006			Approved	no
	Call Number	Admin @ si @ FML2013			Serial	2308
Permanent link to this record



	Author	Pau Riba; Jon Almazan; Alicia Fornes; David Fernandez; Ernest Valveny; Josep Llados
	Title	e-Crowds: a mobile platform for browsing and searching in historical demographyrelated manuscripts			Type	Conference Article
	Year	2014	Publication	14th International Conference on Frontiers in Handwriting Recognition	Abbreviated Journal
	Volume		Issue		Pages	228 - 233
	Keywords
	Abstract	This paper presents a prototype system running on portable devices for browsing and word searching through historical handwritten document collections. The platform adapts the paradigm of eBook reading, where the narrative is not necessarily sequential, but centered on the user actions. The novelty is to replace digitally born books by digitized historical manuscripts of marriage licenses, so document analysis tasks are required in the browser. With an active reading paradigm, the user can cast queries of people names, so he/she can implicitly follow genealogical links. In addition, the system allows combined searches: the user can refine a search by adding more words to search. As a second contribution, the retrieval functionality involves as a core technology a word spotting module with an unified approach, which allows combined query searches, and also two input modalities: query-by-example, and query-by-string.
	Address	Creete Island; Grecia; September 2014
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	2167-6445	ISBN	978-1-4799-4335-7	Medium
	Area		Expedition		Conference	ICFHR
	Notes	DAG; 600.056; 600.045; 600.061; 602.006; 600.077			Approved	no
	Call Number	Admin @ si @ RAF2014			Serial	2463
Permanent link to this record



	Author	Jon Almazan; Albert Gordo; Alicia Fornes; Ernest Valveny
	Title	Word Spotting and Recognition with Embedded Attributes			Type	Journal Article
	Year	2014	Publication	IEEE Transactions on Pattern Analysis and Machine Intelligence	Abbreviated Journal	TPAMI
	Volume	36	Issue	12	Pages	2552 - 2566
	Keywords
	Abstract	This article addresses the problems of word spotting and word recognition on images. In word spotting, the goal is to find all instances of a query word in a dataset of images. In recognition, the goal is to recognize the content of the word image, usually aided by a dictionary or lexicon. We describe an approach in which both word images and text strings are embedded in a common vectorial subspace. This is achieved by a combination of label embedding and attributes learning, and a common subspace regression. In this subspace, images and strings that represent the same word are close together, allowing one to cast recognition and retrieval tasks as a nearest neighbor problem. Contrary to most other existing methods, our representation has a fixed length, is low dimensional, and is very fast to compute and, especially, to compare. We test our approach on four public datasets of both handwritten documents and natural images showing results comparable or better than the state-of-the-art on spotting and recognition tasks.
	Address
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	0162-8828	ISBN		Medium
	Area		Expedition		Conference
	Notes	DAG; 600.056; 600.045; 600.061; 602.006; 600.077			Approved	no
	Call Number	Admin @ si @ AGF2014a			Serial	2483
Permanent link to this record



	Author	Volkmar Frinken; Andreas Fischer; Carlos David Martinez Hinarejos
	Title	Handwriting Recognition in Historical Documents using Very Large Vocabularies			Type	Conference Article
	Year	2013	Publication	2nd International Workshop on Historical Document Imaging and Processing	Abbreviated Journal
	Volume		Issue		Pages	67-72
	Keywords
	Abstract	Language models are used in automatic transcription system to resolve ambiguities. This is done by limiting the vocabulary of words that can be recognized as well as estimating the n-gram probability of the words in the given text. In the context of historical documents, a non-unified spelling and the limited amount of written text pose a substantial problem for the selection of the recognizable vocabulary as well as the computation of the word probabilities. In this paper we propose for the transcription of historical Spanish text to keep the corpus for the n-gram limited to a sample of the target text, but expand the vocabulary with words gathered from external resources. We analyze the performance of such a transcription system with different sizes of external vocabularies and demonstrate the applicability and the significant increase in recognition accuracy of using up to 300 thousand external words.
	Address	Washington; USA; August 2013
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN	978-1-4503-2115-0	Medium
	Area		Expedition		Conference	HIP
	Notes	DAG; 600.056; 600.045; 600.061; 602.006; 602.101			Approved	no
	Call Number	Admin @ si @ FFM2013			Serial	2296
Permanent link to this record



	Author	Marçal Rusiñol; V. Poulain d'Andecy; Dimosthenis Karatzas; Josep Llados
	Title	Classification of Administrative Document Images by Logo Identification			Type	Conference Article
	Year	2013	Publication	10th IAPR International Workshop on Graphics Recognition	Abbreviated Journal
	Volume		Issue		Pages
	Keywords
	Abstract	This paper is focused on the categorization of administrative document images (such as invoices) based on the recognition of the supplier's graphical logo. Two different methods are proposed, the first one uses a bag-of-visual-words model whereas the second one tries to locate logo images described by the blurred shape model descriptor within documents by a sliding-window technique. Preliminar results are reported with a dataset of real administrative documents.
	Address	Bethlehem; PA; USA; August 2013
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	GREC
	Notes	DAG; 600.056; 600.045; 605.203			Approved	no
	Call Number	Admin @ si @			Serial	2348
Permanent link to this record



	Author	Hongxing Gao; Marçal Rusiñol; Dimosthenis Karatzas; Apostolos Antonacopoulos; Josep Llados
	Title	An interactive appearance-based document retrieval system for historical newspapers			Type	Conference Article
	Year	2013	Publication	Proceedings of the International Conference on Computer Vision Theory and Applications	Abbreviated Journal
	Volume		Issue		Pages	84-87
	Keywords
	Abstract	In this paper we present a retrieval-based application aimed at assisting a user to semi-automatically segment an incoming flow of historical newspaper images by automatically detecting a particular type of pages based on their appearance. A visual descriptor is used to assess page similarity while a relevance feedback process allow refining the results iteratively. The application is tested on a large dataset of digitised historic newspapers.
	Address	Barcelona; February 2013
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	VISAPP
	Notes	DAG; 600.056; 600.045; 605.203			Approved	no
	Call Number	Admin @ si @ GRK2013a			Serial	2290
Permanent link to this record



	Author	Marçal Rusiñol; V. Poulain d'Andecy; Dimosthenis Karatzas; Josep Llados
	Title	Classification of Administrative Document Images by Logo Identification			Type	Book Chapter
	Year	2014	Publication	Graphics Recognition. Current Trends and Challenges	Abbreviated Journal
	Volume	8746	Issue		Pages	49-58
	Keywords	Administrative Document Classification; Logo Recognition; Logo Spotting
	Abstract	This paper is focused on the categorization of administrative document images (such as invoices) based on the recognition of the supplier’s graphical logo. Two different methods are proposed, the first one uses a bag-of-visual-words model whereas the second one tries to locate logo images described by the blurred shape model descriptor within documents by a sliding-window technique. Preliminar results are reported with a dataset of real administrative documents.
	Address
	Corporate Author				Thesis
	Publisher	Springer Berlin Heidelberg	Place of Publication		Editor	Bart Lamiroy; Jean-Marc Ogier
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	0302-9743	ISBN	978-3-662-44853-3	Medium
	Area		Expedition		Conference
	Notes	DAG; 600.056; 600.045; 605.203; 600.077			Approved	no
	Call Number	Admin @ si @ RPK2014			Serial	2701
Permanent link to this record



	Author	Antonio Clavelli; Dimosthenis Karatzas; Josep Llados; Mario Ferraro; Giuseppe Boccignone
	Title	Modelling task-dependent eye guidance to objects in pictures			Type	Journal Article
	Year	2014	Publication	Cognitive Computation	Abbreviated Journal	CoCom
	Volume	6	Issue	3	Pages	558-584
	Keywords	Visual attention; Gaze guidance; Value; Payoff; Stochastic fixation prediction
	Abstract	5Y Impact Factor: 1.14 / 3rd (Computer Science, Artificial Intelligence) We introduce a model of attentional eye guidance based on the rationale that the deployment of gaze is to be considered in the context of a general action-perception loop relying on two strictly intertwined processes: sensory processing, depending on current gaze position, identifies sources of information that are most valuable under the given task; motor processing links such information with the oculomotor act by sampling the next gaze position and thus performing the gaze shift. In such a framework, the choice of where to look next is task-dependent and oriented to classes of objects embedded within pictures of complex scenes. The dependence on task is taken into account by exploiting the value and the payoff of gazing at certain image patches or proto-objects that provide a sparse representation of the scene objects. The different levels of the action-perception loop are represented in probabilistic form and eventually give rise to a stochastic process that generates the gaze sequence. This way the model also accounts for statistical properties of gaze shifts such as individual scan path variability. Results of the simulations are compared either with experimental data derived from publicly available datasets and from our own experiments.
	Address
	Corporate Author				Thesis
	Publisher	Springer US	Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1866-9956	ISBN		Medium
	Area		Expedition		Conference
	Notes	DAG; 600.056; 600.045; 605.203; 601.212; 600.077			Approved	no
	Call Number	Admin @ si @ CKL2014			Serial	2419
Permanent link to this record



	Author	Hongxing Gao; Marçal Rusiñol; Dimosthenis Karatzas; Josep Llados
	Title	Fast Structural Matching for Document Image Retrieval through Spatial Databases			Type	Conference Article
	Year	2014	Publication	Document Recognition and Retrieval XXI	Abbreviated Journal
	Volume	9021	Issue		Pages
	Keywords	Document image retrieval; distance transform; MSER; spatial database
	Abstract	The structure of document images plays a signicant role in document analysis thus considerable eorts have been made towards extracting and understanding document structure, usually in the form of layout analysis approaches. In this paper, we rst employ Distance Transform based MSER (DTMSER) to eciently extract stable document structural elements in terms of a dendrogram of key-regions. Then a fast structural matching method is proposed to query the structure of document (dendrogram) based on a spatial database which facilitates the formulation of advanced spatial queries. The experiments demonstrate a signicant improvement in a document retrieval scenario when compared to the use of typical Bag of Words (BoW) and pyramidal BoW descriptors.
	Address	Amsterdam; September 2014
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	SPIE-DRR
	Notes	DAG; 600.056; 600.061; 600.077			Approved	no
	Call Number	Admin @ si @ GRK2014a			Serial	2496
Permanent link to this record



	Author	Hongxing Gao; Marçal Rusiñol; Dimosthenis Karatzas; Josep Llados
	Title	Embedding Document Structure to Bag-of-Words through Pair-wise Stable Key-regions			Type	Conference Article
	Year	2014	Publication	22nd International Conference on Pattern Recognition	Abbreviated Journal
	Volume		Issue		Pages	2903 - 2908
	Keywords
	Abstract	Since the document structure carries valuable discriminative information, plenty of efforts have been made for extracting and understanding document structure among which layout analysis approaches are the most commonly used. In this paper, Distance Transform based MSER (DTMSER) is employed to efficiently extract the document structure as a dendrogram of key-regions which roughly correspond to structural elements such as characters, words and paragraphs. Inspired by the Bag of Words (BoW) framework, we propose an efficient method for structural document matching by representing the document image as a histogram of key-region pairs encoding structural relationships. Applied to the scenario of document image retrieval, experimental results demonstrate a remarkable improvement when comparing the proposed method with typical BoW and pyramidal BoW methods.
	Address	Stockholm; Sweden; August 2014
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	ICPR
	Notes	DAG; 600.056; 600.061; 600.077			Approved	no
	Call Number	Admin @ si @ GRK2014b			Serial	2497
Permanent link to this record

Select All Deselect All

[21–30] << 31 32 33 34 35 36 37 38 39 40 >> [41–50]

List View

Citations

Details

All Found Records Selected Records:

Save Citations: Format:

Export Records: Format: