Records |
Author |
Josep Llados |
Title |
Computer Vision: Progress of Research and Development |
Type |
Book Whole |
Year |
2006 |
Publication |
1st CVC Internal Workshop Computer Vision: Progress of Research and Development, |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
|
Keywords |
|
Abstract |
|
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
J. Llados (ed.), |
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN ![sorted by ISBN field, ascending order (up)](img/sort_asc.gif) |
84-933652-8-9 |
Medium |
|
Area |
|
Expedition |
|
Conference |
CVCRD |
Notes |
DAG |
Approved |
no |
Call Number |
DAG @ dag @ Lla2006b |
Serial |
766 |
Permanent link to this record |
|
|
|
Author |
Enric Marti; Jordi Vitria; Alberto Sanfeliu |
Title |
Reconocimiento de Formas y Análisis de Imágenes |
Type |
Book Whole |
Year |
1998 |
Publication |
Asociación Española de Reconocimientos de Formas y Análisis de Imágenes |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
|
Keywords |
|
Abstract |
Los sistemas actuales de reconocimiento automático del lenguaje oral se basan en dos etapas básicas de procesado: la parametrización, que extrae la evolución temporal de los parámetros que caracterizan la voz, y el reconocimiento propiamente dicho, que identifica la cadena de palabras de la elocución recibida con ayuda de los modelos que representan el conocimiento adquirido en la etapa de aprendizaje. Tomando como línea divisoria la palabra, dichos modelos son de tipo acústicofonético o gramatical. Los primeros caracterizan las palabras incluidas en el vocabulario de la aplicación o tarea a la que está orientado el sistema de reconocimiento, usando a menudo para ello modelos de unidades de habla de extensión inferior a la palabra, es decir, de unidades subléxicas. Por otro lado, la gramática incluye el conocimiento acerca de las combinaciones permitidas de palabras para formar las frases o su probabilidad. Queda fuera del esquema la denominada comprensión del habla, que utiliza adicionalmente el conocimiento semántico y pragmático para captar el significado de la elocución de entrada al sistema a partir de la cadena (o cadenas alternativas) de palabras que suministra el reconocedor. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
AERFAI |
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN ![sorted by ISBN field, ascending order (up)](img/sort_asc.gif) |
84–922529–4–4 |
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
IAM;OR;MV |
Approved |
no |
Call Number |
IAM @ iam @ MVS1998 |
Serial |
1620 |
Permanent link to this record |
|
|
|
Author |
Josep Llados; Ernest Valveny; Gemma Sanchez; Enric Marti |
Title |
A Case Study of Pattern Recognition: Symbol Recognition in Graphic Documentsa |
Type |
Conference Article |
Year |
2003 |
Publication |
Proceedings of Pattern Recognition in Information Systems |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
1-13 |
Keywords |
|
Abstract |
|
Address |
Angers, France |
Corporate Author |
|
Thesis |
|
Publisher |
ICEIS Press |
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN ![sorted by ISBN field, ascending order (up)](img/sort_asc.gif) |
972-98816-3-4 |
Medium |
|
Area |
|
Expedition |
|
Conference |
PRIS'03 |
Notes |
DAG;IAM; |
Approved |
no |
Call Number |
IAM @ iam @ LVS2003 |
Serial |
1576 |
Permanent link to this record |
|
|
|
Author |
Theo Gevers; Arjan Gijsenij; Joost Van de Weijer; J.M. Geusebroek |
Title |
Color in Computer Vision: Fundamentals and Applications |
Type |
Book Whole |
Year |
2012 |
Publication |
Color in Computer Vision: Fundamentals and Applications |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
|
Keywords |
|
Abstract |
|
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
The Wiley-IS&T Series in Imaging Science and Technology |
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN ![sorted by ISBN field, ascending order (up)](img/sort_asc.gif) |
978-0-470-89084-4 |
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
ALTRES;ISE |
Approved |
no |
Call Number |
Admin @ si @ GGG2012a |
Serial |
2068 |
Permanent link to this record |
|
|
|
Author |
Jose Manuel Alvarez; Antonio Lopez |
Title |
Photometric Invariance by Machine Learning |
Type |
Book Chapter |
Year |
2012 |
Publication |
Color in Computer Vision: Fundamentals and Applications |
Abbreviated Journal |
|
Volume |
7 |
Issue |
|
Pages |
113-134 |
Keywords |
road detection |
Abstract |
|
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
iConcept Press Ltd |
Place of Publication |
|
Editor |
Theo Gevers, Arjan Gijsenij, Joost van de Weijer, Jan-Mark Geusebroek |
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN ![sorted by ISBN field, ascending order (up)](img/sort_asc.gif) |
978-0-470-89084-4 |
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
ADAS |
Approved |
no |
Call Number |
Admin @ si @ AlL2012 |
Serial |
2186 |
Permanent link to this record |
|
|
|
Author |
Jon Almazan; Alicia Fornes; Ernest Valveny |
Title |
A Non-Rigid Feature Extraction Method for Shape Recognition |
Type |
Conference Article |
Year |
2011 |
Publication |
11th International Conference on Document Analysis and Recognition |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
987-991 |
Keywords |
|
Abstract |
This paper presents a methodology for shape recognition that focuses on dealing with the difficult problem of large deformations. The proposed methodology consists in a novel feature extraction technique, which uses a non-rigid representation adaptable to the shape. This technique employs a deformable grid based on the computation of geometrical centroids that follows a region partitioning algorithm. Then, a feature vector is extracted by computing pixel density measures around these geometrical centroids. The result is a shape descriptor that adapts its representation to the given shape and encodes the pixel density distribution. The validity of the method when dealing with large deformations has been experimentally shown over datasets composed of handwritten shapes. It has been applied to signature verification and shape recognition tasks demonstrating high accuracy and low computational cost. |
Address |
Beijing; China; September 2011 |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN ![sorted by ISBN field, ascending order (up)](img/sort_asc.gif) |
978-0-7695-4520-2 |
Medium |
|
Area |
|
Expedition |
|
Conference |
ICDAR |
Notes |
DAG |
Approved |
no |
Call Number |
Admin @ si @ AFV2011 |
Serial |
1763 |
Permanent link to this record |
|
|
|
Author |
Lluis Pere de las Heras; Joan Mas; Gemma Sanchez; Ernest Valveny |
Title |
Wall Patch-Based Segmentation in Architectural Floorplans |
Type |
Conference Article |
Year |
2011 |
Publication |
11th International Conference on Document Analysis and Recognition |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
1270-1274 |
Keywords |
|
Abstract |
Segmentation of architectural floor plans is a challenging task, mainly because of the large variability in the notation between different plans. In general, traditional techniques, usually based on analyzing and grouping structural primitives obtained by vectorization, are only able to handle a reduced range of similar notations. In this paper we propose an alternative patch-based segmentation approach working at pixel level, without need of vectorization. The image is divided into a set of patches and a set of features is extracted for every patch. Then, each patch is assigned to a visual word of a previously learned vocabulary and given a probability of belonging to each class of objects. Finally, a post-process assigns the final label for every pixel. This approach has been applied to the detection of walls on two datasets of architectural floor plans with different notations, achieving high accuracy rates. |
Address |
Beiging, China |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
1520-5363 |
ISBN ![sorted by ISBN field, ascending order (up)](img/sort_asc.gif) |
978-0-7695-4520-2 |
Medium |
|
Area |
|
Expedition |
|
Conference |
ICDAR |
Notes |
DAG |
Approved |
no |
Call Number |
Admin @ si @ HMS2011a |
Serial |
1792 |
Permanent link to this record |
|
|
|
Author |
Alicia Fornes; Anjan Dutta; Albert Gordo; Josep Llados |
Title |
The ICDAR 2011 Music Scores Competition: Staff Removal and Writer Identification |
Type |
Conference Article |
Year |
2011 |
Publication |
11th International Conference on Document Analysis and Recognition |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
1511-1515 |
Keywords |
|
Abstract |
In the last years, there has been a growing interest in the analysis of handwritten music scores. In this sense, our goal has been to foster the interest in the analysis of handwritten music scores by the proposal of two different competitions: Staff removal and Writer Identification. Both competitions have been tested on the CVC-MUSCIMA database: a ground-truth of handwritten music score images. This paper describes the competition details, including the dataset and ground-truth, the evaluation metrics, and a short description of the participants, their methods, and the obtained results. |
Address |
Beijing, China |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN ![sorted by ISBN field, ascending order (up)](img/sort_asc.gif) |
978-0-7695-4520-2 |
Medium |
|
Area |
|
Expedition |
|
Conference |
ICDAR |
Notes |
DAG |
Approved |
no |
Call Number |
Admin @ si @ FDG2011b |
Serial |
1794 |
Permanent link to this record |
|
|
|
Author |
Albert Gordo; Florent Perronnin; Ernest Valveny |
Title |
Document classification using multiple views |
Type |
Conference Article |
Year |
2012 |
Publication |
10th IAPR International Workshop on Document Analysis Systems |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
33-37 |
Keywords |
|
Abstract |
The combination of multiple features or views when representing documents or other kinds of objects usually leads to improved results in classification (and retrieval) tasks. Most systems assume that those views will be available both at training and test time. However, some views may be too `expensive' to be available at test time. In this paper, we consider the use of Canonical Correlation Analysis to leverage `expensive' views that are available only at training time. Experimental results show that this information may significantly improve the results in a classification task. |
Address |
Australia |
Corporate Author |
|
Thesis |
|
Publisher |
IEEE Computer Society Washington |
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN ![sorted by ISBN field, ascending order (up)](img/sort_asc.gif) |
978-0-7695-4661-2 |
Medium |
|
Area |
|
Expedition |
|
Conference |
DAS |
Notes |
DAG |
Approved |
no |
Call Number |
Admin @ si @ GPV2012 |
Serial |
2049 |
Permanent link to this record |
|
|
|
Author |
Andreas Møgelmose; Chris Bahnsen; Thomas B. Moeslund; Albert Clapes; Sergio Escalera |
Title |
Tri-modal Person Re-identification with RGB, Depth and Thermal Features |
Type |
Conference Article |
Year |
2013 |
Publication |
9th IEEE Workshop on Perception beyond the visible Spectrum, Computer Vision and Pattern Recognition |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
301-307 |
Keywords |
|
Abstract |
Person re-identification is about recognizing people who have passed by a sensor earlier. Previous work is mainly based on RGB data, but in this work we for the first time present a system where we combine RGB, depth, and thermal data for re-identification purposes. First, from each of the three modalities, we obtain some particular features: from RGB data, we model color information from different regions of the body, from depth data, we compute different soft body biometrics, and from thermal data, we extract local structural information. Then, the three information types are combined in a joined classifier. The tri-modal system is evaluated on a new RGB-D-T dataset, showing successful results in re-identification scenarios. |
Address |
Portland; oregon; June 2013 |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN ![sorted by ISBN field, ascending order (up)](img/sort_asc.gif) |
978-0-7695-4990-3 |
Medium |
|
Area |
|
Expedition |
|
Conference |
CVPRW |
Notes |
HUPBA;MILAB |
Approved |
no |
Call Number |
Admin @ si @ MBM2013 |
Serial |
2253 |
Permanent link to this record |
|
|
|
Author |
Isabel Guitart; Jordi Conesa; Luis Villarejo; Agata Lapedriza; David Masip; Antoni Perez; Elena Planas |
Title |
Opinion Mining on Educational Resources at the Open University of Catalonia |
Type |
Conference Article |
Year |
2013 |
Publication |
3rd International Workshop on Adaptive Learning via Interactive, Collaborative and Emotional approaches. In conjunction with CISIS 2013: The 7th International Conference on Complex, Intelligent, and Software Intensive Systems |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
385 - 390 |
Keywords |
|
Abstract |
In order to make improvements to teaching, it is vital to know what students think of the way they are taught. With that purpose in mind, exhaustively analyzing the forums associated with the subjects taught at the Universitat Oberta de Cataluya (UOC) would be extremely helpful, as the university's students often post comments on their learning experiences in them. Exploiting the content of such forums is not a simple undertaking. The volume of data involved is very large, and performing the task manually would require a great deal of effort from lecturers. As a first step to solve this problem, we propose a tool to automatically analyze the posts in forums of communities of UOC students and teachers, with a view to systematically mining the opinions they contain. This article defines the architecture of such tool and explains how lexical-semantic and language technology resources can be used to that end. For pilot testing purposes, the tool has been used to identify students' opinions on the UOC's Business Intelligence master's degree course during the last two years. The paper discusses the results of such test. The contribution of this paper is twofold. Firstly, it demonstrates the feasibility of using natural language parsing techniques to help teachers to make decisions. Secondly, it introduces a simple tool that can be refined and adapted to a virtual environment for the purpose in question. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN ![sorted by ISBN field, ascending order (up)](img/sort_asc.gif) |
978-0-7695-4992-7 |
Medium |
|
Area |
|
Expedition |
|
Conference |
ALICE |
Notes |
OR;MV |
Approved |
no |
Call Number |
GCV2013 |
Serial |
2268 |
Permanent link to this record |
|
|
|
Author |
Josep Llados; Marçal Rusiñol |
Title |
Graphics Recognition Techniques |
Type |
Book Chapter |
Year |
2014 |
Publication |
Handbook of Document Image Processing and Recognition |
Abbreviated Journal |
|
Volume |
D |
Issue |
|
Pages |
489-521 |
Keywords |
Dimension recognition; Graphics recognition; Graphic-rich documents; Polygonal approximation; Raster-to-vector conversion; Texture-based primitive extraction; Text-graphics separation |
Abstract |
This chapter describes the most relevant approaches for the analysis of graphical documents. The graphics recognition pipeline can be splitted into three tasks. The low level or lexical task extracts the basic units composing the document. The syntactic level is focused on the structure, i.e., how graphical entities are constructed, and involves the location and classification of the symbols present in the document. The third level is a functional or semantic level, i.e., it models what the graphical symbols do and what they mean in the context where they appear. This chapter covers the lexical level, while the next two chapters are devoted to the syntactic and semantic level, respectively. The main problems reviewed in this chapter are raster-to-vector conversion (vectorization algorithms) and the separation of text and graphics components. The research and industrial communities have provided standard methods achieving reasonable performance levels. Hence, graphics recognition techniques can be considered to be in a mature state from a scientific point of view. Additionally this chapter provides insights on some related problems, namely, the extraction and recognition of dimensions in engineering drawings, and the recognition of hatched and tiled patterns. Both problems are usually associated, even integrated, in the vectorization process. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
Springer London |
Place of Publication |
|
Editor |
D. Doermann; K. Tombre |
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN ![sorted by ISBN field, ascending order (up)](img/sort_asc.gif) |
978-0-85729-858-4 |
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
DAG; 600.077 |
Approved |
no |
Call Number |
Admin @ si @ LlR2014 |
Serial |
2380 |
Permanent link to this record |
|
|
|
Author |
Salvatore Tabbone; Oriol Ramos Terrades |
Title |
An Overview of Symbol Recognition |
Type |
Book Chapter |
Year |
2014 |
Publication |
Handbook of Document Image Processing and Recognition |
Abbreviated Journal |
|
Volume |
D |
Issue |
|
Pages |
523-551 |
Keywords |
Pattern recognition; Shape descriptors; Structural descriptors; Symbolrecognition; Symbol spotting |
Abstract |
According to the Cambridge Dictionaries Online, a symbol is a sign, shape, or object that is used to represent something else. Symbol recognition is a subfield of general pattern recognition problems that focuses on identifying, detecting, and recognizing symbols in technical drawings, maps, or miscellaneous documents such as logos and musical scores. This chapter aims at providing the reader an overview of the different existing ways of describing and recognizing symbols and how the field has evolved to attain a certain degree of maturity. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
Springer London |
Place of Publication |
|
Editor |
D. Doermann; K. Tombre |
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN ![sorted by ISBN field, ascending order (up)](img/sort_asc.gif) |
978-0-85729-858-4 |
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
DAG; 600.077 |
Approved |
no |
Call Number |
Admin @ si @ TaT2014 |
Serial |
2489 |
Permanent link to this record |
|
|
|
Author |
A.Kesidis; Dimosthenis Karatzas |
Title |
Logo and Trademark Recognition |
Type |
Book Chapter |
Year |
2014 |
Publication |
Handbook of Document Image Processing and Recognition |
Abbreviated Journal |
|
Volume |
D |
Issue |
|
Pages |
591-646 |
Keywords |
Logo recognition; Logo removal; Logo spotting; Trademark registration; Trademark retrieval systems |
Abstract |
The importance of logos and trademarks in nowadays society is indisputable, variably seen under a positive light as a valuable service for consumers or a negative one as a catalyst of ever-increasing consumerism. This chapter discusses the technical approaches for enabling machines to work with logos, looking into the latest methodologies for logo detection, localization, representation, recognition, retrieval, and spotting in a variety of media. This analysis is presented in the context of three different applications covering the complete depth and breadth of state of the art techniques. These are trademark retrieval systems, logo recognition in document images, and logo detection and removal in images and videos. This chapter, due to the very nature of logos and trademarks, brings together various facets of document image analysis spanning graphical and textual content, while it links document image analysis to other computer vision domains, especially when it comes to the analysis of real-scene videos and images. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
Springer London |
Place of Publication |
|
Editor |
D. Doermann; K. Tombre |
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN ![sorted by ISBN field, ascending order (up)](img/sort_asc.gif) |
978-0-85729-858-4 |
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
DAG; 600.077 |
Approved |
no |
Call Number |
Admin @ si @ KeK2014 |
Serial |
2425 |
Permanent link to this record |
|
|
|
Author |
Alicia Fornes; Gemma Sanchez |
Title |
Analysis and Recognition of Music Scores |
Type |
Book Chapter |
Year |
2014 |
Publication |
Handbook of Document Image Processing and Recognition |
Abbreviated Journal |
|
Volume |
E |
Issue |
|
Pages |
749-774 |
Keywords |
|
Abstract |
The analysis and recognition of music scores has attracted the interest of researchers for decades. Optical Music Recognition (OMR) is a classical research field of Document Image Analysis and Recognition (DIAR), whose aim is to extract information from music scores. Music scores contain both graphical and textual information, and for this reason, techniques are closely related to graphics recognition and text recognition. Since music scores use a particular diagrammatic notation that follow the rules of music theory, many approaches make use of context information to guide the recognition and solve ambiguities. This chapter overviews the main Optical Music Recognition (OMR) approaches. Firstly, the different methods are grouped according to the OMR stages, namely, staff removal, music symbol recognition, and syntactical analysis. Secondly, specific approaches for old and handwritten music scores are reviewed. Finally, online approaches and commercial systems are also commented. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
Springer London |
Place of Publication |
|
Editor |
D. Doermann; K. Tombre |
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN ![sorted by ISBN field, ascending order (up)](img/sort_asc.gif) |
978-0-85729-860-7 |
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
DAG; ADAS; 600.076; 600.077 |
Approved |
no |
Call Number |
Admin @ si @ FoS2014 |
Serial |
2484 |
Permanent link to this record |