Publicacions CVC -- Query Results

<< 1 2 3 4 5 6 7 8 9 10 >> [11–12]

Details

Records
Author	Jose Manuel Alvarez; Theo Gevers; Antonio Lopez
Title	3D Scene Priors for Road Detection			Type	Conference Article
Year	2010	Publication	23rd IEEE Conference on Computer Vision and Pattern Recognition	Abbreviated Journal
Volume		Issue		Pages	57–64
Keywords	road detection
Abstract	Vision-based road detection is important in different areas of computer vision such as autonomous driving, car collision warning and pedestrian crossing detection. However, current vision-based road detection methods are usually based on low-level features and they assume structured roads, road homogeneity, and uniform lighting conditions. Therefore, in this paper, contextual 3D information is used in addition to low-level cues. Low-level photometric invariant cues are derived from the appearance of roads. Contextual cues used include horizon lines, vanishing points, 3D scene layout and 3D road stages. Moreover, temporal road cues are included. All these cues are sensitive to different imaging conditions and hence are considered as weak cues. Therefore, they are combined to improve the overall performance of the algorithm. To this end, the low-level, contextual and temporal cues are combined in a Bayesian framework to classify road sequences. Large scale experiments on road sequences show that the road detection method is robust to varying imaging conditions, road types, and scenarios (tunnels, urban and highway). Further, using the combined cues outperforms all other individual cues. Finally, the proposed method provides highest road detection accuracy when compared to state-of-the-art methods.
Address	San Francisco; CA; USA; June 2010
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1063-6919	ISBN	978-1-4244-6984-0	Medium
Area		Expedition		Conference	CVPR
Notes	ADAS;ISE			Approved	no
Call Number	ADAS @ adas @ AGL2010a			Serial	1302
Permanent link to this record



Author	Mohammad Rouhani; Angel Sappa
Title	Relaxing the 3L Algorithm for an Accurate Implicit Polynomial Fitting			Type	Conference Article
Year	2010	Publication	23rd IEEE Conference on Computer Vision and Pattern Recognition	Abbreviated Journal
Volume		Issue		Pages	3066-3072
Keywords
Abstract	This paper presents a novel method to increase the accuracy of linear fitting of implicit polynomials. The proposed method is based on the 3L algorithm philosophy. The novelty lies on the relaxation of the additional constraints, already imposed by the 3L algorithm. Hence, the accuracy of the final solution is increased due to the proper adjustment of the expected values in the aforementioned additional constraints. Although iterative, the proposed approach solves the fitting problem within a linear framework, which is independent of the threshold tuning. Experimental results, both in 2D and 3D, showing improvements in the accuracy of the fitting are presented. Comparisons with both state of the art algorithms and a geometric based one (non-linear fitting), which is used as a ground truth, are provided.
Address	San Francisco; CA; USA; June 2010
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1063-6919	ISBN	978-1-4244-6984-0	Medium
Area		Expedition		Conference	CVPR
Notes	ADAS			Approved	no
Call Number	ADAS @ adas @ RoS2010a			Serial	1303
Permanent link to this record



Author	Javier Marin; David Vazquez; David Geronimo; Antonio Lopez
Title	Learning Appearance in Virtual Scenarios for Pedestrian Detection			Type	Conference Article
Year	2010	Publication	23rd IEEE Conference on Computer Vision and Pattern Recognition	Abbreviated Journal
Volume		Issue		Pages	137–144
Keywords	Pedestrian Detection; Domain Adaptation
Abstract	Detecting pedestrians in images is a key functionality to avoid vehicle-to-pedestrian collisions. The most promising detectors rely on appearance-based pedestrian classifiers trained with labelled samples. This paper addresses the following question: can a pedestrian appearance model learnt in virtual scenarios work successfully for pedestrian detection in real images? (Fig. 1). Our experiments suggest a positive answer, which is a new and relevant conclusion for research in pedestrian detection. More specifically, we record training sequences in virtual scenarios and then appearance-based pedestrian classifiers are learnt using HOG and linear SVM. We test such classifiers in a publicly available dataset provided by Daimler AG for pedestrian detection benchmarking. This dataset contains real world images acquired from a moving car. The obtained result is compared with the one given by a classifier learnt using samples coming from real images. The comparison reveals that, although virtual samples were not specially selected, both virtual and real based training give rise to classifiers of similar performance.
Address	San Francisco; CA; USA; June 2010
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language	English	Summary Language	English	Original Title	Learning Appearance in Virtual Scenarios for Pedestrian Detection
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1063-6919	ISBN	978-1-4244-6984-0	Medium
Area		Expedition		Conference	CVPR
Notes	ADAS			Approved	no
Call Number	ADAS @ adas @ MVG2010			Serial	1304
Permanent link to this record



Author	David Aldavert; Arnau Ramisa; Ramon Lopez de Mantaras; Ricardo Toledo
Title	Fast and Robust Object Segmentation with the Integral Linear Classifier			Type	Conference Article
Year	2010	Publication	23rd IEEE Conference on Computer Vision and Pattern Recognition	Abbreviated Journal
Volume		Issue		Pages	1046–1053
Keywords
Abstract	We propose an efficient method, built on the popular Bag of Features approach, that obtains robust multiclass pixel-level object segmentation of an image in less than 500ms, with results comparable or better than most state of the art methods. We introduce the Integral Linear Classifier (ILC), that can readily obtain the classification score for any image sub-window with only 6 additions and 1 product by fusing the accumulation and classification steps in a single operation. In order to design a method as efficient as possible, our building blocks are carefully selected from the quickest in the state of the art. More precisely, we evaluate the performance of three popular local descriptors, that can be very efficiently computed using integral images, and two fast quantization methods: the Hierarchical K-Means, and the Extremely Randomized Forest. Finally, we explore the utility of adding spatial bins to the Bag of Features histograms and that of cascade classifiers to improve the obtained segmentation. Our method is compared to the state of the art in the difficult Graz-02 and PASCAL 2007 Segmentation Challenge datasets.
Address	San Francisco; CA; USA; June 2010
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1063-6919	ISBN	978-1-4244-6984-0	Medium
Area		Expedition		Conference	CVPR
Notes	ADAS			Approved	no
Call Number	Admin @ si @ ARL2010a			Serial	1311
Permanent link to this record



Author	Fadi Dornaika; Bogdan Raducanu
Title	Single Snapshot 3D Head Pose Initialization for Tracking in Human Robot Interaction Scenario			Type	Conference Article
Year	2010	Publication	1st International Workshop on Computer Vision for Human-Robot Interaction	Abbreviated Journal
Volume		Issue		Pages	32–39
Keywords	1st International Workshop on Computer Vision for Human-Robot Interaction, in conjunction with IEEE CVPR 2010
Abstract	This paper presents an automatic 3D head pose initialization scheme for a real-time face tracker with application to human-robot interaction. It has two main contributions. First, we propose an automatic 3D head pose and person specific face shape estimation, based on a 3D deformable model. The proposed approach serves to initialize our realtime 3D face tracker. What makes this contribution very attractive is that the initialization step can cope with faces under arbitrary pose, so it is not limited only to near-frontal views. Second, the previous framework is used to develop an application in which the orientation of an AIBO’s camera can be controlled through the imitation of user’s head pose. In our scenario, this application is used to build panoramic images from overlapping snapshots. Experiments on real videos confirm the robustness and usefulness of the proposed methods.
Address	San Francisco; CA; USA; June 2010
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	2160-7508	ISBN	978-1-4244-7029-7	Medium
Area		Expedition		Conference	CVPRW
Notes	OR;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ DoR2010a			Serial	1309
Permanent link to this record



Author	Albert Gordo; Alicia Fornes; Ernest Valveny; Josep Llados
Title	A Bag of Notes Approach to Writer Identification in Old Handwritten Music Scores			Type	Conference Article
Year	2010	Publication	9th IAPR International Workshop on Document Analysis Systems	Abbreviated Journal
Volume		Issue		Pages	247–254
Keywords
Abstract	Determining the authorship of a document, namely writer identification, can be an important source of information for document categorization. Contrary to text documents, the identification of the writer of graphical documents is still a challenge. In this paper we present a robust approach for writer identification in a particular kind of graphical documents, old music scores. This approach adapts the bag of visual terms method for coping with graphic documents. The identification is performed only using the graphical music notation. For this purpose, we generate a graphic vocabulary without recognizing any music symbols, and consequently, avoiding the difficulties in the recognition of hand-drawn symbols in old and degraded documents. The proposed method has been tested on a database of old music scores from the 17th to 19th centuries, achieving very high identification rates.
Address	Boston; USA;
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-60558-773-8	Medium
Area		Expedition		Conference	DAS
Notes	DAG			Approved	no
Call Number	DAG @ dag @ GFV2010			Serial	1320
Permanent link to this record



Author	Albert Gordo; Jaume Gibert; Ernest Valveny; Marçal Rusiñol
Title	A Kernel-based Approach to Document Retrieval			Type	Conference Article
Year	2010	Publication	9th IAPR International Workshop on Document Analysis Systems	Abbreviated Journal
Volume		Issue		Pages	377–384
Keywords
Abstract	In this paper we tackle the problem of document image retrieval by combining a similarity measure between documents and the probability that a given document belongs to a certain class. The membership probability to a specific class is computed using Support Vector Machines in conjunction with similarity measure based kernel applied to structural document representations. In the presented experiments, we use different document representations, both visual and structural, and we apply them to a database of historical documents. We show how our method based on similarity kernels outperforms the usual distance-based retrieval.
Address	Boston; USA;
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-60558-773-8	Medium
Area		Expedition		Conference	DAS
Notes	DAG			Approved	no
Call Number	DAG @ dag @ GGV2010			Serial	1431
Permanent link to this record



Author	Antonio Clavelli; Dimosthenis Karatzas; Josep Llados
Title	A framework for the assessment of text extraction algorithms on complex colour images			Type	Conference Article
Year	2010	Publication	9th IAPR International Workshop on Document Analysis Systems	Abbreviated Journal
Volume		Issue		Pages	19–26
Keywords
Abstract	The availability of open, ground-truthed datasets and clear performance metrics is a crucial factor in the development of an application domain. The domain of colour text image analysis (real scenes, Web and spam images, scanned colour documents) has traditionally suffered from a lack of a comprehensive performance evaluation framework. Such a framework is extremely difficult to specify, and corresponding pixel-level accurate information tedious to define. In this paper we discuss the challenges and technical issues associated with developing such a framework. Then, we describe a complete framework for the evaluation of text extraction methods at multiple levels, provide a detailed ground-truth specification and present a case study on how this framework can be used in a real-life situation.
Address	Boston; USA;
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-60558-773-8	Medium
Area		Expedition		Conference	DAS
Notes	DAG			Approved	no
Call Number	DAG @ dag @ CKL2010			Serial	1432
Permanent link to this record



Author	Partha Pratim Roy; Umapada Pal; Josep Llados
Title	Query Driven Word Retrieval in Graphical Documents			Type	Conference Article
Year	2010	Publication	9th IAPR International Workshop on Document Analysis Systems	Abbreviated Journal
Volume		Issue		Pages	191–198
Keywords
Abstract	In this paper, we present an approach towards the retrieval of words from graphical document images. In graphical documents, due to presence of multi-oriented characters in non-structured layout, word indexing is a challenging task. The proposed approach uses recognition results of individual components to form character pairs with the neighboring components. An indexing scheme is designed to store the spatial description of components and to access them efficiently. Given a query text word (ascii/unicode format), the character pairs present in it are searched in the document. Next the retrieved character pairs are linked sequentially to form character string. Dynamic programming is applied to find different instances of query words. A string edit distance is used here to match the query word as the objective function. Recognition of multi-scale and multi-oriented character component is done using Support Vector Machine classifier. To consider multi-oriented character strings the features used in the SVM are invariant to character orientation. Experimental results show that the method is efficient to locate a query word from multi-oriented text in graphical documents.
Address	Boston; USA
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-60558-773-8	Medium
Area		Expedition		Conference	DAS
Notes	DAG			Approved	no
Call Number	DAG @ dag @ RPL2010b			Serial	1433
Permanent link to this record



Author	Marçal Rusiñol; Josep Llados
Title	Efficient Logo Retrieval Through Hashing Shape Context Descriptors			Type	Conference Article
Year	2010	Publication	9th IAPR International Workshop on Document Analysis Systems	Abbreviated Journal
Volume		Issue		Pages	215–222
Keywords
Abstract	In this paper, we present an approach towards the retrieval of words from graphical document images. In graphical documents, due to presence of multi-oriented characters in non-structured layout, word indexing is a challenging task. The proposed approach uses recognition results of individual components to form character pairs with the neighboring components. An indexing scheme is designed to store the spatial description of components and to access them efficiently. Given a query text word (ascii/unicode format), the character pairs present in it are searched in the document. Next the retrieved character pairs are linked sequentially to form character string. Dynamic programming is applied to find different instances of query words. A string edit distance is used here to match the query word as the objective function. Recognition of multi-scale and multi-oriented character component is done using Support Vector Machine classifier. To consider multi-oriented character strings the features used in the SVM are invariant to character orientation. Experimental results show that the method is efficient to locate a query word from multi-oriented text in graphical documents.
Address	Boston; USA
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	DAS
Notes	DAG			Approved	no
Call Number	DAG @ dag @ RuL2010b			Serial	1434
Permanent link to this record



Author	Farshad Nourbakhsh; Dimosthenis Karatzas; Ernest Valveny
Title	A polar-based logo representation based on topological and colour features			Type	Conference Article
Year	2010	Publication	9th IAPR International Workshop on Document Analysis Systems	Abbreviated Journal
Volume		Issue		Pages	341–348
Keywords
Abstract	In this paper, we propose a novel rotation and scale invariant method for colour logo retrieval and classification, which involves performing a simple colour segmentation and subsequently describing each of the resultant colour components based on a set of topological and colour features. A polar representation is used to represent the logo and the subsequent logo matching is based on Cyclic Dynamic Time Warping (CDTW). We also show how combining information about the global distribution of the logo components and their local neighbourhood using the Delaunay triangulation allows to improve the results. All experiments are performed on a dataset of 2500 instances of 100 colour logo images in different rotations and scales.
Address	Boston; USA;
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-60558-773-8	Medium
Area		Expedition		Conference	DAS
Notes	DAG			Approved	no
Call Number	DAG @ dag @ NKV2010			Serial	1436
Permanent link to this record



Author	Sebastien Mace; Herve Locteau; Ernest Valveny; Salvatore Tabbone
Title	A system to detect rooms in architectural floor plan images			Type	Conference Article
Year	2010	Publication	9th IAPR International Workshop on Document Analysis Systems	Abbreviated Journal
Volume		Issue		Pages	167–174
Keywords
Abstract	In this article, a system to detect rooms in architectural floor plan images is described. We first present a primitive extraction algorithm for line detection. It is based on an original coupling of classical Hough transform with image vectorization in order to perform robust and efficient line detection. We show how the lines that satisfy some graphical arrangements are combined into walls. We also present the way we detect some door hypothesis thanks to the extraction of arcs. Walls and door hypothesis are then used by our room segmentation strategy; it consists in recursively decomposing the image until getting nearly convex regions. The notion of convexity is difficult to quantify, and the selection of separation lines between regions can also be rough. We take advantage of knowledge associated to architectural floor plans in order to obtain mostly rectangular rooms. Qualitative and quantitative evaluations performed on a corpus of real documents show promising results.
Address	Boston; USA
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-60558-773-8	Medium
Area		Expedition		Conference	DAS
Notes	DAG			Approved	no
Call Number	DAG @ dag @ MLV2010			Serial	1437
Permanent link to this record



Author	Oriol Ramos Terrades; N. Serrano; Albert Gordo; Ernest Valveny; Alfons Juan-Ciscar
Title	Interactive-predictive detection of handwritten text blocks			Type	Conference Article
Year	2010	Publication	17th Document Recognition and Retrieval Conference, part of the IS&T-SPIE Electronic Imaging Symposium	Abbreviated Journal
Volume	7534	Issue		Pages	75340Q–75340Q–10
Keywords
Abstract	A method for text block detection is introduced for old handwritten documents. The proposed method takes advantage of sequential book structure, taking into account layout information from pages previously transcribed. This glance at the past is used to predict the position of text blocks in the current page with the help of conventional layout analysis methods. The method is integrated into the GIDOC prototype: a first attempt to provide integrated support for interactive-predictive page layout analysis, text line detection and handwritten text transcription. Results are given in a transcription task on a 764-page Spanish manuscript from 1891.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	DRR
Notes	DAG			Approved	no
Call Number	DAG @ dag @ TSG2010			Serial	1479
Permanent link to this record



Author	Marco Pedersoli; Jordi Gonzalez; Andrew Bagdanov; Juan J. Villanueva
Title	Recursive Coarse-to-Fine Localization for fast Object Recognition			Type	Conference Article
Year	2010	Publication	11th European Conference on Computer Vision	Abbreviated Journal
Volume	6313	Issue	II	Pages	280–293
Keywords
Abstract	Cascading techniques are commonly used to speed-up the scan of an image for object detection. However, cascades of detectors are slow to train due to the high number of detectors and corresponding thresholds to learn. Furthermore, they do not use any prior knowledge about the scene structure to decide where to focus the search. To handle these problems, we propose a new way to scan an image, where we couple a recursive coarse-to-fine refinement together with spatial constraints of the object location. For doing that we split an image into a set of uniformly distributed neighborhood regions, and for each of these we apply a local greedy search over feature resolutions. The neighborhood is defined as a scanning region that only one object can occupy. Therefore the best hypothesis is obtained as the location with maximum score and no thresholds are needed. We present an implementation of our method using a pyramid of HOG features and we evaluate it on two standard databases, VOC2007 and INRIA dataset. Results show that the Recursive Coarse-to-Fine Localization (RCFL) achieves a 12x speed-up compared to standard sliding windows. Compared with a cascade of multiple resolutions approach our method has slightly better performance in speed and Average-Precision. Furthermore, in contrast to cascading approach, the speed-up is independent of image conditions, the number of detected objects and clutter.
Address	Crete (Greece)
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-15566-6	Medium
Area		Expedition		Conference	ECCV
Notes	ISE			Approved	no
Call Number	DAG @ dag @ PGB2010			Serial	1438
Permanent link to this record



Author	Carles Fernandez; Jordi Gonzalez; Xavier Roca
Title	Automatic Learning of Background Semantics in Generic Surveilled Scenes			Type	Conference Article
Year	2010	Publication	11th European Conference on Computer Vision	Abbreviated Journal
Volume	6313	Issue	II	Pages	678–692
Keywords
Abstract	Advanced surveillance systems for behavior recognition in outdoor traffic scenes depend strongly on the particular configuration of the scenario. Scene-independent trajectory analysis techniques statistically infer semantics in locations where motion occurs, and such inferences are typically limited to abnormality. Thus, it is interesting to design contributions that automatically categorize more specific semantic regions. State-of-the-art approaches for unsupervised scene labeling exploit trajectory data to segment areas like sources, sinks, or waiting zones. Our method, in addition, incorporates scene-independent knowledge to assign more meaningful labels like crosswalks, sidewalks, or parking spaces. First, a spatiotemporal scene model is obtained from trajectory analysis. Subsequently, a so-called GI-MRF inference process reinforces spatial coherence, and incorporates taxonomy-guided smoothness constraints. Our method achieves automatic and effective labeling of conceptual regions in urban scenarios, and is robust to tracking errors. Experimental validation on 5 surveillance databases has been conducted to assess the generality and accuracy of the segmentations. The resulting scene models are used for model-based behavior analysis.
Address	Crete (Greece)
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-15551-2	Medium
Area		Expedition		Conference	ECCV
Notes	ISE			Approved	no
Call Number	ISE @ ise @ FGR2010			Serial	1439
Permanent link to this record