Publicacions CVC -- Query Results

[201–210] << 211 212 213 214 215 216 217 218 219 220 >> [221–228]

Details

Records
Author	Mohammad Rouhani; Angel Sappa
Title	Non-Rigid Shape Registration: A Single Linear Least Squares Framework			Type	Conference Article
Year	2012	Publication	12th European Conference on Computer Vision	Abbreviated Journal
Volume	7578	Issue		Pages	264-277
Keywords
Abstract	This paper proposes a non-rigid registration formulation capturing both global and local deformations in a single framework. This formulation is based on a quadratic estimation of the registration distance together with a quadratic regularization term. Hence, the optimal transformation parameters are easily obtained by solving a liner system of equations, which guarantee a fast convergence. Experimental results with challenging 2D and 3D shapes are presented to show the validity of the proposed framework. Furthermore, comparisons with the most relevant approaches are provided.
Address	Florencia
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-33785-7	Medium
Area		Expedition		Conference	ECCV
Notes	ADAS			Approved	no
Call Number	Admin @ si @ RoS2012a			Serial	2158
Permanent link to this record



Author	Patricia Marquez;Debora Gil;Aura Hernandez-Sabate
Title	A Complete Confidence Framework for Optical Flow			Type	Conference Article
Year	2012	Publication	12th European Conference on Computer Vision – Workshops and Demonstrations	Abbreviated Journal
Volume	7584	Issue	2	Pages	124-133
Keywords	Optical flow, confidence measures, sparsification plots, error prediction plots
Abstract	Medial representations are powerful tools for describing and parameterizing the volumetric shape of anatomical structures. Existing methods show excellent results when applied to 2D objects, but their quality drops across dimensions. This paper contributes to the computation of medial manifolds in two aspects. First, we provide a standard scheme for the computation of medial manifolds that avoid degenerated medial axis segments; second, we introduce an energy based method which performs independently of the dimension. We evaluate quantitatively the performance of our method with respect to existing approaches, by applying them to synthetic shapes of known medial geometry. Finally, we show results on shape representation of multiple abdominal organs, exploring the use of medial manifolds for the representation of multi-organ relations.
Address
Corporate Author				Thesis
Publisher	Springer-Verlag	Place of Publication	Florence, Italy, October 7-13, 2012	Editor	Andrea Fusiello, Vittorio Murino ,Rita Cucchiara
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN	978-3-642-33867-0	Medium
Area		Expedition		Conference	ECCVW
Notes	IAM;ADAS;			Approved	no
Call Number	IAM @ iam @ MGH2012b			Serial	1991
Permanent link to this record



Author	David Masip; Alexander Todorov; Jordi Vitria
Title	The Role of Facial Regions in Evaluating Social Dime			Type	Conference Article
Year	2012	Publication	12th European Conference on Computer Vision – Workshops and Demonstrations	Abbreviated Journal
Volume	7584	Issue	II	Pages	210-219
Keywords	Workshops and Demonstrations
Abstract	Facial trait judgments are an important information cue for people. Recent works in the Psychology field have stated the basis of face evaluation, defining a set of traits that we evaluate from faces (e.g. dominance, trustworthiness, aggressiveness, attractiveness, threatening or intelligence among others). We rapidly infer information from others faces, usually after a short period of time (< 1000ms) we perceive a certain degree of dominance or trustworthiness of another person from the face. Although these perceptions are not necessarily accurate, they influence many important social outcomes (such as the results of the elections or the court decisions). This topic has also attracted the attention of Computer Vision scientists, and recently a computational model to automatically predict trait evaluations from faces has been proposed. These systems try to mimic the human perception by means of applying machine learning classifiers to a set of labeled data. In this paper we perform an experimental study on the specific facial features that trigger the social inferences. Using previous results from the literature, we propose to use simple similarity maps to evaluate which regions of the face influence the most the trait inferences. The correlation analysis is performed using only appearance, and the results from the experiments suggest that each trait is correlated with specific facial characteristics.
Address	Florence, Italy
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor	Andrea Fusiello, Vittorio Murino, Rita Cucchiara
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-33867-0	Medium
Area		Expedition		Conference	ECCVW
Notes	OR;MV			Approved	no
Call Number	Admin @ si @ MTV2012			Serial	2171
Permanent link to this record



Author	Bogdan Raducanu; Fadi Dornaika
Title	Pose-Invariant Face Recognition in Videos for Human-Machine Interaction			Type	Conference Article
Year	2012	Publication	12th European Conference on Computer Vision	Abbreviated Journal
Volume	7584	Issue		Pages	566.575
Keywords
Abstract	Human-machine interaction is a hot topic nowadays in the communities of computer vision and robotics. In this context, face recognition algorithms (used as primary cue for a person’s identity assessment) work well under controlled conditions but degrade significantly when tested in real-world environments. This is mostly due to the difficulty of simultaneously handling variations in illumination, pose, and occlusions. In this paper, we propose a novel approach for robust pose-invariant face recognition for human-robot interaction based on the real-time fitting of a 3D deformable model to input images taken from video sequences. More concrete, our approach generates a rectified face image irrespective with the actual head-pose orientation. Experimental results performed on Honda video database, using several manifold learning techniques, show a distinct advantage of the proposed method over the standard 2D appearance-based snapshot approach.
Address
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-33867-0	Medium
Area		Expedition		Conference	ECCVW
Notes	OR;MV			Approved	no
Call Number	Admin @ si @ RaD2012e			Serial	2182
Permanent link to this record



Author	Jose Manuel Alvarez; Y. LeCun; Theo Gevers; Antonio Lopez
Title	Semantic Road Segmentation via Multi-Scale Ensembles of Learned Features			Type	Conference Article
Year	2012	Publication	12th European Conference on Computer Vision – Workshops and Demonstrations	Abbreviated Journal
Volume	7584	Issue		Pages	586-595
Keywords	road detection
Abstract	Semantic segmentation refers to the process of assigning an object label (e.g., building, road, sidewalk, car, pedestrian) to every pixel in an image. Common approaches formulate the task as a random field labeling problem modeling the interactions between labels by combining local and contextual features such as color, depth, edges, SIFT or HoG. These models are trained to maximize the likelihood of the correct classification given a training set. However, these approaches rely on hand–designed features (e.g., texture, SIFT or HoG) and a higher computational time required in the inference process. Therefore, in this paper, we focus on estimating the unary potentials of a conditional random field via ensembles of learned features. We propose an algorithm based on convolutional neural networks to learn local features from training data at different scales and resolutions. Then, diversification between these features is exploited using a weighted linear combination. Experiments on a publicly available database show the effectiveness of the proposed method to perform semantic road scene segmentation in still images. The algorithm outperforms appearance based methods and its performance is similar compared to state–of–the–art methods using other sources of information such as depth, motion or stereo.
Address
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-33867-0	Medium
Area		Expedition		Conference	ECCVW
Notes	ADAS;ISE			Approved	no
Call Number	Admin @ si @ ALG2012; ADAS @ adas			Serial	2187
Permanent link to this record



Author	Muhammad Muzzamil Luqman; Jean-Yves Ramel; Josep Llados
Title	Improving Fuzzy Multilevel Graph Embedding through Feature Selection Technique			Type	Conference Article
Year	2012	Publication	Structural, Syntactic, and Statistical Pattern Recognition, Joint IAPR International Workshop	Abbreviated Journal
Volume	7626	Issue		Pages	243-253
Keywords
Abstract	Graphs are the most powerful, expressive and convenient data structures but there is a lack of efficient computational tools and algorithms for processing them. The embedding of graphs into numeric vector spaces permits them to access the state-of-the-art computational efficient statistical models and tools. In this paper we take forward our work on explicit graph embedding and present an improvement to our earlier proposed method, named “fuzzy multilevel graph embedding – FMGE”, through feature selection technique. FMGE achieves the embedding of attributed graphs into low dimensional vector spaces by performing a multilevel analysis of graphs and extracting a set of global, structural and elementary level features. Feature selection permits FMGE to select the subset of most discriminating features and to discard the confusing ones for underlying graph dataset. Experimental results for graph classification experimentation on IAM letter, GREC and fingerprint graph databases, show improvement in the performance of FMGE.
Address
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-34165-6	Medium
Area		Expedition		Conference	SSPR&SPR
Notes	DAG			Approved	no
Call Number	Admin @ si @ LRL2012			Serial	2381
Permanent link to this record



Author	Volkmar Frinken; Alicia Fornes; Josep Llados; Jean-Marc Ogier
Title	Bidirectional Language Model for Handwriting Recognition			Type	Conference Article
Year	2012	Publication	Structural, Syntactic, and Statistical Pattern Recognition, Joint IAPR International Workshop	Abbreviated Journal
Volume	7626	Issue		Pages	611-619
Keywords
Abstract	In order to improve the results of automatically recognized handwritten text, information about the language is commonly included in the recognition process. A common approach is to represent a text line as a sequence. It is processed in one direction and the language information via n-grams is directly included in the decoding. This approach, however, only uses context on one side to estimate a word’s probability. Therefore, we propose a bidirectional recognition in this paper, using distinct forward and a backward language models. By combining decoding hypotheses from both directions, we achieve a significant increase in recognition accuracy for the off-line writer independent handwriting recognition task. Both language models are of the same type and can be estimated on the same corpus. Hence, the increase in recognition accuracy comes without any additional need for training data or language modeling complexity.
Address	Japan
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-34165-6	Medium
Area		Expedition		Conference	SSPR&SPR
Notes	DAG			Approved	no
Call Number	Admin @ si @ FFL2012			Serial	2057
Permanent link to this record



Author	Klaus Broelemann; Anjan Dutta; Xiaoyi Jiang; Josep Llados
Title	Hierarchical graph representation for symbol spotting in graphical document images			Type	Conference Article
Year	2012	Publication	Structural, Syntactic, and Statistical Pattern Recognition, Joint IAPR International Workshop	Abbreviated Journal
Volume	7626	Issue		Pages	529-538
Keywords
Abstract	Symbol spotting can be defined as locating given query symbol in a large collection of graphical documents. In this paper we present a hierarchical graph representation for symbols. This representation allows graph matching methods to deal with low-level vectorization errors and, thus, to perform a robust symbol spotting. To show the potential of this approach, we conduct an experiment with the SESYD dataset.
Address	Miyajima-Itsukushima, Hiroshima
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-34165-6	Medium
Area		Expedition		Conference	SSPR&SPR
Notes	DAG			Approved	no
Call Number	Admin @ si @ BDJ2012			Serial	2126
Permanent link to this record



Author	Jaume Gibert; Ernest Valveny; Horst Bunke; Alicia Fornes
Title	On the Correlation of Graph Edit Distance and L1 Distance in the Attribute Statistics Embedding Space			Type	Conference Article
Year	2012	Publication	Structural, Syntactic, and Statistical Pattern Recognition, Joint IAPR International Workshop	Abbreviated Journal
Volume	7626	Issue		Pages	135-143
Keywords
Abstract	Graph embeddings in vector spaces aim at assigning a pattern vector to every graph so that the problems of graph classification and clustering can be solved by using data processing algorithms originally developed for statistical feature vectors. An important requirement graph features should fulfil is that they reproduce as much as possible the properties among objects in the graph domain. In particular, it is usually desired that distances between pairs of graphs in the graph domain closely resemble those between their corresponding vectorial representations. In this work, we analyse relations between the edit distance in the graph domain and the L1 distance of the attribute statistics based embedding, for which good classification performance has been reported on various datasets. We show that there is actually a high correlation between the two kinds of distances provided that the corresponding parameter values that account for balancing the weight between node and edge based features are properly selected.
Address
Corporate Author				Thesis
Publisher	Springer-Berlag, Berlin	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN	978-3-642-34165-6	Medium
Area		Expedition		Conference	SSPR&SPR
Notes	DAG			Approved	no
Call Number	Admin @ si @ GVB2012c			Serial	2167
Permanent link to this record



Author	Fadi Dornaika; A.Assoum; Bogdan Raducanu
Title	Automatic Dimensionality Estimation for Manifold Learning through Optimal Feature Selection			Type	Conference Article
Year	2012	Publication	Structural, Syntactic, and Statistical Pattern Recognition, Joint IAPR International Workshop	Abbreviated Journal
Volume	7626	Issue		Pages	575-583
Keywords
Abstract	A very important aspect in manifold learning is represented by automatic estimation of the intrinsic dimensionality. Unfortunately, this problem has received few attention in the literature of manifold learning. In this paper, we argue that feature selection paradigm can be used to the problem of automatic dimensionality estimation. Besides this, it also leads to improved recognition rates. Our approach for optimal feature selection is based on a Genetic Algorithm. As a case study for manifold learning, we have considered Laplacian Eigenmaps (LE) and Locally Linear Embedding (LLE). The effectiveness of the proposed framework was tested on the face recognition problem. Extensive experiments carried out on ORL, UMIST, Yale, and Extended Yale face data sets confirmed our hypothesis.
Address
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-34165-6	Medium
Area		Expedition		Conference	SSPR&SPR
Notes	OR;MV			Approved	no
Call Number	Admin @ si @ DAR2012			Serial	2174
Permanent link to this record



Author	Bogdan Raducanu; Fadi Dornaika
Title	Out-of-Sample Embedding by Sparse Representation			Type	Conference Article
Year	2012	Publication	Structural, Syntactic, and Statistical Pattern Recognition, Joint IAPR International Workshop	Abbreviated Journal
Volume	7626	Issue		Pages	336-344
Keywords
Abstract	A critical aspect of non-linear dimensionality reduction techniques is represented by the construction of the adjacency graph. The difficulty resides in finding the optimal parameters, a process which, in general, is heuristically driven. Recently, sparse representation has been proposed as a non-parametric solution to overcome this problem. In this paper, we demonstrate that this approach not only serves for the graph construction, but also represents an efficient and accurate alternative for out-of-sample embedding. Considering for a case study the Laplacian Eigenmaps, we applied our method to the face recognition problem. Experimental results conducted on some challenging datasets confirmed the robustness of our approach and its superiority when compared to existing techniques.
Address
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-34165-6	Medium
Area		Expedition		Conference	SSPR&SPR
Notes	OR;MV			Approved	no
Call Number	Admin @ si @ RaD2012c			Serial	2175
Permanent link to this record



Author	Karel Paleček; David Geronimo; Frederic Lerasle
Title	Pre-attention cues for person detection			Type	Conference Article
Year	2012	Publication	Cognitive Behavioural Systems, COST 2102 International Training School	Abbreviated Journal
Volume		Issue		Pages	225-235
Keywords
Abstract	Current state-of-the-art person detectors have been proven reliable and achieve very good detection rates. However, the performance is often far from real time, which limits their use to low resolution images only. In this paper, we deal with candidate window generation problem for person detection, i.e. we want to reduce the computational complexity of a person detector by reducing the number of regions that has to be evaluated. We base our work on Alexe’s paper [1], which introduced several pre-attention cues for generic object detection. We evaluate these cues in the context of person detection and show that their performance degrades rapidly for scenes containing multiple objects of interest such as pictures from urban environment. We extend this set by new cues, which better suits our class-specific task. The cues are designed to be simple and efficient, so that they can be used in the pre-attention phase of a more complex sliding window based person detector.
Address	Dresden, Germany
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-34583-8	Medium
Area		Expedition		Conference	COST-TS
Notes	ADAS			Approved	no
Call Number	Admin @ si @ PGL2012			Serial	2148
Permanent link to this record



Author	Ernest Valveny; Oriol Ramos Terrades; Joan Mas; Marçal Rusiñol
Title	Interactive Document Retrieval and Classification.			Type	Book Chapter
Year	2013	Publication	Multimodal Interaction in Image and Video Applications	Abbreviated Journal
Volume	48	Issue		Pages	17-30
Keywords
Abstract	In this chapter we describe a system for document retrieval and classification following the interactive-predictive framework. In particular, the system addresses two different scenarios of document analysis: document classification based on visual appearance and logo detection. These two classical problems of document analysis are formulated following the interactive-predictive model, taking the user interaction into account to make easier the process of annotating and labelling the documents. A system implementing this model in a real scenario is presented and analyzed. This system also takes advantage of active learning techniques to speed up the task of labelling the documents.
Address
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor	Angel Sappa; Jordi Vitria
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1868-4394	ISBN	978-3-642-35931-6	Medium
Area		Expedition		Conference
Notes	DAG			Approved	no
Call Number	Admin @ si @ VRM2013			Serial	2341
Permanent link to this record



Author	Joost Van de Weijer; Fahad Shahbaz Khan; Marc Masana
Title	Interactive Visual and Semantic Image Retrieval			Type	Book Chapter
Year	2013	Publication	Multimodal Interaction in Image and Video Applications	Abbreviated Journal
Volume	48	Issue		Pages	31-35
Keywords
Abstract	One direct consequence of recent advances in digital visual data generation and the direct availability of this information through the World-Wide Web, is a urgent demand for efficient image retrieval systems. The objective of image retrieval is to allow users to efficiently browse through this abundance of images. Due to the non-expert nature of the majority of the internet users, such systems should be user friendly, and therefore avoid complex user interfaces. In this chapter we investigate how high-level information provided by recently developed object recognition techniques can improve interactive image retrieval. Wel apply a bagof- word based image representation method to automatically classify images in a number of categories. These additional labels are then applied to improve the image retrieval system. Next to these high-level semantic labels, we also apply a low-level image description to describe the composition and color scheme of the scene. Both descriptions are incorporated in a user feedback image retrieval setting. The main objective is to show that automatic labeling of images with semantic labels can improve image retrieval results.
Address
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor	Angel Sappa; Jordi Vitria
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1868-4394	ISBN	978-3-642-35931-6	Medium
Area		Expedition		Conference
Notes	CIC; 605.203; 600.048			Approved	no
Call Number	Admin @ si @ WKC2013			Serial	2284
Permanent link to this record



Author	Abel Gonzalez-Garcia; Robert Benavente; Olivier Penacchio; Javier Vazquez; Maria Vanrell; C. Alejandro Parraga
Title	Coloresia: An Interactive Colour Perception Device for the Visually Impaired			Type	Book Chapter
Year	2013	Publication	Multimodal Interaction in Image and Video Applications	Abbreviated Journal
Volume	48	Issue		Pages	47-66
Keywords
Abstract	A significative percentage of the human population suffer from impairments in their capacity to distinguish or even see colours. For them, everyday tasks like navigating through a train or metro network map becomes demanding. We present a novel technique for extracting colour information from everyday natural stimuli and presenting it to visually impaired users as pleasant, non-invasive sound. This technique was implemented inside a Personal Digital Assistant (PDA) portable device. In this implementation, colour information is extracted from the input image and categorised according to how human observers segment the colour space. This information is subsequently converted into sound and sent to the user via speakers or headphones. In the original implementation, it is possible for the user to send its feedback to reconfigure the system, however several features such as these were not implemented because the current technology is limited.We are confident that the full implementation will be possible in the near future as PDA technology improves.
Address
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1868-4394	ISBN	978-3-642-35931-6	Medium
Area		Expedition		Conference
Notes	CIC; 600.052; 605.203			Approved	no
Call Number	Admin @ si @ GBP2013			Serial	2266
Permanent link to this record