Publicacions CVC -- Query Results

<< 1 2 3 4 5 6 7 8 9 10 >> [11–14]

Details

Records
Author	Hamdi Dibeklioglu; Theo Gevers; Albert Ali Salah
Title	Are You Really Smiling at Me? Spontaneous versus Posed Enjoyment Smiles			Type	Conference Article
Year	2012	Publication	12th European Conference on Computer Vision	Abbreviated Journal
Volume	7574	Issue	III	Pages	525-538
Keywords
Abstract	Smiling is an indispensable element of nonverbal social interaction. Besides, automatic distinction between spontaneous and posed expressions is important for visual analysis of social signals. Therefore, in this paper, we propose a method to distinguish between spontaneous and posed enjoyment smiles by using the dynamics of eyelid, cheek, and lip corner movements. The discriminative power of these movements, and the effect of different fusion levels are investigated on multiple databases. Our results improve the state-of-the-art. We also introduce the largest spontaneous/posed enjoyment smile database collected to date, and report new empirical and conceptual findings on smile dynamics. The collected database consists of 1240 samples of 400 subjects. Moreover, it has the unique property of having an age range from 8 to 76 years. Large scale experiments on the new database indicate that eyelid dynamics are highly relevant for smile classification, and there are age-related differences in smile dynamics.
Address	Florence, Italy
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-33711-6	Medium
Area		Expedition		Conference	ECCV
Notes	ALTRES;ISE			Approved	no
Call Number	Admin @ si @ DGS2012			Serial	2024
Permanent link to this record



Author	Marçal Rusiñol; Dimosthenis Karatzas; Andrew Bagdanov; Josep Llados
Title	Multipage Document Retrieval by Textual and Visual Representations			Type	Conference Article
Year	2012	Publication	21st International Conference on Pattern Recognition	Abbreviated Journal
Volume		Issue		Pages	521-524
Keywords
Abstract	In this paper we present a multipage administrative document image retrieval system based on textual and visual representations of document pages. Individual pages are represented by textual or visual information using a bag-of-words framework. Different fusion strategies are evaluated which allow the system to perform multipage document retrieval on the basis of a single page retrieval system. Results are reported on a large dataset of document images sampled from a banking workflow.
Address	Tsukuba Science City, Japan
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1051-4651	ISBN	978-1-4673-2216-4	Medium
Area		Expedition		Conference	ICPR
Notes	DAG			Approved	no
Call Number	Admin @ si @ RKB2012			Serial	2053
Permanent link to this record



Author	Mohammad Ali Bagheri; Qigang Gao; Sergio Escalera
Title	Error Correcting Output Codes for multiclass classification: Application to two image vision problems			Type	Conference Article
Year	2012	Publication	16th symposium on Artificial Intelligence & Signal Processing	Abbreviated Journal
Volume		Issue		Pages	508-513
Keywords
Abstract	Error-correcting output codes (ECOC) represents a powerful framework to deal with multiclass classification problems based on combining binary classifiers. The key factor affecting the performance of ECOC methods is the independence of binary classifiers, without which the ECOC method would be ineffective. In spite of its ability on classification of problems with relatively large number of classes, it has been applied in few real world problems. In this paper, we investigate the behavior of the ECOC approach on two image vision problems: logo recognition and shape classification using Decision Tree and AdaBoost as the base learners. The results show that the ECOC method can be used to improve the classification performance in comparison with the classical multiclass approaches.
Address	Shiraz, Iran
Corporate Author				Thesis
Publisher	IEEE Xplore	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-4673-1478-7	Medium
Area		Expedition		Conference	AISP
Notes	HuPBA;MILAB			Approved	no
Call Number	Admin @ si @ BGE2012b			Serial	2042
Permanent link to this record



Author	Noha Elfiky; Jordi Gonzalez; Xavier Roca
Title	Compact and Adaptive Spatial Pyramids for Scene Recognition			Type	Journal Article
Year	2012	Publication	Image and Vision Computing	Abbreviated Journal	IMAVIS
Volume	30	Issue	8	Pages	492–500
Keywords
Abstract	Most successful approaches on scenerecognition tend to efficiently combine global image features with spatial local appearance and shape cues. On the other hand, less attention has been devoted for studying spatial texture features within scenes. Our method is based on the insight that scenes can be seen as a composition of micro-texture patterns. This paper analyzes the role of texture along with its spatial layout for scenerecognition. However, one main drawback of the resulting spatial representation is its huge dimensionality. Hence, we propose a technique that addresses this problem by presenting a compactSpatialPyramid (SP) representation. The basis of our compact representation, namely, CompactAdaptiveSpatialPyramid (CASP) consists of a two-stages compression strategy. This strategy is based on the Agglomerative Information Bottleneck (AIB) theory for (i) compressing the least informative SP features, and, (ii) automatically learning the most appropriate shape for each category. Our method exceeds the state-of-the-art results on several challenging scenerecognition data sets.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ISE			Approved	no
Call Number	Admin @ si @ EGR2012			Serial	2004
Permanent link to this record



Author	Pedro Martins; Carlo Gatta; Paulo Carvalho
Title	Feature-driven Maximally Stable Extremal Regions			Type	Conference Article
Year	2012	Publication	7th International Conference on Computer Vision Theory and Applications	Abbreviated Journal
Volume		Issue		Pages	490-497
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	VISAPP
Notes	MILAB			Approved	no
Call Number	Admin @ si @ MGC2012			Serial	2139
Permanent link to this record



Author	Monica Piñol; Angel Sappa; Ricardo Toledo
Title	MultiTable Reinforcement for Visual Object Recognition			Type	Conference Article
Year	2012	Publication	4th International Conference on Signal and Image Processing	Abbreviated Journal
Volume	221	Issue		Pages	469-480
Keywords
Abstract	This paper presents a bag of feature based method for visual object recognition. Our contribution is focussed on the selection of the best feature descriptor. It is implemented by using a novel multi-table reinforcement learning method that selects among five of classical descriptors (i.e., Spin, SIFT, SURF, C-SIFT and PHOW) the one that best describes each image. Experimental results and comparisons are provided showing the improvements achieved with the proposed approach.
Address	Coimbatore, India
Corporate Author				Thesis
Publisher	Springer India	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	1876-1100	ISBN	978-81-322-0996-6	Medium
Area		Expedition		Conference	ICSIP
Notes	ADAS			Approved	no
Call Number	Admin @ si @ PST2012			Serial	2157
Permanent link to this record



Author	David Geronimo; Frederic Lerasle; Antonio Lopez
Title	State-driven particle filter for multi-person tracking			Type	Conference Article
Year	2012	Publication	11th International Conference on Advanced Concepts for Intelligent Vision Systems	Abbreviated Journal
Volume	7517	Issue		Pages	467-478
Keywords	human tracking
Abstract	Multi-person tracking can be exploited in applications such as driver assistance, surveillance, multimedia and human-robot interaction. With the help of human detectors, particle filters offer a robust method able to filter noisy detections and provide temporal coherence. However, some traditional problems such as occlusions with other targets or the scene, temporal drifting or even the lost targets detection are rarely considered, making the systems performance decrease. Some authors propose to overcome these problems using heuristics not explained and formalized in the papers, for instance by defining exceptions to the model updating depending on tracks overlapping. In this paper we propose to formalize these events by the use of a state-graph, defining the current state of the track (e.g., potential , tracked, occluded or lost) and the transitions between states in an explicit way. This approach has the advantage of linking track actions such as the online underlying models updating, which gives flexibility to the system. It provides an explicit representation to adapt the multiple parallel trackers depending on the context, i.e., each track can make use of a specific filtering strategy, dynamic model, number of particles, etc. depending on its state. We implement this technique in a single-camera multi-person tracker and test it in public video sequences.
Address	Brno, Chzech Republic
Corporate Author				Thesis
Publisher	Springer	Place of Publication	Heidelberg	Editor	J. Blanc-Talon et al.
Language	English	Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ACIVS
Notes	ADAS			Approved	yes
Call Number	GLL2012; ADAS @ adas @ gll2012a			Serial	1990
Permanent link to this record



Author	Bogdan Raducanu; Fadi Dornaika
Title	Appearance-based Face Recognition Using A Supervised Manifold Learning Framework			Type	Conference Article
Year	2012	Publication	IEEE Workshop on the Applications of Computer Vision	Abbreviated Journal
Volume		Issue		Pages	465-470
Keywords
Abstract	Many natural image sets, depicting objects whose appearance is changing due to motion, pose or light variations, can be considered samples of a low-dimension nonlinear manifold embedded in the high-dimensional observation space (the space of all possible images). The main contribution of our work is represented by a Supervised Laplacian Eigemaps (S-LE) algorithm, which exploits the class label information for mapping the original data in the embedded space. Our proposed approach benefits from two important properties: i) it is discriminative, and ii) it adaptively selects the neighbors of a sample without using any predefined neighborhood size. Experiments were conducted on four face databases and the results demonstrate that the proposed algorithm significantly outperforms many linear and non-linear embedding techniques. Although we've focused on the face recognition problem, the proposed approach could also be extended to other category of objects characterized by large variance in their appearance.
Address	Breckenridge; CO; USA
Corporate Author				Thesis
Publisher	IEEE Xplore	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1550-5790	ISBN	978-1-4673-0233-3	Medium
Area		Expedition		Conference	WACV
Notes	OR;MV			Approved	no
Call Number	Admin @ si @ RaD2012d			Serial	1890
Permanent link to this record



Author	Jon Almazan; David Fernandez; Alicia Fornes; Josep Llados; Ernest Valveny
Title	A Coarse-to-Fine Approach for Handwritten Word Spotting in Large Scale Historical Documents Collection			Type	Conference Article
Year	2012	Publication	13th International Conference on Frontiers in Handwriting Recognition	Abbreviated Journal
Volume		Issue		Pages	453-458
Keywords
Abstract	In this paper we propose an approach for word spotting in handwritten document images. We state the problem from a focused retrieval perspective, i.e. locating instances of a query word in a large scale dataset of digitized manuscripts. We combine two approaches, namely one based on word segmentation and another one segmentation-free. The first approach uses a hashing strategy to coarsely prune word images that are unlikely to be instances of the query word. This process is fast but has a low precision due to the errors introduced in the segmentation step. The regions containing candidate words are sent to the second process based on a state of the art technique from the visual object detection field. This discriminative model represents the appearance of the query word and computes a similarity score. In this way we propose a coarse-to-fine approach achieving a compromise between efficiency and accuracy. The validation of the model is shown using a collection of old handwritten manuscripts. We appreciate a substantial improvement in terms of precision regarding the previous proposed method with a low computational cost increase.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-4673-2262-1	Medium
Area		Expedition		Conference	ICFHR
Notes	DAG			Approved	no
Call Number	DAG @ dag @ AFF2012			Serial	1983
Permanent link to this record



Author	Fernando Barrera; Felipe Lumbreras; Angel Sappa
Title	Multimodal Stereo Vision System: 3D Data Extraction and Algorithm Evaluation			Type	Journal Article
Year	2012	Publication	IEEE Journal of Selected Topics in Signal Processing	Abbreviated Journal	J-STSP
Volume	6	Issue	5	Pages	437-446
Keywords
Abstract	This paper proposes an imaging system for computing sparse depth maps from multispectral images. A special stereo head consisting of an infrared and a color camera defines the proposed multimodal acquisition system. The cameras are rigidly attached so that their image planes are parallel. Details about the calibration and image rectification procedure are provided. Sparse disparity maps are obtained by the combined use of mutual information enriched with gradient information. The proposed approach is evaluated using a Receiver Operating Characteristics curve. Furthermore, a multispectral dataset, color and infrared images, together with their corresponding ground truth disparity maps, is generated and used as a test bed. Experimental results in real outdoor scenarios are provided showing its viability and that the proposed approach is not restricted to a specific domain.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1932-4553	ISBN		Medium
Area		Expedition		Conference
Notes	ADAS			Approved	no
Call Number	Admin @ si @ BLS2012b			Serial	2155
Permanent link to this record



Author	Diego Cheda; Daniel Ponsa; Antonio Lopez
Title	Monocular Egomotion Estimation based on Image Matching			Type	Conference Article
Year	2012	Publication	1st International Conference on Pattern Recognition Applications and Methods	Abbreviated Journal
Volume		Issue		Pages	425-430
Keywords	SLAM
Abstract
Address	Portugal
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICPRAM
Notes	ADAS			Approved	no
Call Number	Admin @ si @ CPL2012a;; ADAS @ adas @			Serial	2011
Permanent link to this record



Author	Bhaskar Chakraborty; Michael Holte; Thomas B. Moeslund; Jordi Gonzalez
Title	Selective Spatio-Temporal Interest Points			Type	Journal Article
Year	2012	Publication	Computer Vision and Image Understanding	Abbreviated Journal	CVIU
Volume	116	Issue	3	Pages	396-410
Keywords
Abstract	Recent progress in the field of human action recognition points towards the use of Spatio-TemporalInterestPoints (STIPs) for local descriptor-based recognition strategies. In this paper, we present a novel approach for robust and selective STIP detection, by applying surround suppression combined with local and temporal constraints. This new method is significantly different from existing STIP detection techniques and improves the performance by detecting more repeatable, stable and distinctive STIPs for human actors, while suppressing unwanted background STIPs. For action representation we use a bag-of-video words (BoV) model of local N-jet features to build a vocabulary of visual-words. To this end, we introduce a novel vocabulary building strategy by combining spatial pyramid and vocabulary compression techniques, resulting in improved performance and efficiency. Action class specific Support Vector Machine (SVM) classifiers are trained for categorization of human actions. A comprehensive set of experiments on popular benchmark datasets (KTH and Weizmann), more challenging datasets of complex scenes with background clutter and camera motion (CVC and CMU), movie and YouTube video clips (Hollywood 2 and YouTube), and complex scenes with multiple actors (MSR I and Multi-KTH), validates our approach and show state-of-the-art performance. Due to the unavailability of ground truth action annotation data for the Multi-KTH dataset, we introduce an actor specific spatio-temporal clustering of STIPs to address the problem of automatic action annotation of multiple simultaneous actors. Additionally, we perform cross-data action recognition by training on source datasets (KTH and Weizmann) and testing on completely different and more challenging target datasets (CVC, CMU, MSR I and Multi-KTH). This documents the robustness of our proposed approach in the realistic scenario, using separate training and test datasets, which in general has been a shortcoming in the performance evaluation of human action recognition techniques.
Address
Corporate Author				Thesis
Publisher	Elsevier	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1077-3142	ISBN		Medium
Area		Expedition		Conference
Notes	ISE			Approved	no
Call Number	Admin @ si @ CHM2012			Serial	1806
Permanent link to this record



Author	Fadi Dornaika; Alireza Bosaghzadeh; Bogdan Raducanu
Title	LSDA Solution Schemes for Modelless 3D Head Pose Estimation			Type	Conference Article
Year	2012	Publication	IEEE Workshop on the Applications of Computer Vision	Abbreviated Journal
Volume		Issue		Pages	393-398
Keywords
Abstract
Address	Breckenridge; USA;
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	WACV
Notes	OR;MV			Approved	no
Call Number	Admin @ si @ DBR2012			Serial	1889
Permanent link to this record



Author	Jose Manuel Alvarez; Theo Gevers; Y. LeCun; Antonio Lopez
Title	Road Scene Segmentation from a Single Image			Type	Conference Article
Year	2012	Publication	12th European Conference on Computer Vision	Abbreviated Journal
Volume	7578	Issue	VII	Pages	376-389
Keywords	road detection
Abstract	Road scene segmentation is important in computer vision for different applications such as autonomous driving and pedestrian detection. Recovering the 3D structure of road scenes provides relevant contextual information to improve their understanding. In this paper, we use a convolutional neural network based algorithm to learn features from noisy labels to recover the 3D scene layout of a road image. The novelty of the algorithm relies on generating training labels by applying an algorithm trained on a general image dataset to classify on–board images. Further, we propose a novel texture descriptor based on a learned color plane fusion to obtain maximal uniformity in road areas. Finally, acquired (off–line) and current (on–line) information are combined to detect road areas in single images. From quantitative and qualitative experiments, conducted on publicly available datasets, it is concluded that convolutional neural networks are suitable for learning 3D scene layout from noisy labels and provides a relative improvement of 7% compared to the baseline. Furthermore, combining color planes provides a statistical description of road areas that exhibits maximal uniformity and provides a relative improvement of 8% compared to the baseline. Finally, the improvement is even bigger when acquired and current information from a single image are combined
Address	Florence, Italy
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-33785-7	Medium
Area		Expedition		Conference	ECCV
Notes	ADAS;ISE			Approved	no
Call Number	Admin @ si @ AGL2012; ADAS @ adas @ agl2012a			Serial	2022
Permanent link to this record



Author	Ekaterina Zaytseva; Santiago Segui; Jordi Vitria
Title	Sketchable Histograms of Oriented Gradients for Object Detection			Type	Conference Article
Year	2012	Publication	17th Iberomerican Conference on Pattern Recognition	Abbreviated Journal
Volume	7441	Issue		Pages	374-381
Keywords
Abstract	In this paper we investigate a new representation approach for visual object recognition. The new representation, called sketchable-HoG, extends the classical histogram of oriented gradients (HoG) feature by adding two different aspects: the stability of the majority orientation and the continuity of gradient orientations. In this way, the sketchable-HoG locally characterizes the complexity of an object model and introduces global structure information while still keeping simplicity, compactness and robustness. We evaluated the proposed image descriptor on publicly Catltech 101 dataset. The obtained results outperforms classical HoG descriptor as well as other reported descriptors in the literature.
Address	Buenos Aires, Argentina
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-33274-6	Medium
Area		Expedition		Conference	CIARP
Notes	OR; MILAB;MV			Approved	no
Call Number	Admin @ si @ ZSV2012			Serial	2048
Permanent link to this record