Publicacions CVC -- Query Results

Records
Author	Marc Oliu; Ciprian Corneanu; Laszlo A. Jeni; Jeffrey F. Cohn; Takeo Kanade; Sergio Escalera
Title	Continuous Supervised Descent Method for Facial Landmark Localisation			Type	Conference Article
Year	2016	Publication	13th Asian Conference on Computer Vision	Abbreviated Journal
Volume	10112	Issue		Pages	121-135
Keywords
Abstract	Recent methods for facial landmark localisation perform well on close-to-frontal faces but have problems generalising to large head rotations. To address this issue, we propose a second-order linear regression method that is both compact and robust against strong rotations. We provide a closed-form solution, making the method fast to train. We test the method’s performance on two challenging datasets. The first has been used intensively by the community. The second has been specially generated from a well-known 3D face dataset. It is considerably more challenging, including a high diversity of rotations and more samples than any other existing public dataset. The proposed method is compared against state-of-the-art approaches, including RCPR, CGPRT, LBF, CFSS, and GSDM. Results on both datasets show that the proposed method offers state-of-the-art performance on near-frontal view data, improves on state-of-the-art methods for more challenging head rotation problems, and keeps a compact model size.
Address	Taipei; Taiwan; November 2016
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ACCV
Notes	HuPBA;MILAB;			Approved	no
Call Number	Admin @ si @ OCJ2016			Serial	2838



Author	Aura Hernandez-Sabate; Lluis Albarracin; Daniel Calvo; Nuria Gorgorio
Title	EyeMath: Identifying Mathematics Problem Solving Processes in a RTS Video Game			Type	Conference Article
Year	2016	Publication	5th International Conference Games and Learning Alliance	Abbreviated Journal
Volume	10056	Issue		Pages	50-59
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	GALA
Notes	ADAS;IAM;			Approved	no
Call Number	HAC2016			Serial	2864



Author	Juan Ignacio Toledo; Sebastian Sudholt; Alicia Fornes; Jordi Cucurull; A. Fink; Josep Llados
Title	Handwritten Word Image Categorization with Convolutional Neural Networks and Spatial Pyramid Pooling			Type	Conference Article
Year	2016	Publication	Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR)	Abbreviated Journal
Volume	10029	Issue		Pages	543-552
Keywords	Document image analysis; Word image categorization; Convolutional neural networks; Named entity detection
Abstract	The extraction of relevant information from historical document collections is one of the key steps in order to make these documents available for access and searches. The usual approach combines transcription and grammars in order to extract semantically meaningful entities. In this paper, we describe a new method to obtain word categories directly from non-preprocessed handwritten word images. The method can be used to directly extract information, being an alternative to the transcription. Thus it can be used as a first step in any kind of syntactical analysis. The approach is based on Convolutional Neural Networks with a Spatial Pyramid Pooling layer to deal with the different shapes of the input images. We performed the experiments on a historical marriage record dataset, obtaining promising results.
Address	Merida; Mexico; December 2016
Corporate Author				Thesis
Publisher	Springer International Publishing	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN	978-3-319-49054-0	Medium
Area		Expedition		Conference	S+SSPR
Notes	DAG; 600.097; 602.006			Approved	no
Call Number	Admin @ si @ TSF2016			Serial	2877



Author	Saad Minhas; Aura Hernandez-Sabate; Shoaib Ehsan; Katerine Diaz; Ales Leonardis; Antonio Lopez; Klaus McDonald Maier
Title	LEE: A photorealistic Virtual Environment for Assessing Driver-Vehicle Interactions in Self-Driving Mode			Type	Conference Article
Year	2016	Publication	14th European Conference on Computer Vision Workshops	Abbreviated Journal
Volume	9915	Issue		Pages	894-900
Keywords	Simulation environment; Automated Driving; Driver-Vehicle interaction
Abstract	Photorealistic virtual environments are crucial for developing and testing automated driving systems in a safe way during trials. As commercially available simulators are expensive and bulky, this paper presents a low-cost, extendable, and easy-to-use (LEE) virtual environment with the aim to highlight its utility for level 3 driving automation. In particular, an experiment is performed using the presented simulator to explore the influence of different variables regarding control transfer of the car after the system was driving autonomously in a highway scenario. The results show that the speed of the car at the time when the system needs to transfer the control to the human driver is critical.
Address	Amsterdam; The Netherlands; October 2016
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ECCVW
Notes	ADAS;IAM; 600.085; 600.076			Approved	no
Call Number	MHE2016			Serial	2865



Author	Maedeh Aghaei; Mariella Dimiccoli; Petia Radeva
Title	Towards social interaction detection in egocentric photo-streams			Type	Conference Article
Year	2015	Publication	Proceedings of SPIE, 8th International Conference on Machine Vision, ICMV 2015	Abbreviated Journal
Volume	9875	Issue		Pages
Keywords
Abstract	Detecting social interaction in videos relying solely on visual cues is a valuable task that has received increasing attention in recent years. In this work, we address this problem in the challenging domain of egocentric photo-streams captured by a low temporal resolution wearable camera (2fpm). The major difficulties to be handled in this context are the sparsity of observations as well as the unpredictability of camera motion and attention orientation, since the camera is worn as part of clothing. Our method consists of four steps: multi-face localization and tracking, 3D localization, pose estimation and analysis of f-formations. By estimating pair-to-pair interaction probabilities over the sequence, our method determines the presence or absence of interaction with the camera wearer and specifies which people are more involved in the interaction. We tested our method on a dataset of 18,000 images and show its reliability for this purpose.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICMV
Notes	MILAB			Approved	no
Call Number	Admin @ si @ ADR2015a			Serial	2702



Author	Mohammad Ali Bagheri; Qigang Gao; Sergio Escalera
Title	Action Recognition by Pairwise Proximity Function Support Vector Machines with Dynamic Time Warping Kernels			Type	Conference Article
Year	2016	Publication	29th Canadian Conference on Artificial Intelligence	Abbreviated Journal
Volume	9673	Issue		Pages	3-14
Keywords
Abstract	In the context of human action recognition using skeleton data, the 3D trajectories of joint points may be considered as multi-dimensional time series. The traditional recognition technique in the literature is based on time series (dis)similarity measures (such as Dynamic Time Warping). For these general (dis)similarity measures, k-nearest neighbor algorithms are a natural choice. However, k-NN classifiers are known to be sensitive to noise and outliers. In this paper, a new class of Support Vector Machine that is applicable to trajectory classification, such as action recognition, is developed by incorporating an efficient time-series distance measure into the kernel function. More specifically, the derivative of the Dynamic Time Warping (DTW) distance measure is employed as the SVM kernel. In addition, the pairwise proximity learning strategy is utilized in order to make use of non-positive semi-definite (PSD) kernels in the SVM formulation. The recognition results of the proposed technique on two action recognition datasets demonstrate that our methodology outperforms state-of-the-art methods. Remarkably, we obtained 89% accuracy on the well-known MSRAction3D dataset using only 3D trajectories of body joints obtained by Kinect.
Address	Victoria; Canada; May 2016
Corporate Author				Thesis
Publisher	Springer International Publishing	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	AI
Notes	HuPBA;MILAB;			Approved	no
Call Number	Admin @ si @ BGE2016b			Serial	2770



Author	Aleksandr Setkov; Fabio Martinez Carillo; Michele Gouiffes; Christian Jacquemin; Maria Vanrell; Ramon Baldrich
Title	DAcImPro: A Novel Database of Acquired Image Projections and Its Application to Object Recognition			Type	Conference Article
Year	2015	Publication	Advances in Visual Computing. Proceedings of 11th International Symposium, ISVC 2015 Part II	Abbreviated Journal
Volume	9475	Issue		Pages	463-473
Keywords	Projector-camera systems; Feature descriptors; Object recognition
Abstract	Projector-camera systems are designed to improve the projection quality by comparing original images with their captured projections, which is usually complicated due to high photometric and geometric variations. Many research works address this problem using their own test data, which makes it extremely difficult to compare different proposals. This paper has two main contributions. Firstly, we introduce a new database of acquired image projections (DAcImPro) that, covering photometric and geometric conditions and providing data for ground-truth computation, can serve to evaluate different algorithms in projector-camera systems. Secondly, a new object recognition scenario from acquired projections is presented, which could be of great interest in domains such as home video projections and public presentations. We show that the task is more challenging than the classical recognition problem and thus requires additional pre-processing, such as color compensation or projection area selection.
Address
Corporate Author				Thesis
Publisher	Springer International Publishing	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-319-27862-9	Medium
Area		Expedition		Conference	ISVC
Notes	CIC			Approved	no
Call Number	Admin @ si @ SMG2015			Serial	2736



Author	Marçal Rusiñol; Dimosthenis Karatzas; Josep Llados
Title	Automatic Verification of Properly Signed Multi-page Document Images			Type	Conference Article
Year	2015	Publication	Proceedings of the Eleventh International Symposium on Visual Computing	Abbreviated Journal
Volume	9475	Issue		Pages	327-336
Keywords	Document Image; Manual Inspection; Signature Verification; Rejection Criterion; Document Flow
Abstract	In this paper we present an industrial application for the automatic screening of incoming multi-page documents in a banking workflow, aimed at determining whether these documents are properly signed or not. The proposed method is divided into three main steps. First, individual pages are classified in order to identify the pages that should contain a signature. In a second step, we segment within those key pages the location where the signatures should appear. The last step checks whether the signatures are present or not. Our method is tested in a real large-scale environment and we report the results when checking two different types of real multi-page contracts, totalling more than 14,500 pages.
Address	Las Vegas, Nevada, USA; December 2015
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume	9475	Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ISVC
Notes	DAG; 600.077			Approved	no
Call Number	Admin @ si @			Serial	3189



Author	Carles Sanchez; Debora Gil; Jorge Bernal; F. Javier Sanchez; Marta Diez-Ferrer; Antoni Rosell
Title	Navigation Path Retrieval from Videobronchoscopy using Bronchial Branches			Type	Conference Article
Year	2016	Publication	19th International Conference on Medical Image Computing and Computer Assisted Intervention Workshops	Abbreviated Journal
Volume	9401	Issue		Pages	62-70
Keywords	Bronchoscopy navigation; Lumen center; Bronchial branches; Navigation path; Videobronchoscopy
Abstract	Bronchoscopy biopsy can be used to diagnose lung cancer without risking the complications of other interventions like transthoracic needle aspiration. During bronchoscopy, the clinician has to navigate through the bronchial tree to the target lesion. A main drawback is the difficulty of checking whether the exploration is following the correct path. The usual guidance using fluoroscopy implies repeated radiation of the clinician, while alternative systems (like electromagnetic navigation) require specific equipment that increases intervention costs. We propose to compute the navigated path using anatomical landmarks extracted from the sole analysis of videobronchoscopy images. Such landmarks allow matching the current exploration to the path previously planned on a CT to indicate to the clinician whether the planning is being correctly followed or not. We present a feasibility study of our landmark-based CT-video matching using bronchoscopic videos simulated on a virtual bronchoscopy interactive interface.
Address	Quebec; Canada; September 2016
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	MICCAIW
Notes	IAM; MV; 600.060; 600.075			Approved	no
Call Number	Admin @ si @ SGB2016			Serial	2885



Author	Dennis G. Romero; Anselmo Frizera; Angel Sappa; Boris X. Vintimilla; Teodiano F. Bastos
Title	A predictive model for human activity recognition by observing actions and context			Type	Conference Article
Year	2015	Publication	Advanced Concepts for Intelligent Vision Systems, Proceedings of 16th International Conference, ACIVS 2015	Abbreviated Journal
Volume	9386	Issue		Pages	323-333
Keywords
Abstract	This paper presents a novel model to estimate human activities, where a human activity is defined by a set of human actions. The proposed approach is based on the usage of Recurrent Neural Networks (RNN) and Bayesian inference through the continuous monitoring of human actions and their surrounding environment. In the current work human activities are inferred considering not only visual analysis but also additional resources; external sources of information, such as context information, are incorporated to contribute to the activity estimation. The novelty of the proposed approach lies in the way the information is encoded, so that it can be later associated according to a predefined semantic structure. Hence, a pattern representing a given activity can be defined by a set of actions, plus contextual information or other kinds of information that could be relevant to describe the activity. Experimental results with real data are provided showing the validity of the proposed approach.
Address	Catania; Italy; October 2015
Corporate Author				Thesis
Publisher	Springer International Publishing	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-319-25902-4	Medium
Area		Expedition		Conference	ACIVS
Notes	ADAS; 600.076			Approved	no
Call Number	Admin @ si @ RFS2015			Serial	2661



Author	Fahad Shahbaz Khan; Muhammad Anwer Rao; Joost Van de Weijer; Michael Felsberg; J. Laaksonen
Title	Deep semantic pyramids for human attributes and action recognition			Type	Conference Article
Year	2015	Publication	Image Analysis, Proceedings of 19th Scandinavian Conference, SCIA 2015	Abbreviated Journal
Volume	9127	Issue		Pages	341-353
Keywords	Action recognition; Human attributes; Semantic pyramids
Abstract	Describing persons and their actions is a challenging problem due to variations in pose, scale and viewpoint in real-world images. Recently, the semantic pyramids approach [1] for pose normalization has been shown to provide excellent results for gender and action recognition. The performance of the semantic pyramids approach relies on robust image description and is therefore limited by the use of shallow local features. In the context of object recognition [2] and object detection [3], convolutional neural networks (CNNs) or deep features have been shown to improve performance over conventional shallow features. We propose deep semantic pyramids for human attributes and action recognition. The method works by constructing spatial pyramids based on CNNs of different part locations. These pyramids are then combined to obtain a single semantic representation. We validate our approach on the Berkeley and 27 Human Attributes datasets for attributes classification. For action recognition, we perform experiments on two challenging datasets: Willow and PASCAL VOC 2010. The proposed deep semantic pyramids provide a significant gain of 17.2%, 13.9%, 24.3% and 22.6% compared to the standard shallow semantic pyramids on the Berkeley, 27 Human Attributes, Willow and PASCAL VOC 2010 datasets respectively. Our results also show that deep semantic pyramids outperform conventional CNNs based on the full bounding box of the person. Finally, we compare our approach with state-of-the-art methods and show a gain in performance compared to the best methods in the literature.
Address	Copenhagen; Denmark; June 2015
Corporate Author				Thesis
Publisher	Springer International Publishing	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-319-19664-0	Medium
Area		Expedition		Conference	SCIA
Notes	LAMP; 600.068; 600.079;ADAS			Approved	no
Call Number	Admin @ si @ KRW2015b			Serial	2672



Author	Alejandro Gonzalez Alzate; Gabriel Villalonga; German Ros; David Vazquez; Antonio Lopez
Title	3D-Guided Multiscale Sliding Window for Pedestrian Detection			Type	Conference Article
Year	2015	Publication	Pattern Recognition and Image Analysis, Proceedings of 7th Iberian Conference, IbPRIA 2015	Abbreviated Journal
Volume	9117	Issue		Pages	560-568
Keywords	Pedestrian Detection
Abstract	The most relevant modules of a pedestrian detector are the candidate generation and the candidate classification. The former aims at presenting image windows to the latter so that they are classified as containing a pedestrian or not. Much attention has been paid to the classification module, while candidate generation has mainly relied on a (multiscale) sliding window pyramid. However, candidate generation is critical for achieving real-time performance. In this paper we assume a context of autonomous driving based on stereo vision. Accordingly, we evaluate the effect of taking into account the 3D information (derived from the stereo) in order to prune the hundreds of thousands of windows per image generated by the classical pyramidal sliding window. For our study we use a multimodal (RGB, disparity) and multi-descriptor (HOG, LBP, HOG+LBP) holistic ensemble based on linear SVM. Evaluation on data from the challenging KITTI benchmark suite shows the effectiveness of using 3D information to dramatically reduce the number of candidate windows, even improving the overall pedestrian detection accuracy.
Address	Santiago de Compostela; Spain; June 2015
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area	ACDC	Expedition		Conference	IbPRIA
Notes	ADAS; 600.076; 600.057; 600.054			Approved	no
Call Number	ADAS @ adas @ GVR2015			Serial	2585



Author	Marc Bolaños; Maite Garolera; Petia Radeva
Title	Object Discovery using CNN Features in Egocentric Videos			Type	Conference Article
Year	2015	Publication	Pattern Recognition and Image Analysis, Proceedings of 7th Iberian Conference, IbPRIA 2015	Abbreviated Journal
Volume	9117	Issue		Pages	67-74
Keywords	Object discovery; Egocentric videos; Lifelogging; CNN
Abstract	Lifelogging devices based on photo/video are spreading faster every day. This growth offers great opportunities to develop methods for extracting meaningful information about the user wearing the device and his/her environment. In this paper, we propose a semi-supervised strategy for easily discovering objects relevant to the person wearing a first-person camera. Given the egocentric video sequence acquired by the camera, our method uses both the appearance extracted by means of a deep convolutional neural network and an object refill methodology that allows objects to be discovered even when they appear only rarely in the collection of images. We validate our method on a sequence of 1000 egocentric daily images and obtain an F-measure of 0.5, which is 0.17 better than the state-of-the-art approach.
Address	Santiago de Compostela; Spain; June 2015
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-319-19389-2	Medium
Area		Expedition		Conference	IbPRIA
Notes	MILAB			Approved	no
Call Number	Admin @ si @ BGR2015			Serial	2596



Author	Estefania Talavera; Mariella Dimiccoli; Marc Bolaños; Maedeh Aghaei; Petia Radeva
Title	R-clustering for egocentric video segmentation			Type	Conference Article
Year	2015	Publication	Pattern Recognition and Image Analysis, Proceedings of 7th Iberian Conference, IbPRIA 2015	Abbreviated Journal
Volume	9117	Issue		Pages	327-336
Keywords	Temporal video segmentation; Egocentric videos; Clustering
Abstract	In this paper, we present a new method for egocentric video temporal segmentation based on integrating a statistical mean change detector and agglomerative clustering (AC) within an energy-minimization framework. Given the tendency of most AC methods to oversegment video sequences when clustering their frames, we combine the clustering with a concept drift detection technique (ADWIN) that has rigorous performance guarantees. ADWIN serves as a statistical upper bound for the clustering-based video segmentation. We integrate both techniques in an energy-minimization framework that serves to disambiguate the decisions of both techniques and to complete the segmentation taking into account the temporal continuity of video frame descriptors. We present experiments over egocentric sets of more than 13,000 images acquired with different wearable cameras, showing that our method outperforms state-of-the-art clustering methods.
Address	Santiago de Compostela; Spain; June 2015
Corporate Author				Thesis
Publisher	Springer International Publishing	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-319-19389-2	Medium
Area		Expedition		Conference	IbPRIA
Notes	MILAB			Approved	no
Call Number	Admin @ si @ TDB2015			Serial	2597



Author	Onur Ferhat; Arcadi Llanza; Fernando Vilariño
Title	A Feature-Based Gaze Estimation Algorithm for Natural Light Scenarios			Type	Conference Article
Year	2015	Publication	Pattern Recognition and Image Analysis, Proceedings of 7th Iberian Conference, IbPRIA 2015	Abbreviated Journal
Volume	9117	Issue		Pages	569-576
Keywords	Eye tracking; Gaze estimation; Natural light; Webcam
Abstract	We present an eye tracking system that works with regular webcams. We base our work on the open-source CVC Eye Tracker [7] and propose a number of improvements and a novel gaze estimation method. The new method uses features extracted from iris segmentation and does not fall into the traditional categorization of appearance-based/model-based methods. Our experiments show that our approach reduces the gaze estimation errors by 34% in the horizontal direction and by 12% in the vertical direction compared to the baseline system.
Address	Santiago de Compostela; Spain; June 2015
Corporate Author				Thesis
Publisher	Springer International Publishing	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-319-19389-2	Medium
Area		Expedition		Conference	IbPRIA
Notes	MV;SIAI			Approved	no
Call Number	Admin @ si @ FLV2015a			Serial	2646