Publicacions CVC -- Query Results

[141–150] << 151 152 153 154 155 156 157 158 159 160 >> [161–170]

Details

Records
Author	Sergio Escalera; Xavier Baro; Jordi Gonzalez; Miguel Angel Bautista; Meysam Madadi; Miguel Reyes; Victor Ponce; Hugo Jair Escalante; Jaime Shotton; Isabelle Guyon
Title	ChaLearn Looking at People Challenge 2014: Dataset and Results			Type	Conference Article
Year	2014	Publication	ECCV Workshop on ChaLearn Looking at People	Abbreviated Journal
Volume	8925	Issue		Pages	459-473
Keywords	Human Pose Recovery; Behavior Analysis; Action and in- teractions; Multi-modal gestures; recognition
Abstract	This paper summarizes the ChaLearn Looking at People 2014 challenge data and the results obtained by the participants. The competition was split into three independent tracks: human pose recovery from RGB data, action and interaction recognition from RGB data sequences, and multi-modal gesture recognition from RGB-Depth sequences. For all the tracks, the goal was to perform user-independent recognition in sequences of continuous images using the overlapping Jaccard index as the evaluation measure. In this edition of the ChaLearn challenge, two large novel data sets were made publicly available and the Microsoft Codalab platform were used to manage the competition. Outstanding results were achieved in the three challenge tracks, with accuracy results of 0.20, 0.50, and 0.85 for pose recovery, action/interaction recognition, and multi-modal gesture recognition, respectively.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ECCVW
Notes	HuPBA; ISE; 600.063;MV			Approved	no
Call Number	Admin @ si @ EBG2014			Serial	2529
Permanent link to this record



Author	Francisco Cruz; Oriol Ramos Terrades
Title	EM-Based Layout Analysis Method for Structured Documents			Type	Conference Article
Year	2014	Publication	22nd International Conference on Pattern Recognition	Abbreviated Journal
Volume		Issue		Pages	315-320
Keywords
Abstract	In this paper we present a method to perform layout analysis in structured documents. We proposed an EM-based algorithm to fit a set of Gaussian mixtures to the different regions according to the logical distribution along the page. After the convergence, we estimate the final shape of the regions according to the parameters computed for each component of the mixture. We evaluated our method in the task of record detection in a collection of historical structured documents and performed a comparison with other previous works in this task.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1051-4651	ISBN		Medium
Area		Expedition		Conference	ICPR
Notes	DAG; 602.006; 600.061; 600.077			Approved	no
Call Number	Admin @ si @ CrR2014			Serial	2530
Permanent link to this record



Author	Miguel Oliveira; Victor Santos; Angel Sappa
Title	Multimodal Inverse Perspective Mapping			Type	Journal Article
Year	2015	Publication	Information Fusion	Abbreviated Journal	IF
Volume	24	Issue		Pages	108–121
Keywords	Inverse perspective mapping; Multimodal sensor fusion; Intelligent vehicles
Abstract	Over the past years, inverse perspective mapping has been successfully applied to several problems in the field of Intelligent Transportation Systems. In brief, the method consists of mapping images to a new coordinate system where perspective effects are removed. The removal of perspective associated effects facilitates road and obstacle detection and also assists in free space estimation. There is, however, a significant limitation in the inverse perspective mapping: the presence of obstacles on the road disrupts the effectiveness of the mapping. The current paper proposes a robust solution based on the use of multimodal sensor fusion. Data from a laser range finder is fused with images from the cameras, so that the mapping is not computed in the regions where obstacles are present. As shown in the results, this considerably improves the effectiveness of the algorithm and reduces computation time when compared with the classical inverse perspective mapping. Furthermore, the proposed approach is also able to cope with several cameras with different lenses or image resolutions, as well as dynamic viewpoints.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ADAS; 600.055; 600.076			Approved	no
Call Number	Admin @ si @ OSS2015c			Serial	2532
Permanent link to this record



Author	Mohammad Rouhani; E. Boyer; Angel Sappa
Title	Non-Rigid Registration meets Surface Reconstruction			Type	Conference Article
Year	2014	Publication	International Conference on 3D Vision	Abbreviated Journal
Volume		Issue		Pages	617-624
Keywords
Abstract	Non rigid registration is an important task in computer vision with many applications in shape and motion modeling. A fundamental step of the registration is the data association between the source and the target sets. Such association proves difficult in practice, due to the discrete nature of the information and its corruption by various types of noise, e.g. outliers and missing data. In this paper we investigate the benefit of the implicit representations for the non-rigid registration of 3D point clouds. First, the target points are described with small quadratic patches that are blended through partition of unity weighting. Then, the discrete association between the source and the target can be replaced by a continuous distance field induced by the interface. By combining this distance field with a proper deformation term, the registration energy can be expressed in a linear least square form that is easy and fast to solve. This significantly eases the registration by avoiding direct association between points. Moreover, a hierarchical approach can be easily implemented by employing coarse-to-fine representations. Experimental results are provided for point clouds from multi-view data sets. The qualitative and quantitative comparisons show the outperformance and robustness of our framework. %in presence of noise and outliers.
Address	Tokyo; Japan; December 2014
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	3DV
Notes	ADAS; 600.055; 600.076			Approved	no
Call Number	Admin @ si @ RBS2014			Serial	2534
Permanent link to this record



Author	Lluis Pere de las Heras; Ernest Valveny; Gemma Sanchez
Title	Unsupervised and Notation-Independent Wall Segmentation in Floor Plans Using a Combination of Statistical and Structural Strategies			Type	Book Chapter
Year	2014	Publication	Graphics Recognition. Current Trends and Challenges	Abbreviated Journal
Volume	8746	Issue		Pages	109-121
Keywords	Graphics recognition; Floor plan analysis; Object segmentation
Abstract	In this paper we present a wall segmentation approach in floor plans that is able to work independently to the graphical notation, does not need any pre-annotated data for learning, and is able to segment multiple-shaped walls such as beams and curved-walls. This method results from the combination of the wall segmentation approaches [3, 5] presented recently by the authors. Firstly, potential straight wall segments are extracted in an unsupervised way similar to [3], but restricting even more the wall candidates considered in the original approach. Then, based on [5], these segments are used to learn the texture pattern of walls and spot the lost instances. The presented combination of both methods has been tested on 4 available datasets with different notations and compared qualitatively and quantitatively to the state-of-the-art applied on these collections. Additionally, some qualitative results on floor plans directly downloaded from the Internet are reported in the paper. The overall performance of the method demonstrates either its adaptability to different wall notations and shapes, and to document qualities and resolutions.
Address
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-662-44853-3	Medium
Area		Expedition		Conference
Notes	DAG; ADAS; 600.076; 600.077			Approved	no
Call Number	Admin @ si @ HVS2014			Serial	2535
Permanent link to this record



Author	Lluis Pere de las Heras; David Fernandez; Alicia Fornes; Ernest Valveny; Gemma Sanchez; Josep Llados
Title	Runlength Histogram Image Signature for Perceptual Retrieval of Architectural Floor Plans			Type	Book Chapter
Year	2014	Publication	Graphics Recognition. Current Trends and Challenges	Abbreviated Journal
Volume	8746	Issue		Pages	135-146
Keywords	Graphics recognition; Graphics retrieval; Image classification
Abstract	This paper proposes a runlength histogram signature as a perceptual descriptor of architectural plans in a retrieval scenario. The style of an architectural drawing is characterized by the perception of lines, shapes and texture. Such visual stimuli are the basis for defining semantic concepts as space properties, symmetry, density, etc. We propose runlength histograms extracted in vertical, horizontal and diagonal directions as a characterization of line and space properties in floorplans, so it can be roughly associated to a description of walls and room structure. A retrieval application illustrates the performance of the proposed approach, where given a plan as a query, similar ones are obtained from a database. A ground truth based on human observation has been constructed to validate the hypothesis. Additional retrieval results on sketched building’s facades are reported qualitatively in this paper. Its good description and its adaptability to two different sketch drawings despite its simplicity shows the interest of the proposed approach and opens a challenging research line in graphics recognition.
Address
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-662-44853-3	Medium
Area		Expedition		Conference
Notes	DAG; ADAS; 600.045; 600.056; 600.061; 600.076; 600.077			Approved	no
Call Number	Admin @ si @ HFF2014			Serial	2536
Permanent link to this record



Author	Lluis Gomez; Dimosthenis Karatzas
Title	Scene Text Recognition: No Country for Old Men?			Type	Conference Article
Year	2014	Publication	1st International Workshop on Robust Reading	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	IWRR
Notes	DAG; 600.077			Approved	no
Call Number	Admin @ si @ GoK2014c			Serial	2538
Permanent link to this record



Author	Xavier Perez Sala; Fernando De la Torre; Laura Igual; Sergio Escalera; Cecilio Angulo
Title	Subspace Procrustes Analysis			Type	Conference Article
Year	2014	Publication	ECCV Workshop on ChaLearn Looking at People	Abbreviated Journal
Volume	8925	Issue		Pages	654-668
Keywords
Abstract	Procrustes Analysis (PA) has been a popular technique to align and build 2-D statistical models of shapes. Given a set of 2-D shapes PA is applied to remove rigid transformations. Then, a non-rigid 2-D model is computed by modeling (e.g., PCA) the residual. Although PA has been widely used, it has several limitations for modeling 2-D shapes: occluded landmarks and missing data can result in local minima solutions, and there is no guarantee that the 2-D shapes provide a uniform sampling of the 3-D space of rotations for the object. To address previous issues, this paper proposes Subspace PA (SPA). Given several instances of a 3-D object, SPA computes the mean and a 2-D subspace that can simultaneously model all rigid and non-rigid deformations of the 3-D object. We propose a discrete (DSPA) and continuous (CSPA) formulation for SPA, assuming that 3-D samples of an object are provided. DSPA extends the traditional PA, and produces unbiased 2-D models by uniformly sampling dierent views of the 3-D object. CSPA provides a continuous approach to uniformly sample the space of 3-D rotations, being more ecient in space and time. Experiments using SPA to learn 2-D models of bodies from motion capture data illustrate the benets of our approach.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ECCVW
Notes	OR; HuPBA;MILAB			Approved	no
Call Number	Admin @ si @ PTI2014			Serial	2539
Permanent link to this record



Author	E. Bondi ; L. Sidenari; Andrew Bagdanov; Alberto del Bimbo
Title	Real-time people counting from depth imagery of crowded environments			Type	Conference Article
Year	2014	Publication	11th IEEE International Conference on Advanced Video and Signal based Surveillance	Abbreviated Journal
Volume		Issue		Pages	337 - 342
Keywords
Abstract	In this paper we describe a system for automatic people counting in crowded environments. The approach we propose is a counting-by-detection method based on depth imagery. It is designed to be deployed as an autonomous appliance for crowd analysis in video surveillance application scenarios. Our system performs foreground/background segmentation on depth image streams in order to coarsely segment persons, then depth information is used to localize head candidates which are then tracked in time on an automatically estimated ground plane. The system runs in real-time, at a frame-rate of about 20 fps. We collected a dataset of RGB-D sequences representing three typical and challenging surveillance scenarios, including crowds, queuing and groups. An extensive comparative evaluation is given between our system and more complex, Latent SVM-based head localization for person counting applications.
Address	Seoul; Korea; August 2014
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	AVSS
Notes	LAMP; 600.079			Approved	no
Call Number	Admin @ si @ BSB2014			Serial	2540
Permanent link to this record



Author	Thanh Ha Do; Salvatore Tabbone; Oriol Ramos Terrades
Title	Spotting Symbol Using Sparsity over Learned Dictionary of Local Descriptors			Type	Conference Article
Year	2014	Publication	11th IAPR International Workshop on Document Analysis and Systems	Abbreviated Journal
Volume		Issue		Pages	156-160
Keywords
Abstract	This paper proposes a new approach to spot symbols into graphical documents using sparse representations. More specifically, a dictionary is learned from a training database of local descriptors defined over the documents. Following their sparse representations, interest points sharing similar properties are used to define interest regions. Using an original adaptation of information retrieval techniques, a vector model for interest regions and for a query symbol is built based on its sparsity in a visual vocabulary where the visual words are columns in the learned dictionary. The matching process is performed comparing the similarity between vector models. Evaluation on SESYD datasets demonstrates that our method is promising.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-4799-3243-6	Medium
Area		Expedition		Conference	DAS
Notes	DAG; 600.077			Approved	no
Call Number	Admin @ si @ DTR2014			Serial	2543
Permanent link to this record



Author	Marçal Rusiñol; J. Chazalon; Jean-Marc Ogier
Title	Combining Focus Measure Operators to Predict OCR Accuracy in Mobile-Captured Document Images			Type	Conference Article
Year	2014	Publication	11th IAPR International Workshop on Document Analysis and Systems	Abbreviated Journal
Volume		Issue		Pages	181 - 185
Keywords
Abstract	Mobile document image acquisition is a new trend raising serious issues in business document processing workflows. Such digitization procedure is unreliable, and integrates many distortions which must be detected as soon as possible, on the mobile, to avoid paying data transmission fees, and losing information due to the inability to re-capture later a document with temporary availability. In this context, out-of-focus blur is major issue: users have no direct control over it, and it seriously degrades OCR recognition. In this paper, we concentrate on the estimation of focus quality, to ensure a sufficient legibility of a document image for OCR processing. We propose two contributions to improve OCR accuracy prediction for mobile-captured document images. First, we present 24 focus measures, never tested on document images, which are fast to compute and require no training. Second, we show that a combination of those measures enables state-of-the art performance regarding the correlation with OCR accuracy. The resulting approach is fast, robust, and easy to implement in a mobile device. Experiments are performed on a public dataset, and precise details about image processing are given.
Address	Tours; France; April 2014
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-4799-3243-6	Medium
Area		Expedition		Conference	DAS
Notes	DAG; 601.223; 600.077			Approved	no
Call Number	Admin @ si @ RCO2014a			Serial	2545
Permanent link to this record



Author	Marçal Rusiñol; J. Chazalon; Jean-Marc Ogier
Title	Normalisation et validation d'images de documents capturées en mobilité			Type	Conference Article
Year	2014	Publication	Colloque International Francophone sur l'Écrit et le Document	Abbreviated Journal
Volume		Issue		Pages	109-124
Keywords	mobile document image acquisition; perspective correction; illumination correction; quality assessment; focus measure; OCR accuracy prediction
Abstract	Mobile document image acquisition integrates many distortions which must be corrected or detected on the device, before the document becomes unavailable or paying data transmission fees. In this paper, we propose a system to correct perspective and illumination issues, and estimate the sharpness of the image for OCR recognition. The correction step relies on fast and accurate border detection followed by illumination normalization. Its evaluation on a private dataset shows a clear improvement on OCR accuracy. The quality assessment step relies on a combination of focus measures. Its evaluation on a public dataset shows that this simple method compares well to state of the art, learning-based methods which cannot be embedded on a mobile, and outperforms metric-based methods.
Address	Nancy; France; March 2014
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	CIFED
Notes	DAG; 601.223; 600.077			Approved	no
Call Number	Admin @ si @ RCO2014b			Serial	2546
Permanent link to this record



Author	Frederic Sampedro; Sergio Escalera; Anna Puig
Title	Iterative Multiclass Multiscale Stacked Sequential Learning: definition and application to medical volume segmentation			Type	Journal Article
Year	2014	Publication	Pattern Recognition Letters	Abbreviated Journal	PRL
Volume	46	Issue		Pages	1-10
Keywords	Machine learning; Sequential learning; Multi-class problems; Contextual learning; Medical volume segmentation
Abstract	In this work we present the iterative multi-class multi-scale stacked sequential learning framework (IMMSSL), a novel learning scheme that is particularly suited for medical volume segmentation applications. This model exploits the inherent voxel contextual information of the structures of interest in order to improve its segmentation performance results. Without any feature set or learning algorithm prior assumption, the proposed scheme directly seeks to learn the contextual properties of a region from the predicted classifications of previous classifiers within an iterative scheme. Performance results regarding segmentation accuracy in three two-class and multi-class medical volume datasets show a significant improvement with respect to state of the art alternatives. Due to its easiness of implementation and its independence of feature space and learning algorithm, the presented machine learning framework could be taken into consideration as a first choice in complex volume segmentation scenarios.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	HuPBA;MILAB			Approved	no
Call Number	Admin @ si @ SEP2014			Serial	2550
Permanent link to this record



Author	Eloi Puertas; Miguel Angel Bautista; Daniel Sanchez; Sergio Escalera; Oriol Pujol
Title	Learning to Segment Humans by Stacking their Body Parts,			Type	Conference Article
Year	2014	Publication	ECCV Workshop on ChaLearn Looking at People	Abbreviated Journal
Volume	8925	Issue		Pages	685-697
Keywords	Human body segmentation; Stacked Sequential Learning
Abstract	Human segmentation in still images is a complex task due to the wide range of body poses and drastic changes in environmental conditions. Usually, human body segmentation is treated in a two-stage fashion. First, a human body part detection step is performed, and then, human part detections are used as prior knowledge to be optimized by segmentation strategies. In this paper, we present a two-stage scheme based on Multi-Scale Stacked Sequential Learning (MSSL). We define an extended feature set by stacking a multi-scale decomposition of body part likelihood maps. These likelihood maps are obtained in a first stage by means of a ECOC ensemble of soft body part detectors. In a second stage, contextual relations of part predictions are learnt by a binary classifier, obtaining an accurate body confidence map. The obtained confidence map is fed to a graph cut optimization procedure to obtain the final segmentation. Results show improved segmentation when MSSL is included in the human segmentation pipeline.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ECCVW
Notes	HuPBA;MILAB			Approved	no
Call Number	Admin @ si @ PBS2014			Serial	2553
Permanent link to this record



Author	Marc Bolaños; Maite Garolera; Petia Radeva
Title	Video Segmentation of Life-Logging Videos			Type	Conference Article
Year	2014	Publication	8th Conference on Articulated Motion and Deformable Objects	Abbreviated Journal
Volume	8563	Issue		Pages	1-9
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	AMDO
Notes	MILAB			Approved	no
Call Number	Admin @ si @ BGR2014			Serial	2558
Permanent link to this record