Publicacions CVC -- Query Results

[1–10] << 11 >>

Details

Records
Author	Naveen Onkarappa; Angel Sappa
Title	Synthetic sequences and ground-truth flow field generation for algorithm validation			Type	Journal Article
Year	2015	Publication	Multimedia Tools and Applications	Abbreviated Journal	MTAP
Volume	74	Issue	9	Pages	3121-3135
Keywords	Ground-truth optical flow; Synthetic sequence; Algorithm validation
Abstract	Research in computer vision is advancing by the availability of good datasets that help to improve algorithms, validate results and obtain comparative analysis. The datasets can be real or synthetic. For some of the computer vision problems such as optical flow it is not possible to obtain ground-truth optical flow with high accuracy in natural outdoor real scenarios directly by any sensor, although it is possible to obtain ground-truth data of real scenarios in a laboratory setup with limited motion. In this difficult situation computer graphics offers a viable option for creating realistic virtual scenarios. In the current work we present a framework to design virtual scenes and generate sequences as well as ground-truth flow fields. Particularly, we generate a dataset containing sequences of driving scenarios. The sequences in the dataset vary in different speeds of the on-board vision system, different road textures, complex motion of vehicle and independent moving vehicles in the scene. This dataset enables analyzing and adaptation of existing optical flow methods, and leads to invention of new approaches particularly for driver assistance systems.
Address
Corporate Author				Thesis
Publisher	Springer US	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1380-7501	ISBN		Medium
Area		Expedition		Conference
Notes	ADAS; 600.055; 601.215; 600.076			Approved	no
Call Number	Admin @ si @ OnS2014b			Serial	2472
Permanent link to this record



Author	Xavier Otazu; Olivier Penacchio; Xim Cerda-Company
Title	Brightness and colour induction through contextual influences in V1			Type	Conference Article
Year	2015	Publication	Scottish Vision Group 2015 SGV2015	Abbreviated Journal
Volume	12	Issue	9	Pages	1208-2012
Keywords
Abstract
Address	Carnoustie; Scotland; March 2015
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	SGV
Notes	NEUROBIT;			Approved	no
Call Number	Admin @ si @ OPC2015a			Serial	2632
Permanent link to this record



Author	R.A.Bendezu; E.Barba; E.Burri; D.Cisternas; Carolina Malagelada; Santiago Segui; Anna Accarino; S.Quiroga; E.Monclus; I.Navazo
Title	Intestinal gas content and distribution in health and in patients with functional gut symptoms			Type	Journal Article
Year	2015	Publication	Neurogastroenterology & Motility	Abbreviated Journal	NEUMOT
Volume	27	Issue	9	Pages	1249-1257
Keywords
Abstract	BACKGROUND: The precise relation of intestinal gas to symptoms, particularly abdominal bloating and distension remains incompletely elucidated. Our aim was to define the normal values of intestinal gas volume and distribution and to identify abnormalities in relation to functional-type symptoms. METHODS: Abdominal computed tomography scans were evaluated in healthy subjects (n = 37) and in patients in three conditions: basal (when they were feeling well; n = 88), during an episode of abdominal distension (n = 82) and after a challenge diet (n = 24). Intestinal gas content and distribution were measured by an original analysis program. Identification of patients outside the normal range was performed by machine learning techniques (one-class classifier). Results are expressed as median (IQR) or mean ± SE, as appropriate. KEY RESULTS: In healthy subjects the gut contained 95 (71, 141) mL gas distributed along the entire lumen. No differences were detected between patients studied under asymptomatic basal conditions and healthy subjects. However, either during a spontaneous bloating episode or once challenged with a flatulogenic diet, luminal gas was found to be increased and/or abnormally distributed in about one-fourth of the patients. These patients detected outside the normal range by the classifier exhibited a significantly greater number of abnormal features than those within the normal range (3.7 ± 0.4 vs 0.4 ± 0.1; p < 0.001). CONCLUSIONS & INFERENCES: The analysis of a large cohort of subjects using original techniques provides unique and heretofore unavailable information on the volume and distribution of intestinal gas in normal conditions and in relation to functional gastrointestinal symptoms.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	MILAB			Approved	no
Call Number	Admin @ si @ BBB2015			Serial	2667
Permanent link to this record



Author	Monica Piñol; Angel Sappa; Ricardo Toledo
Title	Adaptive Feature Descriptor Selection based on a Multi-Table Reinforcement Learning Strategy			Type	Journal Article
Year	2015	Publication	Neurocomputing	Abbreviated Journal	NEUCOM
Volume	150	Issue	A	Pages	106–115
Keywords	Reinforcement learning; Q-learning; Bag of features; Descriptors
Abstract	This paper presents and evaluates a framework to improve the performance of visual object classification methods, which are based on the usage of image feature descriptors as inputs. The goal of the proposed framework is to learn the best descriptor for each image in a given database. This goal is reached by means of a reinforcement learning process using the minimum information. The visual classification system used to demonstrate the proposed framework is based on a bag of features scheme, and the reinforcement learning technique is implemented through the Q-learning approach. The behavior of the reinforcement learning with different state definitions is evaluated. Additionally, a method that combines all these states is formulated in order to select the optimal state. Finally, the chosen actions are obtained from the best set of image descriptors in the literature: PHOW, SIFT, C-SIFT, SURF and Spin. Experimental results using two public databases (ETH and COIL) are provided showing both the validity of the proposed approach and comparisons with state of the art. In all the cases the best results are obtained with the proposed approach.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ADAS; 600.055; 600.076			Approved	no
Call Number	Admin @ si @ PST2015			Serial	2473
Permanent link to this record



Author	Francisco Alvaro; Francisco Cruz; Joan Andreu Sanchez; Oriol Ramos Terrades; Jose Miguel Benedi
Title	Structure Detection and Segmentation of Documents Using 2D Stochastic Context-Free Grammars			Type	Journal Article
Year	2015	Publication	Neurocomputing	Abbreviated Journal	NEUCOM
Volume	150	Issue	A	Pages	147-154
Keywords	document image analysis; stochastic context-free grammars; text classication features
Abstract	In this paper we dene a bidimensional extension of Stochastic Context-Free Grammars for structure detection and segmentation of images of documents. Two sets of text classication features are used to perform an initial classication of each zone of the page. Then, the document segmentation is obtained as the most likely hypothesis according to a stochastic grammar. We used a dataset of historical marriage license books to validate this approach. We also tested several inference algorithms for Probabilistic Graphical Models and the results showed that the proposed grammatical model outperformed the other methods. Furthermore, grammars also provide the document structure along with its segmentation.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	DAG; 601.158; 600.077; 600.061			Approved	no
Call Number	Admin @ si @ ACS2015			Serial	2531
Permanent link to this record



Author	Daniel Sanchez; Miguel Angel Bautista; Sergio Escalera
Title	HuPBA 8k+: Dataset and ECOC-GraphCut based Segmentation of Human Limbs			Type	Journal Article
Year	2015	Publication	Neurocomputing	Abbreviated Journal	NEUCOM
Volume	150	Issue	A	Pages	173–188
Keywords	Human limb segmentation; ECOC; Graph-Cuts
Abstract	Human multi-limb segmentation in RGB images has attracted a lot of interest in the research community because of the huge amount of possible applications in fields like Human-Computer Interaction, Surveillance, eHealth, or Gaming. Nevertheless, human multi-limb segmentation is a very hard task because of the changes in appearance produced by different points of view, clothing, lighting conditions, occlusions, and number of articulations of the human body. Furthermore, this huge pose variability makes the availability of large annotated datasets difficult. In this paper, we introduce the HuPBA8k+ dataset. The dataset contains more than 8000 labeled frames at pixel precision, including more than 120000 manually labeled samples of 14 different limbs. For completeness, the dataset is also labeled at frame-level with action annotations drawn from an 11 action dictionary which includes both single person actions and person-person interactive actions. Furthermore, we also propose a two-stage approach for the segmentation of human limbs. In a first stage, human limbs are trained using cascades of classifiers to be split in a tree-structure way, which is included in an Error-Correcting Output Codes (ECOC) framework to define a body-like probability map. This map is used to obtain a binary mask of the subject by means of GMM color modelling and GraphCuts theory. In a second stage, we embed a similar tree-structure in an ECOC framework to build a more accurate set of limb-like probability maps within the segmented user mask, that are fed to a multi-label GraphCut procedure to obtain final multi-limb segmentation. The methodology is tested on the novel HuPBA8k+ dataset, showing performance improvements in comparison to state-of-the-art approaches. In addition, a baseline of standard action recognition methods for the 11 actions categories of the novel dataset is also provided.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	HuPBA;MILAB			Approved	no
Call Number	Admin @ si @ SBE2015			Serial	2552
Permanent link to this record



Author	Julie Digne; Mariella Dimiccoli; Neus Sabater; Philippe Salembier
Title	Neighborhood Filters and the Recovery of 3D Information			Type	Book Chapter
Year	2015	Publication	Handbook of Mathematical Methods in Imaging	Abbreviated Journal
Volume		Issue	III	Pages	1645-1673
Keywords
Abstract	Following their success in image processing (see Chapter Local Smoothing Neighborhood Filters), neighborhood filters have been extended to 3D surface processing. This adaptation is not straightforward. It has led to several variants for surfaces depending on whether the surface is defined as a mesh, or as a raw data point set. The image gray level in the bilateral similarity measure is replaced by a geometric information such as the normal or the curvature. The first section of this chapter reviews the variants of 3D mesh bilateral filters and compares them to the simplest possible isotropic filter, the mean curvature motion.In a second part, this chapter reviews applications of the bilateral filter to a data composed of a sparse depth map (or of depth cues) and of the image on which they have been computed. Such sparse depth cues can be obtained by stereovision or by psychophysical techniques. The underlying assumption to these applications is that pixels with similar intensity around a region are likely to have similar depths. Therefore, when diffusing depth information with a bilateral filter based on locality and color similarity, the discontinuities in depth are assured to be consistent with the color discontinuities, which is generally a desirable property. In the reviewed applications, this ends up with the reconstruction of a dense perceptual depth map from the joint data of an image and of depth cues.
Address
Corporate Author				Thesis
Publisher	Springer New York	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-4939-0789-2	Medium
Area		Expedition		Conference
Notes	MILAB			Approved	no
Call Number	Admin @ si @ DDS2015			Serial	2710
Permanent link to this record



Author	David Sanchez-Mendoza; David Masip; Agata Lapedriza
Title	Emotion recognition from mid-level features			Type	Journal Article
Year	2015	Publication	Pattern Recognition Letters	Abbreviated Journal	PRL
Volume	67	Issue	Part 1	Pages	66–74
Keywords	Facial expression; Emotion recognition; Action units; Computer vision
Abstract	In this paper we present a study on the use of Action Units as mid-level features for automatically recognizing basic and subtle emotions. We propose a representation model based on mid-level facial muscular movement features. We encode these movements dynamically using the Facial Action Coding System, and propose to use these intermediate features based on Action Units (AUs) to classify emotions. AUs activations are detected fusing a set of spatiotemporal geometric and appearance features. The algorithm is validated in two applications: (i) the recognition of 7 basic emotions using the publicly available Cohn-Kanade database, and (ii) the inference of subtle emotional cues in the Newscast database. In this second scenario, we consider emotions that are perceived cumulatively in longer periods of time. In particular, we Automatically classify whether video shoots from public News TV channels refer to Good or Bad news. To deal with the different video lengths we propose a Histogram of Action Units and compute it using a sliding window strategy on the frame sequences. Our approach achieves accuracies close to human perception.
Address
Corporate Author				Thesis
Publisher	Elsevier B.V.	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	0167-8655	ISBN		Medium
Area		Expedition		Conference
Notes	OR;MV			Approved	no
Call Number	Admin @ si @ SML2015			Serial	2746
Permanent link to this record