Publicacions CVC -- Query Results

[121–130] << 131 132 133 134 135 136 137 138 139 140 >> [141–150]

Details

Records
Author	Marco Pedersoli
Title	A Multiresolution Cascade for Human Detection			Type	Miscellaneous
Year	2008	Publication	CVC Technical Report #126	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Barcelona, Spain
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ISE			Approved	no
Call Number	Admin @ si @ Ped2008			Serial	1148
Permanent link to this record



Author	Bhaskar Chakraborty
Title	View-Invariant Human-Body Detection with Extension to Human Action Recognition using Component Wise HMM of Body Parts			Type	Miscellaneous
Year	2008	Publication	CVC Technical Report #123	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Barcelona, Spain
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ISE			Approved	no
Call Number	Admin @ si @ Cha2008			Serial	1149
Permanent link to this record



Author	Pierluigi Casale
Title	Social Environment Description from Data Collected with a Wearable Device			Type	Miscellaneous
Year	2008	Publication	CVC Technical Report #124	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Barcelona, Spain
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes				Approved	no
Call Number	Admin @ si @ Cas2008			Serial	1151
Permanent link to this record



Author	Bogdan Raducanu; Jordi Vitria; D. Gatica-Perez
Title	You are Fired! Nonverbal Role Analysis in Competitive Meetings			Type	Conference Article
Year	2009	Publication	IEEE International Conference on Audio, Speech and Signal Processing	Abbreviated Journal
Volume		Issue		Pages	1949–1952
Keywords
Abstract	This paper addresses the problem of social interaction analysis in competitive meetings, using nonverbal cues. For our study, we made use of ldquoThe Apprenticerdquo reality TV show, which features a competition for a real, highly paid corporate job. Our analysis is centered around two tasks regarding a person's role in a meeting: predicting the person with the highest status and predicting the fired candidates. The current study was carried out using nonverbal audio cues. Results obtained from the analysis of a full season of the show, representing around 90 minutes of audio data, are very promising (up to 85.7% of accuracy in the first case and up to 92.8% in the second case). Our approach is based only on the nonverbal interaction dynamics during the meeting without relying on the spoken words.
Address	Taipei, Taiwan
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1520-6149	ISBN	978-1-4244-2353-8	Medium
Area		Expedition		Conference	ICASSP
Notes	OR;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ RVG2009			Serial	1154
Permanent link to this record



Author	Jose Manuel Alvarez; Theo Gevers; Antonio Lopez
Title	Learning Photometric Invariance from Diversified Color Model Ensembles			Type	Conference Article
Year	2009	Publication	22nd IEEE Conference on Computer Vision and Pattern Recognition	Abbreviated Journal
Volume		Issue		Pages	565–572
Keywords	road detection
Abstract	Color is a powerful visual cue for many computer vision applications such as image segmentation and object recognition. However, most of the existing color models depend on the imaging conditions affecting negatively the performance of the task at hand. Often, a reflection model (e.g., Lambertian or dichromatic reflectance) is used to derive color invariant models. However, those reflection models might be too restricted to model real-world scenes in which different reflectance mechanisms may hold simultaneously. Therefore, in this paper, we aim to derive color invariance by learning from color models to obtain diversified color invariant ensembles. First, a photometrical orthogonal and non-redundant color model set is taken on input composed of both color variants and invariants. Then, the proposed method combines and weights these color models to arrive at a diversified color ensemble yielding a proper balance between invariance (repeatability) and discriminative power (distinctiveness). To achieve this, the fusion method uses a multi-view approach to minimize the estimation error. In this way, the method is robust to data uncertainty and produces properly diversified color invariant ensembles. Experiments are conducted on three different image datasets to validate the method. From the theoretical and experimental results, it is concluded that the method is robust against severe variations in imaging conditions. The method is not restricted to a certain reflection model or parameter tuning. Further, the method outperforms state-of- the-art detection techniques in the field of object, skin and road recognition.
Address	Miami (USA)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1063-6919	ISBN	978-1-4244-3992-8	Medium
Area		Expedition		Conference	CVPR
Notes	ADAS;ISE			Approved	no
Call Number	ADAS @ adas @ AGL2009			Serial	1169
Permanent link to this record



Author	Antonio Clavelli; Dimosthenis Karatzas
Title	Text Segmentation in Colour Posters from the Spanish Civil War Era			Type	Conference Article
Year	2009	Publication	10th International Conference on Document Analysis and Recognition	Abbreviated Journal
Volume		Issue		Pages	181 - 185
Keywords
Abstract	The extraction of textual content from colour documents of a graphical nature is a complicated task. The text can be rendered in any colour, size and orientation while the existence of complex background graphics with repetitive patterns can make its localization and segmentation extremely difficult. Here, we propose a new method for extracting textual content from such colour images that makes no assumption as to the size of the characters, their orientation or colour, while it is tolerant to characters that do not follow a straight baseline. We evaluate this method on a collection of documents with historical connotations: the Posters from the Spanish Civil War.
Address	Barcelona, Spain
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1520-5363	ISBN	978-1-4244-4500-4	Medium
Area		Expedition		Conference	ICDAR
Notes	DAG			Approved	no
Call Number	DAG @ dag @ ClK2009			Serial	1172
Permanent link to this record



Author	Albert Gordo; Ernest Valveny
Title	A rotation invariant page layout descriptor for document classification and retrieval			Type	Conference Article
Year	2009	Publication	10th International Conference on Document Analysis and Recognition	Abbreviated Journal
Volume		Issue		Pages	481–485
Keywords
Abstract	Document classification usually requires of structural features such as the physical layout to obtain good accuracy rates on complex documents. This paper introduces a descriptor of the layout and a distance measure based on the cyclic dynamic time warping which can be computed in O(n2). This descriptor is translation invariant and can be easily modified to be scale and rotation invariant. Experiments with this descriptor and its rotation invariant modification are performed on the Girona archives database and compared against another common layout distance, the minimum weight edge cover. The experiments show that these methods outperform the MWEC both in accuracy and speed, particularly on rotated documents.
Address	Barcelona, Spain
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1520-5363	ISBN	978-1-4244-4500-4	Medium
Area		Expedition		Conference	ICDAR
Notes	DAG			Approved	no
Call Number	DAG @ dag @ GoV2009a			Serial	1175
Permanent link to this record



Author	Marçal Rusiñol; Josep Llados
Title	Logo Spotting by a Bag-of-words Approach for Document Categorization			Type	Conference Article
Year	2009	Publication	10th International Conference on Document Analysis and Recognition	Abbreviated Journal
Volume		Issue		Pages	111–115
Keywords
Abstract	In this paper we present a method for document categorization which processes incoming document images such as invoices or receipts. The categorization of these document images is done in terms of the presence of a certain graphical logo detected without segmentation. The graphical logos are described by a set of local features and the categorization of the documents is performed by the use of a bag-of-words model. Spatial coherence rules are added to reinforce the correct category hypothesis, aiming also to spot the logo inside the document image. Experiments which demonstrate the effectiveness of this system on a large set of real data are presented.
Address	Barcelona; Spain
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1520-5363	ISBN	978-1-4244-4500-4	Medium
Area		Expedition		Conference	ICDAR
Notes	DAG			Approved	no
Call Number	DAG @ dag @ RuL2009b			Serial	1179
Permanent link to this record



Author	Sergio Escalera; Alicia Fornes; Oriol Pujol; Alberto Escudero; Petia Radeva
Title	Circular Blurred Shape Model for Symbol Spotting in Documents			Type	Conference Article
Year	2009	Publication	16th IEEE International Conference on Image Processing	Abbreviated Journal
Volume		Issue		Pages	1985-1988
Keywords
Abstract	Symbol spotting problem requires feature extraction strategies able to generalize from training samples and to localize the target object while discarding most part of the image. In the case of document analysis, symbol spotting techniques have to deal with a high variability of symbols' appearance. In this paper, we propose the Circular Blurred Shape Model descriptor. Feature extraction is performed capturing the spatial arrangement of significant object characteristics in a correlogram structure. Shape information from objects is shared among correlogram regions, being tolerant to the irregular deformations. Descriptors are learnt using a cascade of classifiers and Abadoost as the base classifier. Finally, symbol spotting is performed by means of a windowing strategy using the learnt cascade over plan and old musical score documents. Spotting and multi-class categorization results show better performance comparing with the state-of-the-art descriptors.
Address	Cairo, Egypt
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-4244-5653-6	Medium
Area		Expedition		Conference	ICIP
Notes	MILAB;HuPBA;DAG			Approved	no
Call Number	BCNPCL @ bcnpcl @ EFP2009b			Serial	1184
Permanent link to this record



Author	Sergio Escalera; Eloi Puertas; Petia Radeva; Oriol Pujol
Title	Multimodal laughter recognition in video conversations			Type	Conference Article
Year	2009	Publication	2nd IEEE Workshop on CVPR for Human communicative Behavior analysis	Abbreviated Journal
Volume		Issue		Pages	110–115
Keywords
Abstract	Laughter detection is an important area of interest in the Affective Computing and Human-computer Interaction fields. In this paper, we propose a multi-modal methodology based on the fusion of audio and visual cues to deal with the laughter recognition problem in face-to-face conversations. The audio features are extracted from the spectogram and the video features are obtained estimating the mouth movement degree and using a smile and laughter classifier. Finally, the multi-modal cues are included in a sequential classifier. Results over videos from the public discussion blog of the New York Times show that both types of features perform better when considered together by the classifier. Moreover, the sequential methodology shows to significantly outperform the results obtained by an Adaboost classifier.
Address	Miami (USA)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	2160-7508	ISBN	978-1-4244-3994-2	Medium
Area		Expedition		Conference	CVPR
Notes	MILAB;HuPBA			Approved	no
Call Number	BCNPCL @ bcnpcl @ EPR2009c			Serial	1188
Permanent link to this record



Author	Xavier Baro; Sergio Escalera; Petia Radeva; Jordi Vitria
Title	Visual Content Layer for Scalable Recognition in Urban Image Databases, Internet Multimedia Search and Mining			Type	Conference Article
Year	2009	Publication	10th IEEE International Conference on Multimedia and Expo	Abbreviated Journal
Volume		Issue		Pages	1616–1619
Keywords
Abstract	Rich online map interaction represents a useful tool to get multimedia information related to physical places. With this type of systems, users can automatically compute the optimal route for a trip or to look for entertainment places or hotels near their actual position. Standard maps are defined as a fusion of layers, where each one contains specific data such height, streets, or a particular business location. In this paper we propose the construction of a visual content layer which describes the visual appearance of geographic locations in a city. We captured, by means of a Mobile Mapping system, a huge set of georeferenced images (> 500K) which cover the whole city of Barcelona. For each image, hundreds of region descriptions are computed off-line and described as a hash code. This allows an efficient and scalable way of accessing maps by visual content.
Address	New York (USA)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-4244-4291-1	Medium
Area		Expedition		Conference	ICME
Notes	OR;MILAB;HuPBA;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ BER2009			Serial	1189
Permanent link to this record



Author	Fahad Shahbaz Khan; Joost Van de Weijer; Maria Vanrell
Title	Top-Down Color Attention for Object Recognition			Type	Conference Article
Year	2009	Publication	12th International Conference on Computer Vision	Abbreviated Journal
Volume		Issue		Pages	979 - 986
Keywords
Abstract	Generally the bag-of-words based image representation follows a bottom-up paradigm. The subsequent stages of the process: feature detection, feature description, vocabulary construction and image representation are performed independent of the intentioned object classes to be detected. In such a framework, combining multiple cues such as shape and color often provides below-expected results. This paper presents a novel method for recognizing object categories when using multiple cues by separating the shape and color cue. Color is used to guide attention by means of a top-down category-specific attention map. The color attention map is then further deployed to modulate the shape features by taking more features from regions within an image that are likely to contain an object instance. This procedure leads to a category-specific image histogram representation for each category. Furthermore, we argue that the method combines the advantages of both early and late fusion. We compare our approach with existing methods that combine color and shape cues on three data sets containing varied importance of both cues, namely, Soccer ( color predominance), Flower (color and shape parity), and PASCAL VOC Challenge 2007 (shape predominance). The experiments clearly demonstrate that in all three data sets our proposed framework significantly outperforms the state-of-the-art methods for combining color and shape information.
Address	Kyoto, Japan
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1550-5499	ISBN	978-1-4244-4420-5	Medium
Area		Expedition		Conference	ICCV
Notes	CIC			Approved	no
Call Number	CAT @ cat @ SWV2009			Serial	1196
Permanent link to this record



Author	Arjan Gijsenij; Theo Gevers; Joost Van de Weijer
Title	Physics-based Edge Evaluation for Improved Color Constancy			Type	Conference Article
Year	2009	Publication	22nd IEEE Conference on Computer Vision and Pattern Recognition	Abbreviated Journal
Volume		Issue		Pages	581 – 588
Keywords
Abstract	Edge-based color constancy makes use of image derivatives to estimate the illuminant. However, different edge types exist in real-world images such as shadow, geometry, material and highlight edges. These different edge types may have a distinctive influence on the performance of the illuminant estimation.
Address	Miami, USA
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1063-6919	ISBN	978-1-4244-3992-8	Medium
Area		Expedition		Conference	CVPR
Notes	CAT;ISE			Approved	no
Call Number	CAT @ cat @ GGW2009			Serial	1197
Permanent link to this record



Author	Jose Manuel Alvarez; Ferran Diego; Joan Serrat; Antonio Lopez
Title	Automatic Ground-truthing using video registration for on-board detection algorithms			Type	Conference Article
Year	2009	Publication	16th IEEE International Conference on Image Processing	Abbreviated Journal
Volume		Issue		Pages	4389 - 4392
Keywords
Abstract	Ground-truth data is essential for the objective evaluation of object detection methods in computer vision. Many works claim their method is robust but they support it with experiments which are not quantitatively assessed with regard some ground-truth. This is one of the main obstacles to properly evaluate and compare such methods. One of the main reasons is that creating an extensive and representative ground-truth is very time consuming, specially in the case of video sequences, where thousands of frames have to be labelled. Could such a ground-truth be generated, at least in part, automatically? Though it may seem a contradictory question, we show that this is possible for the case of video sequences recorded from a moving camera. The key idea is transferring existing frame segmentations from a reference sequence into another video sequence recorded at a different time on the same track, possibly under a different ambient lighting. We have carried out experiments on several video sequence pairs and quantitatively assessed the precision of the transformed ground-truth, which prove that our approach is not only feasible but also quite accurate.
Address	Cairo, Egypt
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1522-4880	ISBN	978-1-4244-5653-6	Medium
Area		Expedition		Conference	ICIP
Notes	ADAS			Approved	no
Call Number	ADAS @ adas @ ADS2009			Serial	1201
Permanent link to this record



Author	Enric Marti; Jaume Rocarias; Ricardo Toledo; Aura Hernandez-Sabate
Title	Caronte: plataforma Moodle con gestion flexible de grupos. Primeras experiencias en asignaturas de Ingenieria Informatica			Type	Miscellaneous
Year	2009	Publication	15th Jornadas de Enseñanza Universitaria de la Informatica	Abbreviated Journal
Volume		Issue		Pages	461–468
Keywords
Abstract	En este artículo se presenta Caronte, entorno LMS (Learning Management System) basado en Moodle. Una característica importante del entorno es la gestión flexible de grupos en una asignatura. Entendemos por grupo un conjunto de alumnos que realizan un trabajo y uno de ellos entrega la actividad propuesta (práctica, encuesta, etc.) en representación del grupo. Hemos trabajado en la confección de estos grupos, implementando un sistema de inscripción por contraseña. Caronte ofrece un conjunto de actividades basadas en este concepto de grupo: encuestas, tareas (entrega de trabajos o prácticas), encuestas de autoevaluación y cuestionarios, entre otras. Basada en nuestra actividad de encuesta, hemos definido una actividad de Control, que permite un cierto feedback electrónico del profesor sobre la actividad de los alumnos. Finalmente, se presenta un resumen de las experiencias de uso de Caronte sobre asignaturas de Ingeniería Informática en el curso 2007-08.
Address	Barcelona, Spain
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-84-692-2758-9	Medium
Area		Expedition		Conference	JENUI
Notes	IAM;RV;ADAS			Approved	no
Call Number	IAM @ iam @ MRT2009			Serial	1202
Permanent link to this record