Publicacions CVC -- Query Results

<< 1 2 3 4 5 6 7 8 9 10 >>

Details

Records
Author	Sergio Escalera; Alicia Fornes; O. Pujol; Petia Radeva; Gemma Sanchez; Josep Llados
Title	Blurred Shape Model for Binary and Grey-level Symbol Recognition			Type	Journal Article
Year	2009	Publication	Pattern Recognition Letters	Abbreviated Journal	PRL
Volume	30	Issue	15	Pages	1424–1433
Keywords
Abstract	Many symbol recognition problems require the use of robust descriptors in order to obtain rich information of the data. However, the research of a good descriptor is still an open issue due to the high variability of symbols appearance. Rotation, partial occlusions, elastic deformations, intra-class and inter-class variations, or high variability among symbols due to different writing styles, are just a few problems. In this paper, we introduce a symbol shape description to deal with the changes in appearance that these types of symbols suffer. The shape of the symbol is aligned based on principal components to make the recognition invariant to rotation and reflection. Then, we present the Blurred Shape Model descriptor (BSM), where new features encode the probability of appearance of each pixel that outlines the symbols shape. Moreover, we include the new descriptor in a system to deal with multi-class symbol categorization problems. Adaboost is used to train the binary classifiers, learning the BSM features that better split symbol classes. Then, the binary problems are embedded in an Error-Correcting Output Codes framework (ECOC) to deal with the multi-class case. The methodology is evaluated on different synthetic and real data sets. State-of-the-art descriptors and classifiers are compared, showing the robustness and better performance of the present scheme to classify symbols with high variability of appearance.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	HuPBA; DAG; MILAB			Approved	no
Call Number	BCNPCL @ bcnpcl @ EFP2009a			Serial	1180
Permanent link to this record



Author	Sergio Escalera; Alicia Fornes; Oriol Pujol; Alberto Escudero; Petia Radeva
Title	Circular Blurred Shape Model for Symbol Spotting in Documents			Type	Conference Article
Year	2009	Publication	16th IEEE International Conference on Image Processing	Abbreviated Journal
Volume		Issue		Pages	1985-1988
Keywords
Abstract	Symbol spotting problem requires feature extraction strategies able to generalize from training samples and to localize the target object while discarding most part of the image. In the case of document analysis, symbol spotting techniques have to deal with a high variability of symbols' appearance. In this paper, we propose the Circular Blurred Shape Model descriptor. Feature extraction is performed capturing the spatial arrangement of significant object characteristics in a correlogram structure. Shape information from objects is shared among correlogram regions, being tolerant to the irregular deformations. Descriptors are learnt using a cascade of classifiers and Abadoost as the base classifier. Finally, symbol spotting is performed by means of a windowing strategy using the learnt cascade over plan and old musical score documents. Spotting and multi-class categorization results show better performance comparing with the state-of-the-art descriptors.
Address	Cairo, Egypt
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-4244-5653-6	Medium
Area		Expedition		Conference	ICIP
Notes	MILAB;HuPBA;DAG			Approved	no
Call Number	BCNPCL @ bcnpcl @ EFP2009b			Serial	1184
Permanent link to this record



Author	Sergio Escalera; Alicia Fornes; Oriol Pujol; Petia Radeva
Title	Multi-class Binary Symbol Classification with Circular Blurred Shape Models			Type	Conference Article
Year	2009	Publication	15th International Conference on Image Analysis and Processing	Abbreviated Journal
Volume	5716	Issue		Pages	1005–1014
Keywords
Abstract	Multi-class binary symbol classification requires the use of rich descriptors and robust classifiers. Shape representation is a difficult task because of several symbol distortions, such as occlusions, elastic deformations, gaps or noise. In this paper, we present the Circular Blurred Shape Model descriptor. This descriptor encodes the arrangement information of object parts in a correlogram structure. A prior blurring degree defines the level of distortion allowed to the symbol. Moreover, we learn the new feature space using a set of Adaboost classifiers, which are combined in the Error-Correcting Output Codes framework to deal with the multi-class categorization problem. The presented work has been validated over different multi-class data sets, and compared to the state-of-the-art descriptors, showing significant performance improvements.
Address	Salerno, Italy
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-04145-7	Medium
Area		Expedition		Conference	ICIAP
Notes	MILAB;HuPBA;DAG			Approved	no
Call Number	BCNPCL @ bcnpcl @ EFP2009c			Serial	1186
Permanent link to this record



Author	Sergio Escalera; R. M. Martinez; Jordi Vitria; Petia Radeva; Maria Teresa Anguera
Title	Dominance Detection in Face-to-face Conversations			Type	Conference Article
Year	2009	Publication	2nd IEEE Workshop on CVPR for Human communicative Behavior analysis	Abbreviated Journal
Volume		Issue		Pages	97–102
Keywords
Abstract	Dominance is referred to the level of influence a person has in a conversation. Dominance is an important research area in social psychology, but the problem of its automatic estimation is a very recent topic in the contexts of social and wearable computing. In this paper, we focus on dominance detection from visual cues. We estimate the correlation among observers by categorizing the dominant people in a set of face-to-face conversations. Different dominance indicators from gestural communication are defined, manually annotated, and compared to the observers opinion. Moreover, the considered indicators are automatically extracted from video sequences and learnt by using binary classifiers. Results from the three analysis shows a high correlation and allows the categorization of dominant people in public discussion video sequences.
Address	Miami, USA
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	2160-7508	ISBN	978-1-4244-3994-2	Medium
Area		Expedition		Conference	CVPR
Notes	HuPBA; OR; MILAB;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ EMV2009			Serial	1227
Permanent link to this record



Author	Sergio Escalera; Oriol Pujol; J. Mauri; Petia Radeva
Title	Intravascular Ultrasound Tissue Characterization with Sub-class Error-Correcting Output Codes			Type	Journal Article
Year	2009	Publication	Journal of Signal Processing Systems	Abbreviated Journal
Volume	55	Issue	1-3	Pages	35–47
Keywords
Abstract	Intravascular ultrasound (IVUS) represents a powerful imaging technique to explore coronary vessels and to study their morphology and histologic properties. In this paper, we characterize different tissues based on radial frequency, texture-based, and combined features. To deal with the classification of multiple tissues, we require the use of robust multi-class learning techniques. In this sense, error-correcting output codes (ECOC) show to robustly combine binary classifiers to solve multi-class problems. In this context, we propose a strategy to model multi-class classification tasks using sub-classes information in the ECOC framework. The new strategy splits the classes into different sub-sets according to the applied base classifier. Complex IVUS data sets containing overlapping data are learnt by splitting the original set of classes into sub-classes, and embedding the binary problems in a problem-dependent ECOC design. The method automatically characterizes different tissues, showing performance improvements over the state-of-the-art ECOC techniques for different base classifiers. Furthermore, the combination of RF and texture-based features also shows improvements over the state-of-the-art approaches.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1939-8018	ISBN		Medium
Area		Expedition		Conference
Notes	MILAB;HuPBA			Approved	no
Call Number	BCNPCL @ bcnpcl @ EPM2009			Serial	1258
Permanent link to this record



Author	Sergio Escalera; Oriol Pujol; Petia Radeva
Title	Separability of Ternary Codes for Sparse Designs of Error-Correcting Output Codes			Type	Journal Article
Year	2009	Publication	Pattern Recognition Letters	Abbreviated Journal	PRL
Volume	30	Issue	3	Pages	285–297
Keywords
Abstract	Error Correcting Output Codes (ECOC) represent a successful framework to deal with multi-class categorization problems based on combining binary classiﬁers. In this paper, we present a new formulation of the ternary ECOC distance and the error-correcting capabilities in the ternary ECOC framework. Based on the new measure, we stress on how to design coding matrices preventing codiﬁcation ambiguity and propose a new Sparse Random coding matrix with ternary distance maximization. The results on the UCI Repository and in a real speed trafﬁc categorization problem show that when the coding design satisﬁes the new ternary measures, signiﬁcant performance improvement is obtained independently of the decoding strategy applied.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	MILAB;HuPBA			Approved	no
Call Number	BCNPCL @ bcnpcl @ EPR2009a			Serial	1153
Permanent link to this record



Author	Sergio Escalera; Oriol Pujol; Petia Radeva; Jordi Vitria
Title	Measuring Interest of Human Dyadic Interactions			Type	Conference Article
Year	2009	Publication	12th International Conference of the Catalan Association for Artificial Intelligence	Abbreviated Journal
Volume	202	Issue		Pages	45-54
Keywords
Abstract	In this paper, we argue that only using behavioural motion information, we are able to predict the interest of observers when looking at face-to-face interactions. We propose a set of movement-related features from body, face, and mouth activity in order to define a set of higher level interaction features, such as stress, activity, speaking engagement, and corporal engagement. Error-Correcting Output Codes framework with an Adaboost base classifier is used to learn to rank the perceived observer's interest in face-to-face interactions. The automatic system shows good correlation between the automatic categorization results and the manual ranking made by the observers. In particular, the learning system shows that stress features have a high predictive power for ranking interest of observers when looking at of face-to-face interactions.
Address	Cardona (Spain)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-60750-061-2	Medium
Area		Expedition		Conference	CCIA
Notes	OR;MILAB;HuPBA;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ EPR2009b			Serial	1182
Permanent link to this record



Author	Sergio Escalera; Eloi Puertas; Petia Radeva; Oriol Pujol
Title	Multimodal laughter recognition in video conversations			Type	Conference Article
Year	2009	Publication	2nd IEEE Workshop on CVPR for Human communicative Behavior analysis	Abbreviated Journal
Volume		Issue		Pages	110–115
Keywords
Abstract	Laughter detection is an important area of interest in the Affective Computing and Human-computer Interaction fields. In this paper, we propose a multi-modal methodology based on the fusion of audio and visual cues to deal with the laughter recognition problem in face-to-face conversations. The audio features are extracted from the spectogram and the video features are obtained estimating the mouth movement degree and using a smile and laughter classifier. Finally, the multi-modal cues are included in a sequential classifier. Results over videos from the public discussion blog of the New York Times show that both types of features perform better when considered together by the classifier. Moreover, the sequential methodology shows to significantly outperform the results obtained by an Adaboost classifier.
Address	Miami (USA)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	2160-7508	ISBN	978-1-4244-3994-2	Medium
Area		Expedition		Conference	CVPR
Notes	MILAB;HuPBA			Approved	no
Call Number	BCNPCL @ bcnpcl @ EPR2009c			Serial	1188
Permanent link to this record



Author	Sergio Escalera; Oriol Pujol; Petia Radeva
Title	Recoding Error-Correcting Output Codes			Type	Conference Article
Year	2009	Publication	8th International Workshop of Multiple Classifier Systems	Abbreviated Journal
Volume	5519	Issue		Pages	11–21
Keywords
Abstract	One of the most widely applied techniques to deal with multi- class categorization problems is the pairwise voting procedure. Recently, this classical approach has been embedded in the Error-Correcting Output Codes framework (ECOC). This framework is based on a coding step, where a set of binary problems are learnt and coded in a matrix, and a decoding step, where a new sample is tested and classified according to a comparison with the positions of the coded matrix. In this paper, we present a novel approach to redefine without retraining, in a problem-dependent way, the one-versus-one coding matrix so that the new coded information increases the generalization capability of the system. Moreover, the final classification can be tuned with the inclusion of a weighting matrix in the decoding step. The approach has been validated over several UCI Machine Learning repository data sets and two real multi-class problems: traffic sign and face categorization. The results show that performance improvements are obtained when comparing the new approach to one of the best ECOC designs (one-versus-one). Furthermore, the novel methodology obtains at least the same performance than the one-versus-one ECOC design.
Address	Reykjavik (Iceland)
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-02325-5	Medium
Area		Expedition		Conference	MCS
Notes	MILAB;HuPBA			Approved	no
Call Number	BCNPCL @ bcnpcl @ EPR2009d			Serial	1190
Permanent link to this record



Author	Carlo Gatta; Petia Radeva
Title	Bilateral Enhancers			Type	Conference Article
Year	2009	Publication	16th IEEE International Conference on Image Processing	Abbreviated Journal
Volume		Issue		Pages	3161-3165
Keywords
Abstract	Ten years ago the concept of bilateral filtering (BF) became popular in the image processing community. The core of the idea is to blend the effect of a spatial filter, as e.g. the Gaussian filter, with the effect of a filter that acts on image values. The two filters acts on orthogonal domains of a picture: the 2D lattice of the image support and the intensity (or color) domain. The BF approach is an intuitive way to blend these two filters giving rise to algorithms that perform difficult tasks requiring a relatively simple design. In this paper we extend the concept of BF, proposing the bilateral enhancers (BE). We show how to design proper functions to obtain an edge-preserving smoothing and a selective sharpening. Moreover, we show that the proposed algorithm can perform edge-preserving smoothing and selective sharpening simultaneously in a single filtering.
Address	Cairo, Egypt
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1522-4880	ISBN	978-1-4244-5653-6	Medium
Area		Expedition		Conference	ICIP
Notes	MILAB			Approved	no
Call Number	BCNPCL @ bcnpcl @ GaR2009b			Serial	1243
Permanent link to this record



Author	Carlo Gatta; Juan Diego Gomez; Francesco Ciompi; Oriol Rodriguez-Leor; Petia Radeva
Title	Toward robust myocardial blush grade estimation in contrast angiography			Type	Conference Article
Year	2009	Publication	4th Iberian Conference on Pattern Recognition and Image Analysis	Abbreviated Journal
Volume	5524	Issue		Pages	249–256
Keywords
Abstract	The assessment of Myocardial Blush Grade after primary angioplasty is a precious diagnostic tool to understand if the patient needs further medication or the use of specifics drugs. Unfortunately, the assessment of MBG is difficult for non highly specialized staff. Experimental data show that there is poor correlation between MBG assessment of low and high specialized staff, thus reducing its applicability. This paper proposes a method able to achieve an objective measure of MBG, or a set of parameters that correlates with the MBG. The method tracks the blush area starting from just one single frame tagged by the physician. As a consequence, the blush area is kept isolated from contaminating phenomena such as diaphragm and arteries movements. We also present a method to extract four parameters that are expected to correlate with the MBG. Preliminary results show that the method is capable of extracting interesting information regarding the behavior of the myocardial perfusion.
Address	Póvoa de Varzim, Portugal
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-02171-8	Medium
Area		Expedition		Conference	IbPRIA
Notes	MILAB			Approved	no
Call Number	BCNPCL @ bcnpcl @ GGC2009			Serial	1161
Permanent link to this record



Author	Carlo Gatta; Oriol Pujol; Oriol Rodriguez-Leor; J. M. Ferre; Petia Radeva
Title	Fast Rigid Registration of Vascular Structures in IVUS Sequences			Type	Journal Article
Year	2009	Publication	IEEE Transactions on Information Technology in Biomedicine	Abbreviated Journal
Volume	13	Issue	6	Pages	106-1011
Keywords
Abstract	Intravascular ultrasound (IVUS) technology permits visualization of high-resolution images of internal vascular structures. IVUS is a unique image-guiding tool to display longitudinal view of the vessels, and estimate the length and size of vascular structures with the goal of accurate diagnosis. Unfortunately, due to pulsatile contraction and expansion of the heart, the captured images are affected by different motion artifacts that make visual inspection difficult. In this paper, we propose an efficient algorithm that aligns vascular structures and strongly reduces the saw-shaped oscillation, simplifying the inspection of longitudinal cuts; it reduces the motion artifacts caused by the displacement of the catheter in the short-axis plane and the catheter rotation due to vessel tortuosity. The algorithm prototype aligns 3.16 frames/s and clearly outperforms state-of-the-art methods with similar computational cost. The speed of the algorithm is crucial since it allows to inspect the corrected sequence during patient intervention. Moreover, we improved an indirect methodology for IVUS rigid registration algorithm evaluation.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1089-7771	ISBN		Medium
Area		Expedition		Conference
Notes	MILAB;HuPBA			Approved	no
Call Number	BCNPCL @ bcnpcl @ GPL2009			Serial	1250
Permanent link to this record



Author	D. Jayagopi; Bogdan Raducanu; D. Gatica-Perez
Title	Characterizing conversational group dynamics using nonverbal behaviour			Type	Conference Article
Year	2009	Publication	10th IEEE International Conference on Multimedia and Expo	Abbreviated Journal
Volume		Issue		Pages	370–373
Keywords
Abstract	This paper addresses the novel problem of characterizing conversational group dynamics. It is well documented in social psychology that depending on the objectives a group, the dynamics are different. For example, a competitive meeting has a different objective from that of a collaborative meeting. We propose a method to characterize group dynamics based on the joint description of a group members' aggregated acoustical nonverbal behaviour to classify two meeting datasets (one being cooperative-type and the other being competitive-type). We use 4.5 hours of real behavioural multi-party data and show that our methodology can achieve a classification rate of upto 100%.
Address	New York, USA
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1945-7871	ISBN	978-1-4244-4290-4	Medium
Area		Expedition		Conference	ICME
Notes	OR;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ JRG2009			Serial	1217
Permanent link to this record



Author	Agata Lapedriza
Title	Multitask Learning Techniques for Automatic Face Classification			Type	Book Whole
Year	2009	Publication	PhD Thesis, Universitat Autonoma de Barcelona-CVC	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract	Automatic face classification is currently a popular research area in Computer Vision. It involves several subproblems, such as subject recognition, gender classification or subject verification. Current systems of automatic face classification need a large amount of training data to robustly learn a task. However, the collection of labeled data is usually a difficult issue. For this reason, the research on methods that are able to learn from a small sized training set is essential. The dependency on the abundance of training data is not so evident in human learning processes. We are able to learn from a very small number of examples, given that we use, additionally, some prior knowledge to learn a new task. For example, we frequently find patterns and analogies from other domains to reuse them in new situations, or exploit training data from other experiences. In computer science, Multitask Learning is a new Machine Learning approach that studies this idea of knowledge transfer among different tasks, to overcome the effects of the small sample sized problem. This thesis explores, proposes and tests some Multitask Learning methods specially developed for face classification purposes. Moreover, it presents two more contributions dealing with the small sample sized problem, out of the Multitask Learning context. The first one is a method to extract external face features, to be used as an additional information source in automatic face classification problems. The second one is an empirical study on the most suitable face image resolution to perform automatic subject recognition.
Address	Barcelona (Spain)
Corporate Author				Thesis	Ph.D. thesis
Publisher	Ediciones Graficas Rey	Place of Publication		Editor	Jordi Vitria;David Masip
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	OR;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ Lap2009			Serial	1263
Permanent link to this record



Author	Mehdi Mirza-Mohammadi; Sergio Escalera; Petia Radeva
Title	Contextual-Guided Bag-of-Visual-Words Model for Multi-class Object Categorization			Type	Conference Article
Year	2009	Publication	13th International Conference on Computer Analysis of Images and Patterns	Abbreviated Journal
Volume	5702	Issue		Pages	748–756
Keywords
Abstract	Bag-of-words model (BOW) is inspired by the text classification problem, where a document is represented by an unsorted set of contained words. Analogously, in the object categorization problem, an image is represented by an unsorted set of discrete visual words (BOVW). In these models, relations among visual words are performed after dictionary construction. However, close object regions can have far descriptions in the feature space, being grouped as different visual words. In this paper, we present a method for considering geometrical information of visual words in the dictionary construction step. Object interest regions are obtained by means of the Harris-Affine detector and then described using the SIFT descriptor. Afterward, a contextual-space and a feature-space are defined, and a merging process is used to fuse feature words based on their proximity in the contextual-space. Moreover, we use the Error Correcting Output Codes framework to learn the new dictionary in order to perform multi-class classification. Results show significant classification improvements when spatial information is taken into account in the dictionary construction step.
Address
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-03766-5	Medium
Area		Expedition		Conference	CAIP
Notes	HuPBA; MILAB			Approved	no
Call Number	BCNPCL @ bcnpcl @ MEP2009			Serial	1185
Permanent link to this record