Publicacions CVC -- Query Results

<< 1 2 3 4 5 6 7 8 9 10 >>

Details

Records
Author	Albert Gordo; Ernest Valveny
Title	The diagonal split: A pre-segmentation step for page layout analysis & classification			Type	Conference Article
Year	2009	Publication	4th Iberian Conference on Pattern Recognition and Image Analysis	Abbreviated Journal
Volume	5524	Issue		Pages	290–297
Keywords
Abstract	Document classification is an important task in all the processes related to document storage and retrieval. In the case of complex documents, structural features are needed to achieve a correct classification. Unfortunately, physical layout analysis is error prone. In this paper we present a pre-segmentation step based on a divide & conquer strategy that can be used to improve the page segmentation results, independently of the segmentation algorithm used. This pre-segmentation step is evaluated in classification and retrieval using the selective CRLA algorithm for layout segmentation together with a clustering based on the voronoi area diagram, and tested on two different databases, MARG and Girona Archives.
Address	Póvoa de Varzim, Portugal
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-02171-8	Medium
Area		Expedition		Conference	IbPRIA
Notes	DAG			Approved	no
Call Number	DAG @ dag @ Gov2009b			Serial	1176
Permanent link to this record



Author	Pierluigi Casale; Oriol Pujol; Petia Radeva
Title	Face-to-face social activity detection using data collected with a wearable device			Type	Conference Article
Year	2009	Publication	4th Iberian Conference on Pattern Recognition and Image Analysis	Abbreviated Journal
Volume	5524	Issue		Pages	56–63
Keywords
Abstract	In this work the feasibility of building a socially aware badge that learns from user activities is explored. A wearable multisensor device has been prototyped for collecting data about user movements and photos of the environment where the user acts. Using motion data, speaking and other activities have been classified. Images have been analysed in order to complement motion data and help for the detection of social behaviours. A face detector and an activity classifier are both used for detecting if users have a social activity in the time they worn the device. Good results encourage the improvement of the system at both hardware and software level
Address	Póvoa de Varzim, Portugal
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-02171-8	Medium
Area		Expedition		Conference	IbPRIA
Notes	MILAB;HuPBA			Approved	no
Call Number	BCNPCL @ bcnpcl @ CPR2009b			Serial	1206
Permanent link to this record



Author	Marco Pedersoli; Jordi Gonzalez; Juan J. Villanueva
Title	High-Speed Human Detection Using a Multiresolution Cascade of Histograms of Oriented Gradients			Type	Conference Article
Year	2009	Publication	4th Iberian Conference on Pattern Recognition and Image Analysis	Abbreviated Journal
Volume	5524	Issue		Pages
Keywords
Abstract	This paper presents a new method for human detection based on a multiresolution cascade of Histograms of Oriented Gradients (HOG) that can highly reduce the computational cost of the detection search without affecting accuracy. The method consists of a cascade of sliding window detectors. Each detector is a Support Vector Machine (SVM) composed by features at different resolution, from coarse for the first level to fine for the last one. Considering that the spatial stride of the sliding window search is affected by the HOG features size, unlike previous methods based on Adaboost cascades, we can adopt a spatial stride inversely proportional to the features resolution. This produces that the speed-up of the cascade is not only due to the low number of features that need to be computed in the first levels, but also to the lower number of detection windows that needs to be evaluated. Experimental results shows that our method permits a detection rate comparable with the state of the art, but at the same time a gain in the speed of the detection search of 10-20 times depending on the cascade configuration.
Address	Póvoa de Varzim, Portugal
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-02171-8	Medium
Area		Expedition		Conference	IbPRIA
Notes	ISE			Approved	no
Call Number	ISE @ ise @ PGV2009			Serial	1214
Permanent link to this record



Author	Bhaskar Chakraborty; Andrew Bagdanov; Jordi Gonzalez
Title	Towards Real-Time Human Action Recognition			Type	Conference Article
Year	2009	Publication	4th Iberian Conference on Pattern Recognition and Image Analysis	Abbreviated Journal
Volume	5524	Issue		Pages
Keywords
Abstract	This work presents a novel approach to human detection based action-recognition in real-time. To realize this goal our method first detects humans in different poses using a correlation-based approach. Recognition of actions is done afterward based on the change of the angular values subtended by various body parts. Real-time human detection and action recognition are very challenging, and most state-of-the-art approaches employ complex feature extraction and classification techniques, which ultimately becomes a handicap for real-time recognition. Our correlation-based method, on the other hand, is computationally efficient and uses very simple gradient-based features. For action recognition angular features of body parts are extracted using a skeleton technique. Results for action recognition are comparable with the present state-of-the-art.
Address	Póvoa de Varzim, Portugal
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-02171-8	Medium
Area		Expedition		Conference	IbPRIA
Notes	ISE			Approved	no
Call Number	DAG @ dag @ CBG2009			Serial	1215
Permanent link to this record



Author	Fernando Vilariño; Panagiota Spyridonos; Petia Radeva; Jordi Vitria; Fernando Azpiroz; Juan Malagelada
Title	Device, system and method for measurement and analysis of contractile activity			Type	Patent
Year	2009	Publication	US 2009/0202117 A1	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract	A method and system for determining intestinal dysfunction condition are provided by classifying and analyzing image frames captured in-vivo. The method and system also relate to the detection of contractile activity in intestinal tracts, to automatic detection of video image frames taken in the gastrointestinal tract including contractile activity, and more particularly to measurement and analysis of contractile activity of the GI tract based on image intensity of in vivo image data.
Address	Pearl Cohen Zedek Latzer
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area	800	Expedition		Conference
Notes	MV;OR;MILAB;SIAI			Approved	no
Call Number	IAM @ iam @ VSR2009			Serial	1704
Permanent link to this record



Author	Sergio Escalera; Oriol Pujol; Petia Radeva
Title	Recoding Error-Correcting Output Codes			Type	Conference Article
Year	2009	Publication	8th International Workshop of Multiple Classifier Systems	Abbreviated Journal
Volume	5519	Issue		Pages	11–21
Keywords
Abstract	One of the most widely applied techniques to deal with multi- class categorization problems is the pairwise voting procedure. Recently, this classical approach has been embedded in the Error-Correcting Output Codes framework (ECOC). This framework is based on a coding step, where a set of binary problems are learnt and coded in a matrix, and a decoding step, where a new sample is tested and classified according to a comparison with the positions of the coded matrix. In this paper, we present a novel approach to redefine without retraining, in a problem-dependent way, the one-versus-one coding matrix so that the new coded information increases the generalization capability of the system. Moreover, the final classification can be tuned with the inclusion of a weighting matrix in the decoding step. The approach has been validated over several UCI Machine Learning repository data sets and two real multi-class problems: traffic sign and face categorization. The results show that performance improvements are obtained when comparing the new approach to one of the best ECOC designs (one-versus-one). Furthermore, the novel methodology obtains at least the same performance than the one-versus-one ECOC design.
Address	Reykjavik (Iceland)
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-02325-5	Medium
Area		Expedition		Conference	MCS
Notes	MILAB;HuPBA			Approved	no
Call Number	BCNPCL @ bcnpcl @ EPR2009d			Serial	1190
Permanent link to this record



Author	Oriol Pujol; Eloi Puertas; Carlo Gatta
Title	Multi-scale Stacked Sequential Learning			Type	Conference Article
Year	2009	Publication	8th International Workshop of Multiple Classifier Systems	Abbreviated Journal
Volume	5519	Issue		Pages	262–271
Keywords
Abstract	One of the most widely used assumptions in supervised learning is that data is independent and identically distributed. This assumption does not hold true in many real cases. Sequential learning is the discipline of machine learning that deals with dependent data such that neighboring examples exhibit some kind of relationship. In the literature, there are different approaches that try to capture and exploit this correlation, by means of different methodologies. In this paper we focus on meta-learning strategies and, in particular, the stacked sequential learning approach. The main contribution of this work is two-fold: first, we generalize the stacked sequential learning. This generalization reflects the key role of neighboring interactions modeling. Second, we propose an effective and efficient way of capturing and exploiting sequential correlations that takes into account long-range interactions by means of a multi-scale pyramidal decomposition of the predicted labels. Additionally, this new method subsumes the standard stacked sequential learning approach. We tested the proposed method on two different classification tasks: text lines classification in a FAQ data set and image classification. Results on these tasks clearly show that our approach outperforms the standard stacked sequential learning. Moreover, we show that the proposed method allows to control the trade-off between the detail and the desired range of the interactions.
Address	Reykjavik, Iceland
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-02325-5	Medium
Area		Expedition		Conference	MCS
Notes	MILAB;HuPBA			Approved	no
Call Number	BCNPCL @ bcnpcl @ PPG2009			Serial	1260
Permanent link to this record



Author	Sergio Escalera; Alicia Fornes; Oriol Pujol; Petia Radeva
Title	Multi-class Binary Symbol Classification with Circular Blurred Shape Models			Type	Conference Article
Year	2009	Publication	15th International Conference on Image Analysis and Processing	Abbreviated Journal
Volume	5716	Issue		Pages	1005–1014
Keywords
Abstract	Multi-class binary symbol classification requires the use of rich descriptors and robust classifiers. Shape representation is a difficult task because of several symbol distortions, such as occlusions, elastic deformations, gaps or noise. In this paper, we present the Circular Blurred Shape Model descriptor. This descriptor encodes the arrangement information of object parts in a correlogram structure. A prior blurring degree defines the level of distortion allowed to the symbol. Moreover, we learn the new feature space using a set of Adaboost classifiers, which are combined in the Error-Correcting Output Codes framework to deal with the multi-class categorization problem. The presented work has been validated over different multi-class data sets, and compared to the state-of-the-art descriptors, showing significant performance improvements.
Address	Salerno, Italy
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-04145-7	Medium
Area		Expedition		Conference	ICIAP
Notes	MILAB;HuPBA;DAG			Approved	no
Call Number	BCNPCL @ bcnpcl @ EFP2009c			Serial	1186
Permanent link to this record



Author	Maria Salamo; Sergio Escalera; Petia Radeva
Title	Quality Enhancement based on Reinforcement Learning and Feature Weighting for a Critiquing-Based Recommender			Type	Conference Article
Year	2009	Publication	8th International Conference on Case-Based Reasoning	Abbreviated Journal
Volume	5650	Issue		Pages	298–312
Keywords
Abstract	Personalizing the product recommendation task is a major focus of research in the area of conversational recommender systems. Conversational case-based recommender systems help users to navigate through product spaces, alternatively making product suggestions and eliciting users feedback. Critiquing is a common form of feedback and incremental critiquing-based recommender system has shown its efficiency to personalize products based primarily on a quality measure. This quality measure influences the recommendation process and it is obtained by the combination of compatibility and similarity scores. In this paper, we describe new compatibility strategies whose basis is on reinforcement learning and a new feature weighting technique which is based on the user’s history of critiques. Moreover, we show that our methodology can significantly improve recommendation efficiency in comparison with the state-of-the-art approaches.
Address	Seattle, USA
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-02998-1	Medium
Area		Expedition		Conference	ICCBR
Notes	HuPBA; MILAB			Approved	no
Call Number	BCNPCL @ bcnpcl @ SER2009			Serial	1187
Permanent link to this record



Author	Murad Al Haj; Andrew Bagdanov; Jordi Gonzalez; Xavier Roca
Title	Robust and Efficient Multipose Face Detection Using Skin Color Segmentation			Type	Conference Article
Year	2009	Publication	4th Iberian Conference on Pattern Recognition and Image Analysis	Abbreviated Journal
Volume	5524	Issue		Pages
Keywords
Abstract	In this paper we describe an efficient technique for detecting faces in arbitrary images and video sequences. The approach is based on segmentation of images or video frames into skin-colored blobs using a pixel-based heuristic. Scale and translation invariant features are then computed from these segmented blobs which are used to perform statistical discrimination between face and non-face classes. We train and evaluate our method on a standard, publicly available database of face images and analyze its performance over a range of statistical pattern classifiers. The generalization of our approach is illustrated by testing on an independent sequence of frames containing many faces and non-faces. These experiments indicate that our proposed approach obtains false positive rates comparable to more complex, state-of-the-art techniques, and that it generalizes better to new data. Furthermore, the use of skin blobs and invariant features requires fewer training samples since significantly fewer non-face candidate regions must be considered when compared to AdaBoost-based approaches.
Address	Springer Berlin Heidelberg
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-02171-8	Medium
Area		Expedition		Conference	IbPRIA
Notes	ISE			Approved	no
Call Number	DAG @ dag @ ABG2009			Serial	1216
Permanent link to this record



Author	Bogdan Raducanu; Jordi Vitria; D. Gatica-Perez
Title	You are Fired! Nonverbal Role Analysis in Competitive Meetings			Type	Conference Article
Year	2009	Publication	IEEE International Conference on Audio, Speech and Signal Processing	Abbreviated Journal
Volume		Issue		Pages	1949–1952
Keywords
Abstract	This paper addresses the problem of social interaction analysis in competitive meetings, using nonverbal cues. For our study, we made use of ldquoThe Apprenticerdquo reality TV show, which features a competition for a real, highly paid corporate job. Our analysis is centered around two tasks regarding a person's role in a meeting: predicting the person with the highest status and predicting the fired candidates. The current study was carried out using nonverbal audio cues. Results obtained from the analysis of a full season of the show, representing around 90 minutes of audio data, are very promising (up to 85.7% of accuracy in the first case and up to 92.8% in the second case). Our approach is based only on the nonverbal interaction dynamics during the meeting without relying on the spoken words.
Address	Taipei, Taiwan
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1520-6149	ISBN	978-1-4244-2353-8	Medium
Area		Expedition		Conference	ICASSP
Notes	OR;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ RVG2009			Serial	1154
Permanent link to this record



Author	Fadi Dornaika; Bogdan Raducanu
Title	Simultaneous 3D face pose and person-specific shape estimation from a single image using a holistic approach			Type	Conference Article
Year	2009	Publication	IEEE Workshop on Applications of Computer Vision	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract	This paper presents a new approach for the simultaneous estimation of the 3D pose and specific shape of a previously unseen face from a single image. The face pose is not limited to a frontal view. We describe a holistic approach based on a deformable 3D model and a learned statistical facial texture model. Rather than obtaining a person-specific facial surface, the goal of this work is to compute person-specific 3D face shape in terms of a few control parameters that are used by many applications. The proposed holistic approach estimates the 3D pose parameters as well as the face shape control parameters by registering the warped texture to a statistical face texture, which is carried out by a stochastic and genetic optimizer. The proposed approach has several features that make it very attractive: (i) it uses a single grey-scale image, (ii) it is person-independent, (iii) it is featureless (no facial feature extraction is required), and (iv) its learning stage is easy. The proposed approach lends itself nicely to 3D face tracking and face gesture recognition in monocular videos. We describe extensive experiments that show the feasibility and robustness of the proposed approach.
Address	Utah, USA
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1550-5790	ISBN	978-1-4244-5497-6	Medium
Area		Expedition		Conference	WACV
Notes	OR;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ DoR2009b			Serial	1256
Permanent link to this record



Author	Miquel Ferrer; Dimosthenis Karatzas; Ernest Valveny; Horst Bunke
Title	A Recursive Embedding Approach to Median Graph Computation			Type	Conference Article
Year	2009	Publication	7th IAPR – TC–15 Workshop on Graph–Based Representations in Pattern Recognition	Abbreviated Journal
Volume	5534	Issue		Pages	113–123
Keywords
Abstract	The median graph has been shown to be a good choice to infer a representative of a set of graphs. It has been successfully applied to graph-based classification and clustering. Nevertheless, its computation is extremely complex. Several approaches have been presented up to now based on different strategies. In this paper we present a new approximate recursive algorithm for median graph computation based on graph embedding into vector spaces. Preliminary experiments on three databases show that this new approach is able to obtain better medians than the previous existing approaches.
Address	Venice, Italy
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-02123-7	Medium
Area		Expedition		Conference	GBR
Notes	DAG			Approved	no
Call Number	DAG @ dag @ FKV2009			Serial	1173
Permanent link to this record



Author	L.Tarazon; D. Perez; N. Serrano; V. Alabau; Oriol Ramos Terrades; A. Sanchis; A. Juan
Title	Confidence Measures for Error Correction in Interactive Transcription of Handwritten Text			Type	Conference Article
Year	2009	Publication	15th International Conference on Image Analysis and Processing	Abbreviated Journal
Volume	5716	Issue		Pages	567-574
Keywords
Abstract	An effective approach to transcribe old text documents is to follow an interactive-predictive paradigm in which both, the system is guided by the human supervisor, and the supervisor is assisted by the system to complete the transcription task as efficiently as possible. In this paper, we focus on a particular system prototype called GIDOC, which can be seen as a first attempt to provide user-friendly, integrated support for interactive-predictive page layout analysis, text line detection and handwritten text transcription. More specifically, we focus on the handwriting recognition part of GIDOC, for which we propose the use of confidence measures to guide the human supervisor in locating possible system errors and deciding how to proceed. Empirical results are reported on two datasets showing that a word error rate not larger than a 10% can be achieved by only checking the 32% of words that are recognised with less confidence.
Address	Vietri sul Mare, Italy
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-04145-7	Medium
Area		Expedition		Conference	ICIAP
Notes	DAG			Approved	no
Call Number	Admin @ si @ TPS2009			Serial	1871
Permanent link to this record