Publicacions CVC -- Query Results

	31–45 of 170 records found matching your query:

	Records
	Author	Anjan Dutta; Umapada Pal; Alicia Fornes; Josep Llados
	Title	An Efficient Staff Removal Technique from Printed Musical Documents			Type	Conference Article
	Year	2010	Publication	20th International Conference on Pattern Recognition	Abbreviated Journal
	Volume		Issue		Pages	1965–1968
	Keywords
	Abstract	Staff removal is an important preprocessing step in Optical Music Recognition (OMR). The process aims to remove the stafflines from a musical document and retain only the musical symbols; these symbols are later used to identify the music information. This paper proposes a simple but robust method to remove stafflines from printed musical scores. In the proposed methodology we consider a staffline segment as a horizontal linkage of vertical black runs with uniform height. We use the neighbouring properties of a staffline segment to validate it as a true segment. We have considered the dataset along with the deformations described in for evaluation purpose. The experiments gave encouraging results.
	Address	Istanbul (Turkey)
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1051-4651	ISBN	978-1-4244-7542-1	Medium
	Area		Expedition		Conference	ICPR
	Notes	DAG			Approved	no
	Call Number	DAG @ dag @ DPF2010			Serial	1420
Permanent link to this record
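
The notion of a vertical black run mentioned in the abstract above can be illustrated with a small sketch. This is not the authors' implementation: the function names `vertical_black_runs` and `estimate_staffline_height`, the toy image, and the use of the most common run height as the staffline thickness are all assumptions made for illustration. The neighbourhood validation step described in the abstract is not reproduced here.

```python
from collections import Counter


def vertical_black_runs(image):
    """Return (column, start_row, height) for every vertical run of black pixels.

    `image` is a list of rows; 1 means black, 0 means white.
    """
    runs = []
    n_rows = len(image)
    n_cols = len(image[0]) if image else 0
    for col in range(n_cols):
        row = 0
        while row < n_rows:
            if image[row][col] == 1:
                start = row
                # Walk down the column until the run of black pixels ends.
                while row < n_rows and image[row][col] == 1:
                    row += 1
                runs.append((col, start, row - start))
            else:
                row += 1
    return runs


def estimate_staffline_height(runs):
    """Assumption: the most common run height approximates the staffline thickness."""
    heights = Counter(height for _, _, height in runs)
    return heights.most_common(1)[0][0]


# Toy image (hypothetical): a 1-pixel staffline on row 2, crossed by a taller
# symbol stroke in column 3.
IMAGE = [
    [0, 0, 0, 1, 0, 0],
    [0, 0, 0, 1, 0, 0],
    [1, 1, 1, 1, 1, 1],
    [0, 0, 0, 1, 0, 0],
    [0, 0, 0, 0, 0, 0],
]

RUNS = vertical_black_runs(IMAGE)
STAFF_HEIGHT = estimate_staffline_height(RUNS)
# Runs of the estimated height are staffline candidates; taller runs are kept as symbols.
CANDIDATES = [run for run in RUNS if run[2] == STAFF_HEIGHT]
```

In this toy case the run in column 3 is four pixels tall, so it is not a staffline candidate and the symbol stroke survives the removal.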



	Author	Alicia Fornes; Sergio Escalera; Josep Llados; Ernest Valveny
	Title	Symbol Classification using Dynamic Aligned Shape Descriptor			Type	Conference Article
	Year	2010	Publication	20th International Conference on Pattern Recognition	Abbreviated Journal
	Volume		Issue		Pages	1957–1960
	Keywords
	Abstract	Shape representation is a difficult task because of several symbol distortions, such as occlusions, elastic deformations, gaps or noise. In this paper, we propose a new descriptor and distance computation for coping with the problem of symbol recognition in the domain of Graphical Document Image Analysis. The proposed D-Shape descriptor encodes the arrangement information of object parts in a circular structure, allowing different levels of distortion. The classification is performed using a cyclic Dynamic Time Warping based method, allowing distortions and rotation. The methodology has been validated on different data sets, showing very high recognition rates.
	Address	Istanbul (Turkey)
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1051-4651	ISBN	978-1-4244-7542-1	Medium
	Area		Expedition		Conference	ICPR
	Notes	DAG; HUPBA; MILAB			Approved	no
	Call Number	BCNPCL @ bcnpcl @ FEL2010			Serial	1421
Permanent link to this record



	Author	Susana Alvarez; Anna Salvatella; Maria Vanrell; Xavier Otazu
	Title	Perceptual color texture codebooks for retrieving in highly diverse texture datasets			Type	Conference Article
	Year	2010	Publication	20th International Conference on Pattern Recognition	Abbreviated Journal
	Volume		Issue		Pages	866–869
	Keywords
	Abstract	Color and texture are visual cues of different natures; integrating them into a useful visual descriptor is not an obvious step. One way to combine both features is to compute texture descriptors independently on each color channel. A second way is to integrate the features at the descriptor level, in which case the problem of normalizing both cues arises. Significant progress in object recognition in recent years has provided the bag-of-words framework, which again deals with the problem of feature combination through the definition of vocabularies of visual words. Inspired by this framework, we present perceptual textons that allow color and texture to be fused at the level of p-blobs, which is our feature detection step. Feature representation is based on two uniform spaces representing the attributes of the p-blobs. The low dimensionality of these texton spaces bypasses the usual problems of previous approaches: first, there is no need for normalization between cues; second, vocabularies are obtained directly from the perceptual properties of the texton spaces without any learning step. Our proposal improves on the current state of the art in color-texture descriptors in an image retrieval experiment over a highly diverse texture dataset from Corel.
	Address	Istanbul (Turkey)
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1051-4651	ISBN	978-1-4244-7542-1	Medium
	Area		Expedition		Conference	ICPR
	Notes	CIC			Approved	no
	Call Number	CAT @ cat @ ASV2010b			Serial	1426
Permanent link to this record



	Author	Marçal Rusiñol; Farshad Nourbakhsh; Dimosthenis Karatzas; Ernest Valveny; Josep Llados
	Title	Perceptual Image Retrieval by Adding Color Information to the Shape Context Descriptor			Type	Conference Article
	Year	2010	Publication	20th International Conference on Pattern Recognition	Abbreviated Journal
	Volume		Issue		Pages	1594–1597
	Keywords
	Abstract	In this paper we present a method for the retrieval of images in terms of perceptual similarity. Local color information is added to the shape context descriptor in order to obtain an object description integrating both shape and color as visual cues. We use a color naming algorithm in order to represent the color information from a perceptual point of view. The proposed method has been tested in two different applications, an object retrieval scenario based on color sketch queries and a color trademark retrieval problem. Experimental results show that the addition of the color information significantly outperforms the sole use of the shape context descriptor.
	Address	Istanbul (Turkey)
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1051-4651	ISBN	978-1-4244-7542-1	Medium
	Area		Expedition		Conference	ICPR
	Notes	DAG			Approved	no
	Call Number	DAG @ dag @ RNK2010			Serial	1435
Permanent link to this record



	Author	Muhammad Muzzamil Luqman; Josep Llados; Jean-Yves Ramel; Thierry Brouard
	Title	A Fuzzy-Interval Based Approach For Explicit Graph Embedding, Recognizing Patterns in Signals, Speech, Images and Video			Type	Conference Article
	Year	2010	Publication	20th International Conference on Pattern Recognition	Abbreviated Journal
	Volume	6388	Issue		Pages	93–98
	Keywords
	Abstract	We present a new method for explicit graph embedding. Our algorithm extracts a feature vector for an undirected attributed graph. The proposed feature vector encodes details about the number of nodes, number of edges, node degrees, the attributes of nodes and the attributes of edges in the graph. The first two features are for the number of nodes and the number of edges. These are followed by w features for node degrees, m features for k node attributes and n features for l edge attributes — which represent the distribution of node degrees, node attribute values and edge attribute values, and are obtained by defining (in an unsupervised fashion), fuzzy-intervals over the list of node degrees, node attributes and edge attributes. Experimental results are provided for sample data of ICPR2010 contest GEPR.
	Address
	Corporate Author				Thesis
	Publisher	Springer, Heidelberg	Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title	LNCS
	Series Volume		Series Issue		Edition
	ISSN	0302-9743	ISBN	978-3-642-17710-1	Medium
	Area		Expedition		Conference	ICPR
	Notes	DAG			Approved	no
	Call Number	DAG @ dag @ LLR2010			Serial	1459
Permanent link to this record



	Author	Muhammad Muzzamil Luqman; Thierry Brouard; Jean-Yves Ramel; Josep Llados
	Title	A Content Spotting System For Line Drawing Graphic Document Images			Type	Conference Article
	Year	2010	Publication	20th International Conference on Pattern Recognition	Abbreviated Journal
	Volume	20	Issue		Pages	3420–3423
	Keywords
	Abstract	We present a content spotting system for line drawing graphic document images. The proposed system is sufficiently domain independent and takes keyword-based information retrieval for graphic documents one step forward, to Query By Example (QBE) and focused retrieval. In offline learning mode, we vectorize the documents in the repository, represent them by attributed relational graphs, extract regions of interest (ROIs) from them, convert each ROI to a fuzzy structural signature, cluster similar signatures to form ROI classes and build an index for the repository. In online querying mode, a Bayesian network classifier recognizes the ROIs in the query image and the corresponding documents are fetched by looking them up in the repository index. Experimental results are presented for synthetic images of architectural and electronic documents.
	Address
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1051-4651	ISBN	978-1-4244-7542-1	Medium
	Area		Expedition		Conference	ICPR
	Notes	DAG			Approved	no
	Call Number	DAG @ dag @ LBR2010b			Serial	1460
Permanent link to this record



	Author	Albert Gordo; Florent Perronnin
	Title	A Bag-of-Pages Approach to Unordered Multi-Page Document Classification			Type	Conference Article
	Year	2010	Publication	20th International Conference on Pattern Recognition	Abbreviated Journal
	Volume		Issue		Pages	1920–1923
	Keywords
	Abstract	We consider the problem of classifying documents containing multiple unordered pages. For this purpose, we propose a novel bag-of-pages document representation. To represent a document, one assigns every page to a prototype in a codebook of pages. This leads to a histogram representation which can then be fed to any discriminative classifier. We also consider several refinements over this initial approach. We show on two challenging datasets that the proposed approach significantly outperforms a baseline system.
	Address	Istanbul (Turkey)
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1051-4651	ISBN	978-1-4244-7542-1	Medium
	Area		Expedition		Conference	ICPR
	Notes	DAG			Approved	no
	Call Number	Admin @ si @ GoP2010			Serial	1480
Permanent link to this record
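
The bag-of-pages idea in the abstract above (assign every page to a codebook prototype, then build a histogram) can be sketched as follows. This is a minimal illustration, not the authors' method: the page feature vectors, the codebook values, the squared Euclidean distance, and the normalization of the histogram are assumptions, and the codebook is taken as given rather than learned.

```python
def nearest_prototype(page, codebook):
    """Index of the codebook prototype closest to `page` (squared Euclidean distance)."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(codebook)), key=lambda k: sq_dist(page, codebook[k]))


def bag_of_pages(pages, codebook):
    """Normalized histogram of prototype assignments; page order does not matter."""
    counts = [0] * len(codebook)
    for page in pages:
        counts[nearest_prototype(page, codebook)] += 1
    total = len(pages)
    return [c / total for c in counts] if total else counts


# Hypothetical 2-D page descriptors and a 3-prototype codebook.
CODEBOOK = [[0.0, 0.0], [1.0, 1.0], [0.0, 1.0]]
DOCUMENT = [[0.1, 0.0], [0.9, 1.1], [0.2, 0.1], [1.0, 0.8]]

HISTOGRAM = bag_of_pages(DOCUMENT, CODEBOOK)
# Shuffling the pages leaves the representation unchanged.
HISTOGRAM_REVERSED = bag_of_pages(list(reversed(DOCUMENT)), CODEBOOK)
```

The resulting fixed-length vector could then be passed to any discriminative classifier, as the abstract describes.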



	Author	Jaume Amores; David Geronimo; Antonio Lopez
	Title	Multiple instance and active learning for weakly-supervised object-class segmentation			Type	Conference Article
	Year	2010	Publication	3rd IEEE International Conference on Machine Vision	Abbreviated Journal
	Volume		Issue		Pages
	Keywords	Multiple Instance Learning; Active Learning; Object-class segmentation.
	Abstract	In object-class segmentation, one of the most tedious tasks is to manually segment many object examples in order to learn a model of the object category. Yet, there has been little research on reducing the degree of manual annotation for object-class segmentation. In this work we explore alternative strategies which do not require full manual segmentation of the object in the training set. In particular, we study the use of bounding boxes as a coarser and much cheaper form of segmentation, and we perform a comparative study of several Multiple-Instance Learning techniques that allow a model to be obtained with this type of weak annotation. We show that some of these methods, when used with coarse segmentations, can be competitive with methods that require full manual segmentation of the objects. Furthermore, we show how to use active learning combined with this weakly supervised strategy. As we show, this strategy makes it possible to reduce the amount of annotation and optimize the number of examples that require full manual segmentation in the training set.
	Address	Hong Kong
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	ICMV
	Notes	ADAS			Approved	no
	Call Number	ADAS @ adas @ AGL2010b			Serial	1429
Permanent link to this record



	Author	Sergio Escalera; Petia Radeva; Jordi Vitria; Xavier Baro; Bogdan Raducanu
	Title	Modelling and Analyzing Multimodal Dyadic Interactions Using Social Networks			Type	Conference Article
	Year	2010	Publication	12th International Conference on Multimodal Interfaces and 7th Workshop on Machine Learning for Multimodal Interaction.	Abbreviated Journal
	Volume		Issue		Pages
	Keywords	Social interaction; Multimodal fusion; Influence model; Social network analysis
	Abstract	Social network analysis has become a common technique used to model and quantify the properties of social interactions. In this paper, we propose an integrated framework to explore the characteristics of a social network extracted from multimodal dyadic interactions. First, speech detection is performed through an audio/visual fusion scheme based on stacked sequential learning. In the audio domain, speech is detected through clustering of audio features. Clusters are modelled by means of a one-state Hidden Markov Model containing a diagonal covariance Gaussian Mixture Model. In the visual domain, speech detection is performed through differential-based feature extraction from the segmented mouth region, and a dynamic programming matching procedure. Second, in order to model the dyadic interactions, we employ the Influence Model, whose states encode the previous integrated audio/visual data. Third, the social network is extracted based on the estimated influences. For our study, we used a set of videos belonging to New York Times’ Blogging Heads opinion blog. The results are reported both in terms of accuracy of the audio/visual data fusion and centrality measures used to characterize the social network.
	Address	Beijing (China)
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN		Medium
	Area		Expedition		Conference	ICMI-MLI
	Notes	OR;MILAB;HUPBA;MV			Approved	no
	Call Number	BCNPCL @ bcnpcl @ ERV2010			Serial	1427
Permanent link to this record



	Author	Fernando Barrera; Felipe Lumbreras; Angel Sappa
	Title	Multimodal Template Matching based on Gradient and Mutual Information using Scale-Space			Type	Conference Article
	Year	2010	Publication	17th IEEE International Conference on Image Processing	Abbreviated Journal
	Volume		Issue		Pages	2749–2752
	Keywords
	Abstract	This paper presents the combined use of gradient and mutual information for infrared and intensity template matching. We propose to join: (i) feature matching in a multiresolution context and (ii) information propagation through scale-space representations. Our method consists in combining mutual information with a shape descriptor based on gradient, and propagating them following a coarse-to-fine strategy. The main contributions of this work are: to offer a theoretical formulation towards multimodal stereo matching; to show that gradient and mutual information can be reinforced while they are propagated between consecutive levels; and to show that they are valid cost functions in multimodal template matching. Comparisons are presented showing the improvements and viability of the proposed approach.
	Address	Hong Kong
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1522-4880	ISBN	978-1-4244-7992-4	Medium
	Area		Expedition		Conference	ICIP
	Notes	ADAS			Approved	no
	Call Number	ADAS @ adas @ BLS2010			Serial	1358
Permanent link to this record



	Author	Mohammad Rouhani; Angel Sappa
	Title	A Fast accurate Implicit Polynomial Fitting Approach			Type	Conference Article
	Year	2010	Publication	17th IEEE International Conference on Image Processing	Abbreviated Journal
	Volume		Issue		Pages	1429–1432
	Keywords
	Abstract	This paper presents a novel hybrid approach that combines two kinds of state-of-the-art fitting algorithms: algebraic-based and geometric-based. It consists of two steps: first, the 3L algorithm is used as an initialization; then the obtained result is improved through a geometric approach. The adopted geometric approach is based on a distance estimation that avoids a costly search for the real orthogonal distance. Experimental results are presented as well as quantitative comparisons.
	Address	Hong Kong
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1522-4880	ISBN	978-1-4244-7992-4	Medium
	Area		Expedition		Conference	ICIP
	Notes	ADAS			Approved	no
	Call Number	ADAS @ adas @ RoS2010b			Serial	1359
Permanent link to this record



	Author	Susana Alvarez; Anna Salvatella; Maria Vanrell; Xavier Otazu
	Title	3D Texton Spaces for color-texture retrieval			Type	Conference Article
	Year	2010	Publication	7th International Conference on Image Analysis and Recognition	Abbreviated Journal
	Volume	6111	Issue		Pages	354–363
	Keywords
	Abstract	Color and texture are visual cues of different natures; integrating them into a useful visual descriptor is not an easy problem. One way to combine both features is to compute spatial texture descriptors independently on each color channel. Another way is to do the integration at the descriptor level. In this case the problem of normalizing both cues arises. In this paper we solve the latter problem by fusing color and texture through distances in texton spaces. Textons are the attributes of image blobs and they are responsible for texture discrimination as defined in Julesz’s Texton theory. We describe them in two low-dimensional and uniform spaces, namely, shape and color. The dissimilarity between color texture images is computed by combining the distances in these two spaces. Following this approach, we propose our TCD descriptor, which outperforms current state-of-the-art methods in the two different approaches mentioned above: early combination with LBP and late combination with MPEG-7. This is shown in an image retrieval experiment over a highly diverse texture dataset from Corel.
	Address
	Corporate Author				Thesis
	Publisher	Springer Berlin Heidelberg	Place of Publication		Editor	A.C. Campilho and M.S. Kamel
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title	LNCS
	Series Volume		Series Issue		Edition
	ISSN	0302-9743	ISBN	978-3-642-13771-6	Medium
	Area		Expedition		Conference	ICIAR
	Notes	CIC			Approved	no
	Call Number	CAT @ cat @ ASV2010a			Serial	1325
Permanent link to this record



	Author	Naveen Onkarappa; Angel Sappa
	Title	On-Board Monocular Vision System Pose Estimation through a Dense Optical Flow			Type	Conference Article
	Year	2010	Publication	7th International Conference on Image Analysis and Recognition	Abbreviated Journal
	Volume	6111	Issue		Pages	230–239
	Keywords
	Abstract	This paper presents a robust technique for estimating on-board monocular vision system pose. The proposed approach is based on a dense optical flow that is robust against shadows, reflections and illumination changes. A RANSAC-based scheme is used to cope with the outliers in the optical flow. The proposed technique is intended to be used in driver assistance systems for applications such as obstacle or pedestrian detection. Experimental results on different scenarios, from both synthetic and real sequences, show the usefulness of the proposed approach.
	Address	Povoa de Varzim (Portugal)
	Corporate Author				Thesis
	Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title	LNCS
	Series Volume		Series Issue		Edition
	ISSN	0302-9743	ISBN	978-3-642-13771-6	Medium
	Area		Expedition		Conference	ICIAR
	Notes	ADAS			Approved	no
	Call Number	ADAS @ adas @ OnS2010			Serial	1342
Permanent link to this record



	Author	Alicia Fornes; Josep Llados
	Title	A Symbol-dependent Writer Identification Approach in Old Handwritten Music Scores			Type	Conference Article
	Year	2010	Publication	12th International Conference on Frontiers in Handwriting Recognition	Abbreviated Journal
	Volume		Issue		Pages	634–639
	Keywords
	Abstract	Writer identification consists in determining the writer of a piece of handwriting from a set of writers. In this paper we introduce a symbol-dependent approach for identifying the writer of old music scores, which is based on two symbol recognition methods. The main idea is to use the Blurred Shape Model descriptor and a DTW-based method for detecting, recognizing and describing the music clefs and notes. The proposed approach has been evaluated on a database of old music scores, achieving very high writer identification rates.
	Address	Kolkata (India)
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN	978-1-4244-8353-2	Medium
	Area		Expedition		Conference	ICFHR
	Notes	DAG			Approved	no
	Call Number	DAG @ dag @ FoL2010			Serial	1321
Permanent link to this record



	Author	Salim Jouili; Salvatore Tabbone; Ernest Valveny
	Title	Comparing Graph Similarity Measures for Graphical Recognition			Type	Book Chapter
	Year	2010	Publication	Graphics Recognition. Achievements, Challenges, and Evolution. 8th International Workshop, GREC 2009. Selected Papers	Abbreviated Journal
	Volume	6020	Issue		Pages	37–48
	Keywords
	Abstract	In this paper we evaluate four graph distance measures. The analysis is performed for document retrieval tasks. To this end, different kinds of documents are used, including line drawings (symbols), ancient documents (ornamental letters), shapes and trademark logos. The experimental results show that the performance of each graph distance measure depends on the kind of data and the graph representation technique.
	Address
	Corporate Author				Thesis
	Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title	LNCS
	Series Volume		Series Issue		Edition
	ISSN	0302-9743	ISBN	978-3-642-13727-3	Medium
	Area		Expedition		Conference	GREC
	Notes	DAG			Approved	no
	Call Number	Admin @ si @ JTV2010			Serial	2404
Permanent link to this record