Publicacions CVC -- Query Results

Records
Author	Fahad Shahbaz Khan; Shida Beigpour; Joost Van de Weijer; Michael Felsberg
Title	Painting-91: A Large Scale Database for Computational Painting Categorization			Type	Journal Article
Year	2014	Publication	Machine Vision and Applications	Abbreviated Journal	MVAP
Volume	25	Issue	6	Pages	1385-1397
Keywords
Abstract	Computer analysis of visual art, especially paintings, is an interesting cross-disciplinary research domain. Most research on the analysis of paintings involves small to medium-sized datasets with their own specific settings. Interestingly, significant progress has been made in the field of object and scene recognition lately. A key factor in this success is the introduction and availability of benchmark datasets for evaluation. Surprisingly, such a benchmark setup is still missing in the area of computational painting categorization. In this work, we propose a novel large-scale dataset of digital paintings. The dataset consists of paintings from 91 different painters. We further show three applications of our dataset, namely artist categorization, style classification and saliency detection. We investigate how local and global features popular in image classification perform for the tasks of artist and style categorization. For both categorization tasks, our experimental results suggest that combining multiple features significantly improves the final performance. We show that state-of-the-art computer vision methods can correctly attribute 50% of unseen paintings to their painter in a large dataset and correctly identify their artistic style in over 60% of cases. Additionally, we explore the task of saliency detection on paintings and show experimental findings using state-of-the-art saliency estimation algorithms.
Address
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	0932-8092	ISBN		Medium
Area		Expedition		Conference
Notes	CIC; LAMP; 600.074; 600.079			Approved	no
Call Number	Admin @ si @ KBW2014			Serial	2510



Author	Alicia Fornes; Josep Llados; Joan Mas; Joana Maria Pujadas-Mora; Anna Cabre
Title	A Bimodal Crowdsourcing Platform for Demographic Historical Manuscripts			Type	Conference Article
Year	2014	Publication	Digital Access to Textual Cultural Heritage Conference	Abbreviated Journal
Volume		Issue		Pages	103-108
Keywords
Abstract	In this paper we present a crowdsourcing web-based application for extracting information from demographic handwritten document images. The proposed application integrates two points of view: the semantic information for demographic research, and the ground-truthing for document analysis research. Concretely, the application has the contents view, where the information is recorded into forms, and the labeling view, with the word labels for evaluating document analysis techniques. The crowdsourcing architecture makes it possible to accelerate the information extraction (many users can work simultaneously), validate the information, and easily provide feedback to the users. We finally show how the proposed application can be extended to other kinds of demographic historical manuscripts.
Address	Madrid; May 2014
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-4503-2588-2	Medium
Area		Expedition		Conference	DATeCH
Notes	DAG; 600.061; 602.006; 600.077			Approved	no
Call Number	Admin @ si @ FLM2014			Serial	2516



Author	P. Wang; V. Eglin; C. Garcia; C. Largeron; Josep Llados; Alicia Fornes
Title	A Novel Learning-free Word Spotting Approach Based on Graph Representation			Type	Conference Article
Year	2014	Publication	11th IAPR International Workshop on Document Analysis and Systems	Abbreviated Journal
Volume		Issue		Pages	207-211
Keywords
Abstract	Effective information retrieval on handwritten document images has always been a challenging task. In this paper, we propose a novel handwritten word spotting approach based on graph representation. The presented model comprises both topological and morphological signatures of handwriting. Skeleton-based graphs with Shape Context-labelled vertices are built for connected components. Each word image is represented as a sequence of graphs. To be robust to handwriting variations, an exhaustive merging process based on the DTW alignment result is introduced in the similarity measure between word images. To limit the computational complexity, an approximate graph edit distance approach using bipartite matching is employed for graph matching. The experiments on the George Washington dataset and the marriage records from the Barcelona Cathedral dataset demonstrate that the proposed approach outperforms the state-of-the-art structural methods.
Address	Tours; France; April 2014
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-4799-3243-6	Medium
Area		Expedition		Conference	DAS
Notes	DAG; 600.061; 602.006; 600.077			Approved	no
Call Number	Admin @ si @ WEG2014b			Serial	2517



Author	Claudio Baecchi; Francesco Turchini; Lorenzo Seidenari; Andrew Bagdanov; Alberto del Bimbo
Title	Fisher vectors over random density forest for object recognition			Type	Conference Article
Year	2014	Publication	22nd International Conference on Pattern Recognition	Abbreviated Journal
Volume		Issue		Pages	4328-4333
Keywords
Abstract
Address	Stockholm; Sweden; August 2014
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICPR
Notes	LAMP; 600.079			Approved	no
Call Number	Admin @ si @ BTS2014			Serial	2518



Author	Federico Bartoli; Giuseppe Lisanti; Svebor Karaman; Andrew Bagdanov; Alberto del Bimbo
Title	Unsupervised scene adaptation for faster multi-scale pedestrian detection			Type	Conference Article
Year	2014	Publication	22nd International Conference on Pattern Recognition	Abbreviated Journal
Volume		Issue		Pages	3534-3539
Keywords
Abstract
Address	Stockholm; Sweden; August 2014
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICPR
Notes	LAMP; 600.079			Approved	no
Call Number	Admin @ si @ BLK2014			Serial	2519



Author	Antonio Hernandez; Stan Sclaroff; Sergio Escalera
Title	Contextual rescoring for Human Pose Estimation			Type	Conference Article
Year	2014	Publication	25th British Machine Vision Conference	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract	A contextual rescoring method is proposed for improving the detection of body joints of a pictorial structure model for human pose estimation. A set of mid-level parts is incorporated in the model, and their detections are used to extract spatial and score-related features relative to other body joint hypotheses. A technique is proposed for the automatic discovery of a compact subset of poselets that covers a set of validation images while maximizing precision. A rescoring mechanism is defined as a set-based boosting classifier that computes a new score for body joint detections, given its relationship to detections of other body joints and mid-level parts in the image. This new score complements the unary potential of a discriminatively trained pictorial structure model. Experiments on two benchmarks show performance improvements when considering the proposed mid-level image representation and rescoring approach in comparison with other pictorial structure-based approaches.
Address	Nottingham; UK; September 2014
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	BMVC
Notes	HuPBA;MILAB			Approved	no
Call Number	HSE2014			Serial	2525



Author	Xim Cerda-Company; C. Alejandro Parraga; Xavier Otazu
Title	Which tone-mapping is the best? A comparative study of tone-mapping perceived quality			Type	Abstract
Year	2014	Publication	Perception	Abbreviated Journal
Volume	43	Issue		Pages	106
Keywords
Abstract	High-dynamic-range (HDR) imaging refers to the methods designed to increase the brightness dynamic range present in standard digital imaging techniques. This increase is achieved by taking the same picture under different exposure values and mapping the intensity levels into a single image by way of a tone-mapping operator (TMO). Currently, there is no agreement on how to evaluate the quality of different TMOs. In this work we psychophysically evaluate 15 different TMOs, obtaining rankings based on the perceived properties of the resulting tone-mapped images. We performed two different experiments on a calibrated CRT display using 10 subjects: (1) a study of the internal relationships between grey-levels and (2) a pairwise comparison of the resulting 15 tone-mapped images. In (1) observers internally matched the grey-levels to a reference inside the tone-mapped images and in the real scene. In (2) observers performed a pairwise comparison of the tone-mapped images alongside the real scene. We obtained two rankings of the TMOs according to their performance. In (1) the best algorithm was ICAM by J. Kuang et al. (2007) and in (2) the best algorithm was a TMO by Krawczyk et al. (2005). Our results also show no correlation between these two rankings.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ECVP
Notes	NEUROBIT; 600.074			Approved	no
Call Number	Admin @ si @ CPO2014			Serial	2527



Author	Noha Elfiky; Theo Gevers; Arjan Gijsenij; Jordi Gonzalez
Title	Color Constancy using 3D Scene Geometry derived from a Single Image			Type	Journal Article
Year	2014	Publication	IEEE Transactions on Image Processing	Abbreviated Journal	TIP
Volume	23	Issue	9	Pages	3855-3868
Keywords
Abstract	The aim of color constancy is to remove the effect of the color of the light source. As color constancy is inherently an ill-posed problem, most of the existing color constancy algorithms are based on specific imaging assumptions (e.g. the grey-world and white-patch assumptions). In this paper, 3D geometry models are used to determine which color constancy method to use for the different geometrical regions (depth/layer) found in images. The aim is to classify images into stages (rough 3D geometry models). According to the stage models, images are divided into stage regions using hard and soft segmentation. After that, the best color constancy method is selected for each geometry depth. To this end, we propose a method to combine color constancy algorithms by investigating the relation between depth, local image statistics and color constancy. Image statistics are then exploited per depth to select the proper color constancy method. Our approach opens the possibility to estimate multiple illuminations by distinguishing nearby light sources from distant illuminations. Experiments on state-of-the-art data sets show that the proposed algorithm outperforms state-of-the-art single color constancy algorithms with an improvement of almost 50% in median angular error. When using a perfect classifier (i.e., all of the test images are correctly classified into stages), the proposed method achieves an improvement of 52% in median angular error compared to the best-performing single color constancy algorithm.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1057-7149	ISBN		Medium
Area		Expedition		Conference
Notes	ISE; 600.078			Approved	no
Call Number	Admin @ si @ EGG2014			Serial	2528



Author	Francisco Cruz; Oriol Ramos Terrades
Title	EM-Based Layout Analysis Method for Structured Documents			Type	Conference Article
Year	2014	Publication	22nd International Conference on Pattern Recognition	Abbreviated Journal
Volume		Issue		Pages	315-320
Keywords
Abstract	In this paper we present a method to perform layout analysis in structured documents. We propose an EM-based algorithm to fit a set of Gaussian mixtures to the different regions according to the logical distribution along the page. After convergence, we estimate the final shape of the regions according to the parameters computed for each component of the mixture. We evaluated our method on the task of record detection in a collection of historical structured documents and compared it with previous works on this task.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1051-4651	ISBN		Medium
Area		Expedition		Conference	ICPR
Notes	DAG; 602.006; 600.061; 600.077			Approved	no
Call Number	Admin @ si @ CrR2014			Serial	2530



Author	Mohammad Rouhani; E. Boyer; Angel Sappa
Title	Non-Rigid Registration meets Surface Reconstruction			Type	Conference Article
Year	2014	Publication	International Conference on 3D Vision	Abbreviated Journal
Volume		Issue		Pages	617-624
Keywords
Abstract	Non-rigid registration is an important task in computer vision with many applications in shape and motion modeling. A fundamental step of the registration is the data association between the source and the target sets. Such association proves difficult in practice, due to the discrete nature of the information and its corruption by various types of noise, e.g. outliers and missing data. In this paper we investigate the benefit of implicit representations for the non-rigid registration of 3D point clouds. First, the target points are described with small quadratic patches that are blended through partition of unity weighting. Then, the discrete association between the source and the target can be replaced by a continuous distance field induced by the interface. By combining this distance field with a proper deformation term, the registration energy can be expressed in a linear least-squares form that is easy and fast to solve. This significantly eases the registration by avoiding direct association between points. Moreover, a hierarchical approach can be easily implemented by employing coarse-to-fine representations. Experimental results are provided for point clouds from multi-view data sets. The qualitative and quantitative comparisons show the superior performance and robustness of our framework.
Address	Tokyo; Japan; December 2014
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	3DV
Notes	ADAS; 600.055; 600.076			Approved	no
Call Number	Admin @ si @ RBS2014			Serial	2534



Author	Lluis Gomez; Dimosthenis Karatzas
Title	Scene Text Recognition: No Country for Old Men?			Type	Conference Article
Year	2014	Publication	1st International Workshop on Robust Reading	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	IWRR
Notes	DAG; 600.077			Approved	no
Call Number	Admin @ si @ GoK2014c			Serial	2538



Author	Xavier Perez Sala; Fernando De la Torre; Laura Igual; Sergio Escalera; Cecilio Angulo
Title	Subspace Procrustes Analysis			Type	Conference Article
Year	2014	Publication	ECCV Workshop on ChaLearn Looking at People	Abbreviated Journal
Volume	8925	Issue		Pages	654-668
Keywords
Abstract	Procrustes Analysis (PA) has been a popular technique to align and build 2-D statistical models of shapes. Given a set of 2-D shapes, PA is applied to remove rigid transformations. Then, a non-rigid 2-D model is computed by modeling (e.g., PCA) the residual. Although PA has been widely used, it has several limitations for modeling 2-D shapes: occluded landmarks and missing data can result in local minima solutions, and there is no guarantee that the 2-D shapes provide a uniform sampling of the 3-D space of rotations for the object. To address these issues, this paper proposes Subspace PA (SPA). Given several instances of a 3-D object, SPA computes the mean and a 2-D subspace that can simultaneously model all rigid and non-rigid deformations of the 3-D object. We propose a discrete (DSPA) and continuous (CSPA) formulation for SPA, assuming that 3-D samples of an object are provided. DSPA extends the traditional PA, and produces unbiased 2-D models by uniformly sampling different views of the 3-D object. CSPA provides a continuous approach to uniformly sample the space of 3-D rotations, being more efficient in space and time. Experiments using SPA to learn 2-D models of bodies from motion capture data illustrate the benefits of our approach.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ECCVW
Notes	OR; HuPBA;MILAB			Approved	no
Call Number	Admin @ si @ PTI2014			Serial	2539



Author	E. Bondi; L. Seidenari; Andrew Bagdanov; Alberto del Bimbo
Title	Real-time people counting from depth imagery of crowded environments			Type	Conference Article
Year	2014	Publication	11th IEEE International Conference on Advanced Video and Signal based Surveillance	Abbreviated Journal
Volume		Issue		Pages	337-342
Keywords
Abstract	In this paper we describe a system for automatic people counting in crowded environments. The approach we propose is a counting-by-detection method based on depth imagery. It is designed to be deployed as an autonomous appliance for crowd analysis in video surveillance application scenarios. Our system performs foreground/background segmentation on depth image streams in order to coarsely segment persons; depth information is then used to localize head candidates, which are tracked over time on an automatically estimated ground plane. The system runs in real time, at a frame rate of about 20 fps. We collected a dataset of RGB-D sequences representing three typical and challenging surveillance scenarios, including crowds, queuing and groups. An extensive comparative evaluation is given between our system and a more complex, Latent SVM-based head localization approach for person counting applications.
Address	Seoul; Korea; August 2014
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	AVSS
Notes	LAMP; 600.079			Approved	no
Call Number	Admin @ si @ BSB2014			Serial	2540



Author	Thanh Ha Do; Salvatore Tabbone; Oriol Ramos Terrades
Title	Spotting Symbol Using Sparsity over Learned Dictionary of Local Descriptors			Type	Conference Article
Year	2014	Publication	11th IAPR International Workshop on Document Analysis and Systems	Abbreviated Journal
Volume		Issue		Pages	156-160
Keywords
Abstract	This paper proposes a new approach to spot symbols in graphical documents using sparse representations. More specifically, a dictionary is learned from a training database of local descriptors defined over the documents. Following their sparse representations, interest points sharing similar properties are used to define interest regions. Using an original adaptation of information retrieval techniques, a vector model for interest regions and for a query symbol is built based on its sparsity in a visual vocabulary where the visual words are columns in the learned dictionary. The matching process is performed by comparing the similarity between vector models. Evaluation on SESYD datasets demonstrates that our method is promising.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-4799-3243-6	Medium
Area		Expedition		Conference	DAS
Notes	DAG; 600.077			Approved	no
Call Number	Admin @ si @ DTR2014			Serial	2543



Author	Marçal Rusiñol; J. Chazalon; Jean-Marc Ogier
Title	Combining Focus Measure Operators to Predict OCR Accuracy in Mobile-Captured Document Images			Type	Conference Article
Year	2014	Publication	11th IAPR International Workshop on Document Analysis and Systems	Abbreviated Journal
Volume		Issue		Pages	181-185
Keywords
Abstract	Mobile document image acquisition is a new trend raising serious issues in business document processing workflows. Such a digitization procedure is unreliable, and introduces many distortions which must be detected as soon as possible, on the mobile device, to avoid paying data transmission fees and losing information due to the inability to later re-capture a document with temporary availability. In this context, out-of-focus blur is a major issue: users have no direct control over it, and it seriously degrades OCR recognition. In this paper, we concentrate on the estimation of focus quality, to ensure sufficient legibility of a document image for OCR processing. We propose two contributions to improve OCR accuracy prediction for mobile-captured document images. First, we present 24 focus measures, never tested on document images, which are fast to compute and require no training. Second, we show that a combination of those measures enables state-of-the-art performance regarding the correlation with OCR accuracy. The resulting approach is fast, robust, and easy to implement on a mobile device. Experiments are performed on a public dataset, and precise details about image processing are given.
Address	Tours; France; April 2014
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-4799-3243-6	Medium
Area		Expedition		Conference	DAS
Notes	DAG; 601.223; 600.077			Approved	no
Call Number	Admin @ si @ RCO2014a			Serial	2545