Publicacions CVC -- Query Results

[51–60] << 61 62 63 64 65 66 67 68 69 70 >> [71–80]

Details

Records
Author	Lu Yu; Lichao Zhang; Joost Van de Weijer; Fahad Shahbaz Khan; Yongmei Cheng; C. Alejandro Parraga
Title	Beyond Eleven Color Names for Image Understanding			Type	Journal Article
Year	2018	Publication	Machine Vision and Applications	Abbreviated Journal	MVAP
Volume	29	Issue	2	Pages	361-373
Keywords	Color name; Discriminative descriptors; Image classification; Re-identification; Tracking
Abstract	Color description is one of the fundamental problems of image understanding. One of the popular ways to represent colors is by means of color names. Most existing work on color names focuses on only the eleven basic color terms of the English language. This could be limiting the discriminative power of these representations, and representations based on more color names are expected to perform better. However, there exists no clear strategy to choose additional color names. We collect a dataset of 28 additional color names. To ensure that the resulting color representation has high discriminative power we propose a method to order the additional color names according to their complementary nature with the basic color names. This allows us to compute color name representations with high discriminative power of arbitrary length. In the experiments we show that these new color name descriptors outperform the existing color name descriptor on the task of visual tracking, person re-identification and image classification.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	LAMP; NEUROBIT; 600.068; 600.109; 600.120			Approved	no
Call Number	Admin @ si @ YYW2018			Serial	3087
Permanent link to this record



Author	Sounak Dey; Anguelos Nicolaou; Josep Llados; Umapada Pal
Title	Evaluation of the Effect of Improper Segmentation on Word Spotting			Type	Journal Article
Year	2019	Publication	International Journal on Document Analysis and Recognition	Abbreviated Journal	IJDAR
Volume	22	Issue		Pages	361-374
Keywords
Abstract	Word spotting is an important recognition task in large-scale retrieval of document collections. In most of the cases, methods are developed and evaluated assuming perfect word segmentation. In this paper, we propose an experimental framework to quantify the goodness that word segmentation has on the performance achieved by word spotting methods in identical unbiased conditions. The framework consists of generating systematic distortions on segmentation and retrieving the original queries from the distorted dataset. We have tested our framework on several established and state-of-the-art methods using George Washington and Barcelona Marriage Datasets. The experiments done allow for an estimate of the end-to-end performance of word spotting methods.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	DAG; 600.097; 600.084; 600.121; 600.140; 600.129			Approved	no
Call Number	Admin @ si @ DNL2019			Serial	3455
Permanent link to this record



Author	Eduardo Aguilar; Marc Bolaños; Petia Radeva
Title	Regularized uncertainty-based multi-task learning model for food analysis			Type	Journal Article
Year	2019	Publication	Journal of Visual Communication and Image Representation	Abbreviated Journal	JVCIR
Volume	60	Issue		Pages	360-370
Keywords	Multi-task models; Uncertainty modeling; Convolutional neural networks; Food image analysis; Food recognition; Food group recognition; Ingredients recognition; Cuisine recognition
Abstract	Food plays an important role in several aspects of our daily life. Several computer vision approaches have been proposed for tackling food analysis problems, but very little effort has been done in developing methodologies that could take profit of the existent correlation between tasks. In this paper, we propose a new multi-task model that is able to simultaneously predict different food-related tasks, e.g. dish, cuisine and food categories. Here, we extend the homoscedastic uncertainty modeling to allow single-label and multi-label classification and propose a regularization term, which jointly weighs the tasks as well as their correlations. Furthermore, we propose a new Multi-Attribute Food dataset and a new metric, Multi-Task Accuracy. We prove that using both our uncertainty-based loss and the class regularization term, we are able to improve the coherence of outputs between different tasks. Moreover, we outperform the use of task-specific models on classical measures like accuracy or .
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	MILAB; no proj			Approved	no
Call Number	Admin @ si @ ABR2019			Serial	3298
Permanent link to this record



Author	Sergi Garcia Bordils; Andres Mafla; Ali Furkan Biten; Oren Nuriel; Aviad Aberdam; Shai Mazor; Ron Litman; Dimosthenis Karatzas
Title	Out-of-Vocabulary Challenge Report			Type	Conference Article
Year	2022	Publication	Proceedings European Conference on Computer Vision Workshops	Abbreviated Journal
Volume	13804	Issue		Pages	359–375
Keywords
Abstract	This paper presents final results of the Out-Of-Vocabulary 2022 (OOV) challenge. The OOV contest introduces an important aspect that is not commonly studied by Optical Character Recognition (OCR) models, namely, the recognition of unseen scene text instances at training time. The competition compiles a collection of public scene text datasets comprising of 326,385 images with 4,864,405 scene text instances, thus covering a wide range of data distributions. A new and independent validation and test set is formed with scene text instances that are out of vocabulary at training time. The competition was structured in two tasks, end-to-end and cropped scene text recognition respectively. A thorough analysis of results from baselines and different participants is presented. Interestingly, current state-of-the-art models show a significant performance gap under the newly studied setting. We conclude that the OOV dataset proposed in this challenge will be an essential area to be explored in order to develop scene text models that achieve more robust and generalized predictions.
Address	Tel-Aviv; Israel; October 2022
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ECCVW
Notes	DAG; 600.155; 302.105; 611.002			Approved	no
Call Number	Admin @ si @ GMB2022			Serial	3771
Permanent link to this record



Author	Debora Gil; Petia Radeva
Title	Curvature Vector Flow to Assure Convergent Deformable Models for Shape Modelling			Type	Book Chapter
Year	2003	Publication	Energy Minimization Methods In Computer Vision And Pattern Recognition	Abbreviated Journal	LNCS
Volume	2683	Issue		Pages	357-372
Keywords	Initial condition; Convex shape; Non convex analysis; Increase; Segmentation; Gradient; Standard; Standards; Concave shape; Flow models; Tracking; Edge detection; Curvature
Abstract	Poor convergence to concave shapes is a main limitation of snakes as a standard segmentation and shape modelling technique. The gradient of the external energy of the snake represents a force that pushes the snake into concave regions, as its internal energy increases when new inexion points are created. In spite of the improvement of the external energy by the gradient vector ow technique, highly non convex shapes can not be obtained, yet. In the present paper, we develop a new external energy based on the geometry of the curve to be modelled. By tracking back the deformation of a curve that evolves by minimum curvature ow, we construct a distance map that encapsulates the natural way of adapting to non convex shapes. The gradient of this map, which we call curvature vector ow (CVF), is capable of attracting a snake towards any contour, whatever its geometry. Our experiments show that, any initial snake condition converges to the curve to be modelled in optimal time.
Address
Corporate Author				Thesis
Publisher	Springer, Berlin	Place of Publication	Lisbon, PORTUGAL	Editor	Springer, B.
Language		Summary Language		Original Title
Series Editor		Series Title	Lecture Notes in Computer Science	Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	3-540-40498-8	Medium
Area		Expedition		Conference
Notes	IAM;MILAB			Approved	no
Call Number	IAM @ iam @ GIR2003b			Serial	1535
Permanent link to this record



Author	Pau Rodriguez; Josep M. Gonfaus; Guillem Cucurull; Xavier Roca; Jordi Gonzalez
Title	Attend and Rectify: A Gated Attention Mechanism for Fine-Grained Recovery			Type	Conference Article
Year	2018	Publication	15th European Conference on Computer Vision	Abbreviated Journal
Volume	11212	Issue		Pages	357-372
Keywords	Deep Learning; Convolutional Neural Networks; Attention
Abstract	We propose a novel attention mechanism to enhance Convolutional Neural Networks for fine-grained recognition. It learns to attend to lower-level feature activations without requiring part annotations and uses these activations to update and rectify the output likelihood distribution. In contrast to other approaches, the proposed mechanism is modular, architecture-independent and efficient both in terms of parameters and computation required. Experiments show that networks augmented with our approach systematically improve their classification accuracy and become more robust to clutter. As a result, Wide Residual Networks augmented with our proposal surpasses the state of the art classification accuracies in CIFAR-10, the Adience gender recognition task, Stanford dogs, and UEC Food-100.
Address	Munich; September 2018
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ECCV
Notes	ISE; 600.098; 602.121; 600.119			Approved	no
Call Number	Admin @ si @ RGC2018			Serial	3139
Permanent link to this record



Author	Alejandro Gonzalez Alzate; Gabriel Villalonga; Jiaolong Xu; David Vazquez; Jaume Amores; Antonio Lopez
Title	Multiview Random Forest of Local Experts Combining RGB and LIDAR data for Pedestrian Detection			Type	Conference Article
Year	2015	Publication	IEEE Intelligent Vehicles Symposium IV2015	Abbreviated Journal
Volume		Issue		Pages	356-361
Keywords	Pedestrian Detection
Abstract	Despite recent significant advances, pedestrian detection continues to be an extremely challenging problem in real scenarios. In order to develop a detector that successfully operates under these conditions, it becomes critical to leverage upon multiple cues, multiple imaging modalities and a strong multi-view classifier that accounts for different pedestrian views and poses. In this paper we provide an extensive evaluation that gives insight into how each of these aspects (multi-cue, multimodality and strong multi-view classifier) affect performance both individually and when integrated together. In the multimodality component we explore the fusion of RGB and depth maps obtained by high-definition LIDAR, a type of modality that is only recently starting to receive attention. As our analysis reveals, although all the aforementioned aspects significantly help in improving the performance, the fusion of visible spectrum and depth information allows to boost the accuracy by a much larger margin. The resulting detector not only ranks among the top best performers in the challenging KITTI benchmark, but it is built upon very simple blocks that are easy to implement and computationally efficient. These simple blocks can be easily replaced with more sophisticated ones recently proposed, such as the use of convolutional neural networks for feature representation, to further improve the accuracy.
Address	Seoul; Corea; June 2015
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area	ACDC	Expedition		Conference	IV
Notes	ADAS; 600.076; 600.057; 600.054			Approved	no
Call Number	ADAS @ adas @ GVX2015			Serial	2625
Permanent link to this record



Author	A. Sanfeliu; Juan J. Villanueva
Title	An approach of visual motion analysis			Type	Journal Article
Year	2005	Publication	Pattern Recognition Letters	Abbreviated Journal	PRL
Volume	26	Issue	3	Pages	355–368
Keywords
Abstract	IF: 1.138
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes				Approved	no
Call Number	ISE @ ise @ SaV2005			Serial	561
Permanent link to this record



Author	Marco Pedersoli; Jordi Gonzalez; Xu Hu; Xavier Roca
Title	Toward Real-Time Pedestrian Detection Based on a Deformable Template Model			Type	Journal Article
Year	2014	Publication	IEEE Transactions on Intelligent Transportation Systems	Abbreviated Journal	TITS
Volume	15	Issue	1	Pages	355-364
Keywords
Abstract	Most advanced driving assistance systems already include pedestrian detection systems. Unfortunately, there is still a tradeoff between precision and real time. For a reliable detection, excellent precision-recall such a tradeoff is needed to detect as many pedestrians as possible while, at the same time, avoiding too many false alarms; in addition, a very fast computation is needed for fast reactions to dangerous situations. Recently, novel approaches based on deformable templates have been proposed since these show a reasonable detection performance although they are computationally too expensive for real-time performance. In this paper, we present a system for pedestrian detection based on a hierarchical multiresolution part-based model. The proposed system is able to achieve state-of-the-art detection accuracy due to the local deformations of the parts while exhibiting a speedup of more than one order of magnitude due to a fast coarse-to-fine inference technique. Moreover, our system explicitly infers the level of resolution available so that the detection of small examples is feasible with a very reduced computational cost. We conclude this contribution by presenting how a graphics processing unit-optimized implementation of our proposed system is suitable for real-time pedestrian detection in terms of both accuracy and speed.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1524-9050	ISBN		Medium
Area		Expedition		Conference
Notes	ISE; 601.213; 600.078			Approved	no
Call Number	PGH2014			Serial	2350
Permanent link to this record



Author	Ernest Valveny; Philippe Dosch
Title	Performance Evaluation of Symbol Recognition			Type	Book Chapter
Year	2004	Publication	Document Analysis Systems	Abbreviated Journal	LNCS
Volume	3163	Issue		Pages	354–365
Keywords
Abstract
Address	Springer-Verlag
Corporate Author				Thesis
Publisher		Place of Publication		Editor	S. Marinai, A. Dengel (Eds.),
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	3-540-23060-2	Medium
Area		Expedition		Conference
Notes	DAG			Approved	no
Call Number	DAG @ dag @ VaD2004a			Serial	502
Permanent link to this record



Author	Fadi Dornaika; Angel Sappa
Title	Improving Appearance-Based 3D Face Tracking Using Sparse Stereo Data			Type	Conference Article
Year	2007	Publication	Advances in Computer Graphics and Computer Vision,	Abbreviated Journal
Volume		Issue		Pages	354–366
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher	Springer Verlag	Place of Publication		Editor	J. Braz, A. Ranchordas, H. Araujo and J. Jorge,
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	VISAPP
Notes	ADAS			Approved	no
Call Number	ADAS @ adas @ DoS2007d			Serial	1046
Permanent link to this record



Author	Susana Alvarez; Anna Salvatella; Maria Vanrell; Xavier Otazu
Title	3D Texton Spaces for color-texture retrieval			Type	Conference Article
Year	2010	Publication	7th International Conference on Image Analysis and Recognition	Abbreviated Journal
Volume	6111	Issue		Pages	354–363
Keywords
Abstract	Color and texture are visual cues of different nature, their integration in an useful visual descriptor is not an easy problem. One way to combine both features is to compute spatial texture descriptors independently on each color channel. Another way is to do the integration at the descriptor level. In this case the problem of normalizing both cues arises. In this paper we solve the latest problem by fusing color and texture through distances in texton spaces. Textons are the attributes of image blobs and they are responsible for texture discrimination as defined in Julesz’s Texton theory. We describe them in two low-dimensional and uniform spaces, namely, shape and color. The dissimilarity between color texture images is computed by combining the distances in these two spaces. Following this approach, we propose our TCD descriptor which outperforms current state of art methods in the two different approaches mentioned above, early combination with LBP and late combination with MPEG-7. This is done on an image retrieval experiment over a highly diverse texture dataset from Corel.
Address
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor	A.C. Campilho and M.S. Kamel
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-13771-6	Medium
Area		Expedition		Conference	ICIAR
Notes	CIC			Approved	no
Call Number	CAT @ cat @ ASV2010a			Serial	1325
Permanent link to this record



Author	Gioacchino Vino; Angel Sappa
Title	Revisiting Harris Corner Detector Algorithm: a Gradual Thresholding Approach			Type	Conference Article
Year	2013	Publication	10th International Conference on Image Analysis and Recognition	Abbreviated Journal
Volume	7950	Issue		Pages	354-363
Keywords
Abstract	This paper presents an adaptive thresholding approach intended to increase the number of detected corners, while reducing the amount of those ones corresponding to noisy data. The proposed approach works by using the classical Harris corner detector algorithm and overcome the difficulty in finding a general threshold that work well for all the images in a given data set by proposing a novel adaptive thresholding scheme. Initially, two thresholds are used to discern between strong corners and flat regions. Then, a region based criteria is used to discriminate between weak corners and noisy points in the midway interval. Experimental results show that the proposed approach has a better capability to reject false corners and, at the same time, to detect weak ones. Comparisons with the state of the art are provided showing the validity of the proposed approach.
Address	Póvoa de Varzim; Portugal; June 2013
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-39093-7	Medium
Area		Expedition		Conference	ICIAR
Notes	ADAS; 600.055			Approved	no
Call Number	Admin @ si @ ViS2013			Serial	2562
Permanent link to this record



Author	Francisco Javier Orozco; Xavier Roca; Jordi Gonzalez
Title	Real-Time Gaze Tracking with Appearance-Based Models			Type	Journal Article
Year	2008	Publication	Machine Vision Applications	Abbreviated Journal	MVAP
Volume	20	Issue	6	Pages	353-364
Keywords	Keywords Eyelid and iris tracking, Appearance models, Blinking, Iris saccade, Real-time gaze tracking
Abstract	Psychological evidence has emphasized the importance of eye gaze analysis in human computer interaction and emotion interpretation. To this end, current image analysis algorithms take into consideration eye-lid and iris motion detection using colour information and edge detectors. However, eye movement is fast and and hence difficult to use to obtain a precise and robust tracking. Instead, our method proposed to describe eyelid and iris movements as continuous variables using appearance-based tracking. This approach combines the strengths of adaptive appearance models, optimization methods and backtracking techniques.Thus, in the proposed method textures are learned on-line from near frontal images and illumination changes, occlusions and fast movements are managed. The method achieves real-time performance by combining two appearance-based trackers to a backtracking algorithm for eyelid estimation and another for iris estimation. These contributions represent a significant advance towards a reliable gaze motion description for HCI and expression analysis, where the strength of complementary methodologies are combined to avoid using high quality images, colour information, texture training, camera settings and other time-consuming processes.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ISE			Approved	no
Call Number	ISE @ ise @ ORG2008			Serial	972
Permanent link to this record



Author	Patricia Suarez; Angel Sappa; Boris X. Vintimilla
Title	Vegetation Index Estimation from Monospectral Images			Type	Conference Article
Year	2018	Publication	15th International Conference on Images Analysis and Recognition	Abbreviated Journal
Volume	10882	Issue		Pages	353-362
Keywords
Abstract	This paper proposes a novel approach to estimate Normalized Difference Vegetation Index (NDVI) from just the red channel of a RGB image. The NDVI index is defined as the ratio of the difference of the red and infrared radiances over their sum. In other words, information from the red channel of a RGB image and the corresponding infrared spectral band are required for its computation. In the current work the NDVI index is estimated just from the red channel by training a Conditional Generative Adversarial Network (CGAN). The architecture proposed for the generative network consists of a single level structure, which combines at the final layer results from convolutional operations together with the given red channel with Gaussian noise to enhance details, resulting in a sharp NDVI image. Then, the discriminative model estimates the probability that the NDVI generated index came from the training dataset, rather than the index automatically generated. Experimental results with a large set of real images are provided showing that a Conditional GAN single level model represents an acceptable approach to estimate NDVI index.
Address	Povoa de Varzim; Portugal; June 2018
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICIAR
Notes	MSIAU; 600.086; 600.130; 600.122			Approved	no
Call Number	Admin @ si @ SSV2018c			Serial	3196
Permanent link to this record