Publicacions CVC -- Query Results

[191–200] << 201 202 203 204 205 206 207 208 209 210 >> [211–220]

Details

Records
Author	Axel Barroso-Laguna; Edgar Riba; Daniel Ponsa; Krystian Mikolajczyk
Title	Key.Net: Keypoint Detection by Handcrafted and Learned CNN Filters			Type	Conference Article
Year	2019	Publication	18th IEEE International Conference on Computer Vision	Abbreviated Journal
Volume		Issue		Pages	5835-5843
Keywords
Abstract	We introduce a novel approach for keypoint detection task that combines handcrafted and learned CNN filters within a shallow multi-scale architecture. Handcrafted filters provide anchor structures for learned filters, which localize, score and rank repeatable features. Scale-space representation is used within the network to extract keypoints at different levels. We design a loss function to detect robust features that exist across a range of scales and to maximize the repeatability score. Our Key.Net model is trained on data synthetically created from ImageNet and evaluated on HPatches benchmark. Results show that our approach outperforms state-of-the-art detectors in terms of repeatability, matching performance and complexity.
Address	Seul; Corea; October 2019
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICCV
Notes	MSIAU; 600.122			Approved	no
Call Number	Admin @ si @ BRP2019			Serial	3290
Permanent link to this record



Author	Javad Zolfaghari Bengar; Abel Gonzalez-Garcia; Gabriel Villalonga; Bogdan Raducanu; Hamed H. Aghdam; Mikhail Mozerov; Antonio Lopez; Joost Van de Weijer
Title	Temporal Coherence for Active Learning in Videos			Type	Conference Article
Year	2019	Publication	IEEE International Conference on Computer Vision Workshops	Abbreviated Journal
Volume		Issue		Pages	914-923
Keywords
Abstract	Autonomous driving systems require huge amounts of data to train. Manual annotation of this data is time-consuming and prohibitively expensive since it involves human resources. Therefore, active learning emerged as an alternative to ease this effort and to make data annotation more manageable. In this paper, we introduce a novel active learning approach for object detection in videos by exploiting temporal coherence. Our active learning criterion is based on the estimated number of errors in terms of false positives and false negatives. The detections obtained by the object detector are used to define the nodes of a graph and tracked forward and backward to temporally link the nodes. Minimizing an energy function defined on this graphical model provides estimates of both false positives and false negatives. Additionally, we introduce a synthetic video dataset, called SYNTHIA-AL, specially designed to evaluate active learning for video object detection in road scenes. Finally, we show that our approach outperforms active learning baselines tested on two datasets.
Address	Seul; Corea; October 2019
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICCVW
Notes	LAMP; ADAS; 600.124; 602.200; 600.118; 600.120; 600.141			Approved	no
Call Number	Admin @ si @ ZGV2019			Serial	3294
Permanent link to this record



Author	David Berga; Xose R. Fernandez-Vidal; Xavier Otazu; Xose M. Pardo
Title	SID4VAM: A Benchmark Dataset with Synthetic Images for Visual Attention Modeling			Type	Conference Article
Year	2019	Publication	18th IEEE International Conference on Computer Vision	Abbreviated Journal
Volume		Issue		Pages	8788-8797
Keywords
Abstract	A benchmark of saliency models performance with a synthetic image dataset is provided. Model performance is evaluated through saliency metrics as well as the influence of model inspiration and consistency with human psychophysics. SID4VAM is composed of 230 synthetic images, with known salient regions. Images were generated with 15 distinct types of low-level features (e.g. orientation, brightness, color, size...) with a target-distractor popout type of synthetic patterns. We have used Free-Viewing and Visual Search task instructions and 7 feature contrasts for each feature category. Our study reveals that state-ofthe-art Deep Learning saliency models do not perform well with synthetic pattern images, instead, models with Spectral/Fourier inspiration outperform others in saliency metrics and are more consistent with human psychophysical experimentation. This study proposes a new way to evaluate saliency models in the forthcoming literature, accounting for synthetic images with uniquely low-level feature contexts, distinct from previous eye tracking image datasets.
Address	Seul; Corea; October 2019
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICCV
Notes	NEUROBIT; 600.128			Approved	no
Call Number	Admin @ si @ BFO2019b			Serial	3372
Permanent link to this record



Author	Hamed H. Aghdam; Abel Gonzalez-Garcia; Joost Van de Weijer; Antonio Lopez
Title	Active Learning for Deep Detection Neural Networks			Type	Conference Article
Year	2019	Publication	18th IEEE International Conference on Computer Vision	Abbreviated Journal
Volume		Issue		Pages	3672-3680
Keywords
Abstract	The cost of drawing object bounding boxes (ie labeling) for millions of images is prohibitively high. For instance, labeling pedestrians in a regular urban image could take 35 seconds on average. Active learning aims to reduce the cost of labeling by selecting only those images that are informative to improve the detection network accuracy. In this paper, we propose a method to perform active learning of object detectors based on convolutional neural networks. We propose a new image-level scoring process to rank unlabeled images for their automatic selection, which clearly outperforms classical scores. The proposed method can be applied to videos and sets of still images. In the former case, temporal selection rules can complement our scoring process. As a relevant use case, we extensively study the performance of our method on the task of pedestrian detection. Overall, the experiments show that the proposed method performs better than random selection.
Address	Seul; Korea; October 2019
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICCV
Notes	ADAS; LAMP; 600.124; 600.109; 600.141; 600.120; 600.118			Approved	no
Call Number	Admin @ si @ AGW2019			Serial	3321
Permanent link to this record



Author	Felipe Codevilla; Eder Santana; Antonio Lopez; Adrien Gaidon
Title	Exploring the Limitations of Behavior Cloning for Autonomous Driving			Type	Conference Article
Year	2019	Publication	18th IEEE International Conference on Computer Vision	Abbreviated Journal
Volume		Issue		Pages	9328-9337
Keywords
Abstract	Driving requires reacting to a wide variety of complex environment conditions and agent behaviors. Explicitly modeling each possible scenario is unrealistic. In contrast, imitation learning can, in theory, leverage data from large fleets of human-driven cars. Behavior cloning in particular has been successfully used to learn simple visuomotor policies end-to-end, but scaling to the full spectrum of driving behaviors remains an unsolved problem. In this paper, we propose a new benchmark to experimentally investigate the scalability and limitations of behavior cloning. We show that behavior cloning leads to state-of-the-art results, executing complex lateral and longitudinal maneuvers, even in unseen environments, without being explicitly programmed to do so. However, we confirm some limitations of the behavior cloning approach: some well-known limitations (eg, dataset bias and overfitting), new generalization issues (eg, dynamic objects and the lack of a causal modeling), and training instabilities, all requiring further research before behavior cloning can graduate to real-world driving. The code, dataset, benchmark, and agent studied in this paper can be found at github.
Address	Seul; Korea; October 2019
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICCV
Notes	ADAS; 600.124; 600.118			Approved	no
Call Number	Admin @ si @ CSL2019			Serial	3322
Permanent link to this record



Author	Reza Azad; Maryam Asadi Aghbolaghi; Mahmood Fathy; Sergio Escalera
Title	Bi-Directional ConvLSTM U-Net with Densley Connected Convolutions			Type	Conference Article
Year	2019	Publication	Visual Recognition for Medical Images workshop	Abbreviated Journal
Volume		Issue		Pages	406-415
Keywords
Abstract	In recent years, deep learning-based networks have achieved state-of-the-art performance in medical image segmentation. Among the existing networks, U-Net has been successfully applied on medical image segmentation. In this paper, we propose an extension of U-Net, Bi-directional ConvLSTM U-Net with Densely connected convolutions (BCDU-Net), for medical image segmentation, in which we take full advantages of U-Net, bi-directional ConvLSTM (BConvLSTM) and the mechanism of dense convolutions. Instead of a simple concatenation in the skip connection of U-Net, we employ BConvLSTM to combine the feature maps extracted from the corresponding encoding path and the previous decoding up-convolutional layer in a non-linear way. To strengthen feature propagation and encourage feature reuse, we use densely connected convolutions in the last convolutional layer of the encoding path. Finally, we can accelerate the convergence speed of the proposed network by employing batch normalization (BN). The proposed model is evaluated on three datasets of: retinal blood vessel segmentation, skin lesion segmentation, and lung nodule segmentation, achieving state-of-the-art performance.
Address	Seul; Korea; October 2019
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICCVW
Notes	HUPBA; no proj			Approved	no
Call Number	Admin @ si @ AAF2019			Serial	3324
Permanent link to this record



Author	Mohammed Al Rawi; Ernest Valveny
Title	Compact and Efficient Multitask Learning in Vision, Language and Speech			Type	Conference Article
Year	2019	Publication	IEEE International Conference on Computer Vision Workshops	Abbreviated Journal
Volume		Issue		Pages	2933-2942
Keywords
Abstract	Across-domain multitask learning is a challenging area of computer vision and machine learning due to the intra-similarities among class distributions. Addressing this problem to cope with the human cognition system by considering inter and intra-class categorization and recognition complicates the problem even further. We propose in this work an effective holistic and hierarchical learning by using a text embedding layer on top of a deep learning model. We also propose a novel sensory discriminator approach to resolve the collisions between different tasks and domains. We then train the model concurrently on textual sentiment analysis, speech recognition, image classification, action recognition from video, and handwriting word spotting of two different scripts (Arabic and English). The model we propose successfully learned different tasks across multiple domains.
Address	Seul; Korea; October 2019
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICCVW
Notes	DAG; 600.121; 600.129			Approved	no
Call Number	Admin @ si @ RaV2019			Serial	3365
Permanent link to this record



Author	Alejandro Cartas; Jordi Luque; Petia Radeva; Carlos Segura; Mariella Dimiccoli
Title	Seeing and Hearing Egocentric Actions: How Much Can We Learn?			Type	Conference Article
Year	2019	Publication	IEEE International Conference on Computer Vision Workshops	Abbreviated Journal
Volume		Issue		Pages	4470-4480
Keywords
Abstract	Our interaction with the world is an inherently multimodal experience. However, the understanding of human-to-object interactions has historically been addressed focusing on a single modality. In particular, a limited number of works have considered to integrate the visual and audio modalities for this purpose. In this work, we propose a multimodal approach for egocentric action recognition in a kitchen environment that relies on audio and visual information. Our model combines a sparse temporal sampling strategy with a late fusion of audio, spatial, and temporal streams. Experimental results on the EPIC-Kitchens dataset show that multimodal integration leads to better performance than unimodal approaches. In particular, we achieved a 5.18% improvement over the state of the art on verb classification.
Address	Seul; Korea; October 2019
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICCVW
Notes	MILAB; no proj			Approved	no
Call Number	Admin @ si @ CLR2019b			Serial	3385
Permanent link to this record



Author	Aura Hernandez-Sabate; Debora Gil; David Roche; Monica M. S. Matsumoto; Sergio S. Furuie
Title	Inferring the Performance of Medical Imaging Algorithms			Type	Conference Article
Year	2011	Publication	14th International Conference on Computer Analysis of Images and Patterns	Abbreviated Journal
Volume	6854	Issue		Pages	520-528
Keywords	Validation, Statistical Inference, Medical Imaging Algorithms.
Abstract	Evaluation of the performance and limitations of medical imaging algorithms is essential to estimate their impact in social, economic or clinical aspects. However, validation of medical imaging techniques is a challenging task due to the variety of imaging and clinical problems involved, as well as, the difficulties for systematically extracting a reliable solely ground truth. Although specific validation protocols are reported in any medical imaging paper, there are still two major concerns: definition of standardized methodologies transversal to all problems and generalization of conclusions to the whole clinical data set. We claim that both issues would be fully solved if we had a statistical model relating ground truth and the output of computational imaging techniques. Such a statistical model could conclude to what extent the algorithm behaves like the ground truth from the analysis of a sampling of the validation data set. We present a statistical inference framework reporting the agreement and describing the relationship of two quantities. We show its transversality by applying it to validation of two different tasks: contour segmentation and landmark correspondence.
Address	Sevilla
Corporate Author				Thesis
Publisher	Springer-Verlag Berlin Heidelberg	Place of Publication	Berlin	Editor	Pedro Real; Daniel Diaz-Pernil; Helena Molina-Abril; Ainhoa Berciano; Walter Kropatsch
Language		Summary Language		Original Title
Series Editor		Series Title	L	Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	CAIP
Notes	IAM; ADAS			Approved	no
Call Number	IAM @ iam @ HGR2011			Serial	1676
Permanent link to this record



Author	O. Rodriguez; J. Mauri; E Fernandez-Nofrerias; A. Tovar; R. Villuendas; V. Valle; Oriol Pujol; Petia Radeva
Title	Analisis de texturas mediante la modificacion de un modelo binario local para la segmentacion automatica de secuencias de ecografia intracoronaria			Type	Journal
Year	2003	Publication	Revista Española de Cardiologia (IF: 0.959), 56(2), Congreso de las Enfermedades Cardiovasculares	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Sevilla (Spain)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	MILAB;HuPBA			Approved	no
Call Number	BCNPCL @ bcnpcl @ RMF2003f			Serial	413
Permanent link to this record



Author	O. Rodriguez; J. Mauri; E Fernandez-Nofrerias; J. Lopez; A. Tovar; V. Valle; David Rotger; Petia Radeva
Title	Cuantificacion tridimensional de la longitud de segmentos coronarios a partir de secuencias de ecografia intracoronaria			Type	Journal
Year	2003	Publication	Revista Española de Cardiologia (IF: 0.959), 56(2), Congreso de las Enfermedades Cardiovasculares	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Sevilla (Spain)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	MILAB			Approved	no
Call Number	BCNPCL @ bcnpcl @ RMF2003g			Serial	414
Permanent link to this record



Author	O. Rodriguez; J. Mauri; E Fernandez-Nofrerias; J. Lopez; A. Tovar; R. Villuendas; V. Valle; Misael Rosales; Petia Radeva
Title	Modelo fisico para la simulacion de imagenes de ecografia intracoronaria			Type	Journal
Year	2003	Publication	Revista Española de Cardiologia (IF: 0.959), 56(2), Congreso de las Enfermedades Cardiovasculares	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Sevilla (Spain)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	MILAB			Approved	no
Call Number	BCNPCL @ bcnpcl @ RMF2003h			Serial	415
Permanent link to this record



Author	M. Bressan; Jordi Vitria
Title	Improving Naive Bayes using Class Condicitonal ICA.			Type	Miscellaneous
Year	2002	Publication	Iberoamerican Conference on Artificial Intelligence IBERAMIA 2002.	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Sevilla, Espanya
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	OR;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ BrV2002e			Serial	305
Permanent link to this record



Author	Oriol Pujol; Petia Radeva; J. Mauri; E Fernandez-Nofrerias
Title	Automatic segmentation of lumen in Intravascular Ultrasound Images: An evaluation of texture feature extractors.			Type	Miscellaneous
Year	2002	Publication	Iberoamerican Conference on Artificial Intelligence IBERAMIA 2002, Springer Verlag	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Sevilla, Espanya
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	MILAB;HuPBA			Approved	no
Call Number	BCNPCL @ bcnpcl @ PRM2002			Serial	314
Permanent link to this record



Author	Fernando Vilariño; Petia Radeva
Title	Patch-Optimized Discriminant Active Contours for Medical Image Segmentation.			Type	Conference Article
Year	2002	Publication	Iberoamerican Conference on Artificial Intelligence	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Sevilla, Espanya
Corporate Author				Thesis
Publisher	Springer Verlag	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	IBERAMIA
Notes	MV;MILAB;SIAI			Approved	no
Call Number	BCNPCL @ bcnpcl @ ViR2002; IAM @ iam @ VRa2003			Serial	320
Permanent link to this record