Publicacions CVC -- Query Results

Records
Author	Raul Gomez; Lluis Gomez; Jaume Gibert; Dimosthenis Karatzas
Title	Self-Supervised Learning from Web Data for Multimodal Retrieval			Type	Book Chapter
Year	2019	Publication	Multi-Modal Scene Understanding Book	Abbreviated Journal
Volume		Issue		Pages	279-306
Keywords	self-supervised learning; webly supervised learning; text embeddings; multimodal retrieval; multimodal embedding
Abstract	Self-Supervised learning from multimodal image and text data allows deep neural networks to learn powerful features with no need of human annotated data. Web and Social Media platforms provide a virtually unlimited amount of this multimodal data. In this work we propose to exploit this free available data to learn a multimodal image and text embedding, aiming to leverage the semantic knowledge learnt in the text domain and transfer it to a visual model for semantic image retrieval. We demonstrate that the proposed pipeline can learn from images with associated text without supervision and analyze the semantic structure of the learnt joint image and text embedding space. We perform a thorough analysis and performance comparison of five different state of the art text embeddings in three different benchmarks. We show that the embeddings learnt with Web and Social Media data have competitive performances over supervised methods in the text based image retrieval task, and we clearly outperform state of the art in the MIRFlickr dataset when training in the target data. Further, we demonstrate how semantic multimodal image retrieval can be performed using the learnt embeddings, going beyond classical instance-level retrieval problems. Finally, we present a new dataset, InstaCities1M, composed by Instagram images and their associated texts that can be used for fair comparison of image-text embeddings.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	DAG; 600.129; 601.338; 601.310			Approved	no
Call Number	Admin @ si @ GGG2019			Serial	3266
Permanent link to this record



Author	Ernest Valveny; Salvatore Tabbone; Oriol Ramos Terrades
Title	Performance Characterization of Shape Descriptors for Symbol Representation			Type	Book Chapter
Year	2008	Publication	Graphics Recognition: Recent Advances and New Opportunities	Abbreviated Journal
Volume	5046	Issue		Pages	278–287
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor	W. Liu, J. Llados, J.M. Ogier
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	DAG			Approved	no
Call Number	DAG @ dag @ VTR2008			Serial	985
Permanent link to this record



Author	Sergio Vera; Miguel Angel Gonzalez Ballester; Debora Gil
Title	Optimal Medial Surface Generation for Anatomical Volume Representations			Type	Book Chapter
Year	2012	Publication	Abdominal Imaging. Computational and Clinical Applications	Abbreviated Journal	LNCS
Volume	7601	Issue		Pages	265-273
Keywords	Medial surface representation; volume reconstruction
Abstract	Medial representations are a widely used technique in abdominal organ shape representation and parametrization. Those methods require good medial manifolds as a starting point. Any medial surface used to parametrize a volume should be simple enough to allow an easy manipulation and complete enough to allow an accurate reconstruction of the volume. Obtaining good quality medial surfaces is still a problem with current iterative thinning methods. This forces the usage of generic, pre-calculated medial templates that are adapted to the final shape at the cost of a drop in volume reconstruction. This paper describes an operator for generation of medial structures that generates clean and complete manifolds well suited for their further use in medial representations of abdominal organ volumes. While being simpler than thinning surfaces, experiments show its high performance in volume reconstruction and preservation of medial surface main branching topology.
Address	Nice, France
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor	Yoshida, Hiroyuki and Hawkes, David and Vannier, Michael W.
Language		Summary Language		Original Title
Series Editor		Series Title	Lecture Notes in Computer Science	Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-33611-9	Medium
Area		Expedition		Conference	STACOM
Notes	IAM			Approved	no
Call Number	IAM @ iam @ VGG2012b			Serial	1988
Permanent link to this record



Author	Mathieu Nicolas Delalandre; Jean-Yves Ramel; Ernest Valveny; Muhammad Muzzamil Luqman
Title	A Performance Characterization Algorithm for Symbol Localization			Type	Book Chapter
Year	2010	Publication	Graphics Recognition. Achievements, Challenges, and Evolution. 8th International Workshop, GREC 2009. Selected Papers	Abbreviated Journal
Volume	6020	Issue		Pages	260–271
Keywords
Abstract	In this paper we present an algorithm for performance characterization of symbol localization systems. This algorithm is aimed to be a more “reliable” and “open” solution to characterize the performance. To achieve that, it exploits only single points as the result of localization and offers the possibility to reconsider the localization results provided by a system. We use the information about context in groundtruth, and overall localization results, to detect the ambiguous localization results. A probability score is computed for each matching between a localization point and a groundtruth region, depending on the spatial distribution of the other regions in the groundtruth. Final characterization is given with detection rate/probability score plots, describing the sets of possible interpretations of the localization results, according to a given confidence rate. We present experimentation details along with the results for the symbol localization system of [1], exploiting a synthetic dataset of architectural floorplans and electrical diagrams (composed of 200 images and 3861 symbols).
Address
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-13727-3	Medium
Area		Expedition		Conference	GREC
Notes	DAG			Approved	no
Call Number	Admin @ si @ DRV2010			Serial	2406
Permanent link to this record



Author	Fadi Dornaika; Bogdan Raducanu
Title	Subtle Facial Expression Recognition in Still Images and Videos			Type	Book Chapter
Year	2011	Publication	Advances in Face Image Analysis: Techniques and Technologies	Abbreviated Journal
Volume		Issue	14	Pages	259-277
Keywords
Abstract	This chapter addresses the recognition of basic facial expressions. It has three main contributions. First, the authors introduce a view- and texture-independent scheme that exploits facial action parameters estimated by an appearance-based 3D face tracker. They represent the learned facial actions associated with different facial expressions by time series. Two dynamic recognition schemes are proposed: (1) the first is based on conditional predictive models and on an analysis-synthesis scheme, and (2) the second is based on examples allowing straightforward use of machine learning approaches. Second, the authors propose an efficient recognition scheme based on the detection of keyframes in videos. Third, the authors compare the dynamic scheme with a static one based on analyzing individual snapshots and show that in general the former performs better than the latter. The authors then provide evaluations of performance using Linear Discriminant Analysis (LDA), Non-parametric Discriminant Analysis (NDA), and Support Vector Machines (SVM).
Address
Corporate Author				Thesis
Publisher	IGI-Global	Place of Publication	New York, USA	Editor	Yu-Jin Zhang
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-6152-0991-0	Medium
Area		Expedition		Conference
Notes	OR;MV			Approved	no
Call Number	Admin @ si @ DoR2011			Serial	1751
Permanent link to this record



Author	Jorge Bernal; Fernando Vilariño; F. Javier Sanchez
Title	Towards Intelligent Systems for Colonoscopy			Type	Book Chapter
Year	2011	Publication	Colonoscopy	Abbreviated Journal
Volume	1	Issue		Pages	257-282
Keywords
Abstract	In this chapter we present tools that can be used to build intelligent systems for colonoscopy. The idea is, by using methods based on computer vision and artificial intelligence, add significant value to the colonoscopy procedure. Intelligent systems are being used to assist in other medical interventions
Address
Corporate Author				Thesis
Publisher	Intech	Place of Publication		Editor	Paul Miskovitz
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-953-307-568-6	Medium
Area	800	Expedition		Conference
Notes	MV;SIAI			Approved	no
Call Number	IAM @ iam @ BVS2011			Serial	1697
Permanent link to this record



Author	Antonio Lopez; W. Niessen; Joan Serrat; K. Nikolay; B. Ter Haar Romeny; Juan J. Villanueva; M. Viergerver
Title	New improvements in the multiscale analysis of trabecular bone patterns			Type	Book Chapter
Year	2000	Publication	Pattern Recognition and Applications	Abbreviated Journal
Volume		Issue		Pages	251-260
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher	IOS Press	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ADAS			Approved	no
Call Number	Admin @ si @			Serial	3418
Permanent link to this record



Author	Santiago Segui; Laura Igual; Fernando Vilariño; Petia Radeva; Carolina Malagelada; Fernando Azpiroz; Jordi Vitria
Title	Diagnostic System for Intestinal Motility Disfunctions Using Video Capsule Endoscopy			Type	Book Chapter
Year	2008	Publication	Computer Vision Systems. 6th International	Abbreviated Journal
Volume	5008	Issue		Pages	251–260
Keywords
Abstract	Wireless Video Capsule Endoscopy is a clinical technique consisting of the analysis of images from the intestine which are provided by an ingestible device with a camera attached to it. In this paper we propose an automatic system to diagnose severe intestinal motility disfunctions using the video endoscopy data. The system is based on the application of computer vision techniques within a machine learning framework in order to obtain the characterization of diverse motility events from video sequences. We present experimental results that demonstrate the effectiveness of the proposed system and compare them with the ground-truth provided by the gastroenterologists.
Address	Santorini (Greece)
Corporate Author				Thesis
Publisher	Springer-Verlag	Place of Publication	Berlin Heidelberg	Editor	A. Gasteratos, M. Vincze, and J.K. Tsotsos
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN	978-3-540-79546-9	Medium
Area	800	Expedition		Conference	ICVS
Notes	OR; MV; MILAB; SIAI			Approved	no
Call Number	BCNPCL @ bcnpcl @ SIV2008; IAM @ iam @ SIV2008			Serial	962
Permanent link to this record



Author	Jordina Torrents-Barrena; Aida Valls; Petia Radeva; Meritxell Arenas; Domenec Puig
Title	Automatic Recognition of Molecular Subtypes of Breast Cancer in X-Ray images using Segmentation-based Fractal Texture Analysis			Type	Book Chapter
Year	2015	Publication	Artificial Intelligence Research and Development	Abbreviated Journal
Volume	277	Issue		Pages	247 - 256
Keywords
Abstract	Breast cancer disease has recently been classified into four subtypes regarding the molecular properties of the affected tumor region. For each patient, an accurate diagnosis of the specific type is vital to decide the most appropriate therapy in order to enhance life prospects. Nowadays, advanced therapeutic diagnosis research is focused on gene selection methods, which are not robust enough. Hence, we hypothesize that computer vision algorithms can offer benefits to address the problem of discriminating among them through X-Ray images. In this paper, we propose a novel approach driven by texture feature descriptors and machine learning techniques. First, we segment the tumour part through an active contour technique and then, we perform a complete fractal analysis to collect qualitative information of the region of interest in the feature extraction stage. Finally, several supervised and unsupervised classifiers are used to perform multiclass classification of the aforementioned data. The experimental results presented in this paper support that it is possible to establish a relation between each tumor subtype and the extracted features of the patterns revealed on mammograms.
Address
Corporate Author				Thesis
Publisher	IOS Press	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title	Frontiers in Artificial Intelligence and Applications	Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	MILAB			Approved	no
Call Number	Admin @ si @ TVR2015			Serial	2780
Permanent link to this record



Author	Partha Pratim Roy; Eduard Vazquez; Josep Llados; Ramon Baldrich; Umapada Pal
Title	A System to Segment Text and Symbols from Color Maps			Type	Book Chapter
Year	2008	Publication	Graphics Recognition. Recent Advances and New Opportunities	Abbreviated Journal
Volume	5046	Issue		Pages	245-256
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	DAG;CIC			Approved	no
Call Number	CAT @ cat @ RVL2008			Serial	1005
Permanent link to this record



Author	Antonio Lopez; Jiaolong Xu; Jose Luis Gomez; David Vazquez; German Ros
Title	From Virtual to Real World Visual Perception using Domain Adaptation -- The DPM as Example			Type	Book Chapter
Year	2017	Publication	Domain Adaptation in Computer Vision Applications	Abbreviated Journal
Volume		Issue	13	Pages	243-258
Keywords	Domain Adaptation
Abstract	Supervised learning tends to produce more accurate classifiers than unsupervised learning in general. This implies that training data is preferred with annotations. When addressing visual perception challenges, such as localizing certain object classes within an image, the learning of the involved classifiers turns out to be a practical bottleneck. The reason is that, at least, we have to frame object examples with bounding boxes in thousands of images. A priori, the more complex the model is regarding its number of parameters, the more annotated examples are required. This annotation task is performed by human oracles, which ends up in inaccuracies and errors in the annotations (aka ground truth) since the task is inherently very cumbersome and sometimes ambiguous. As an alternative we have pioneered the use of virtual worlds for collecting such annotations automatically and with high precision. However, since the models learned with virtual data must operate in the real world, we still need to perform domain adaptation (DA). In this chapter we revisit the DA of a deformable part-based model (DPM) as an exemplifying case of virtual-to-real-world DA. As a use case, we address the challenge of vehicle detection for driver assistance, using different publicly available virtual-world data. While doing so, we investigate questions such as: how does the domain gap behave due to virtual-vs-real data with respect to dominant object appearance per domain, as well as the role of photo-realism in the virtual world.
Address
Corporate Author				Thesis
Publisher	Springer	Place of Publication		Editor	Gabriela Csurka
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ADAS; 600.085; 601.223; 600.076; 600.118			Approved	no
Call Number	ADAS @ adas @ LXG2017			Serial	2872
Permanent link to this record



Author	Xavier Baro; Jordi Vitria
Title	Evolutionary Object Detection by Means of Naive Bayes Models Estimation			Type	Book Chapter
Year	2008	Publication	Applications of Evolutionary Computing. EvoWorkshops	Abbreviated Journal
Volume	4974	Issue		Pages	235–244
Keywords
Abstract
Address	Naples (Italy)
Corporate Author				Thesis
Publisher		Place of Publication		Editor	M. Giacobini
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	OR;HuPBA;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ BaV2008a			Serial	976
Permanent link to this record



Author	Debora Gil; Petia Radeva
Title	Inhibition of False Landmarks			Type	Book Chapter
Year	2004	Publication	Recent Advances in Artificial Intelligence Research and Development	Abbreviated Journal
Volume		Issue		Pages	233-244
Keywords
Abstract	We argue that a corner detector should be based on the degree of continuity of the tangent vector to the image level sets, work on the image domain and need no assumptions on either the image local structure or the particular geometry of the corner/junction. An operator measuring the degree of differentiability of the projection matrix on the image gradient fulfills the above requirements. Its high sensitivity to changes in vector directions makes it suitable for landmark location in real images prone to need smoothing to reduce the impact of noise. Because using smoothing kernels leads to corner misplacement, we suggest an alternative fake response remover based on the receptive field inhibition of spurious details. The combination of both orientation discontinuity detection and noise inhibition produces our Inhibition Orientation Energy (IOE) landmark locator.
Address
Corporate Author				Thesis
Publisher	IOS Press	Place of Publication	Barcelona (Spain)	Editor	J.V. et al.
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	IAM;MILAB			Approved	no
Call Number	IAM @ iam @ GiR2004a			Serial	1533
Permanent link to this record



Author	German Ros; Laura Sellart; Gabriel Villalonga; Elias Maidanik; Francisco Molero; Marc Garcia; Adriana Cedeño; Francisco Perez; Didier Ramirez; Eduardo Escobar; Jose Luis Gomez; David Vazquez; Antonio Lopez
Title	Semantic Segmentation of Urban Scenes via Domain Adaptation of SYNTHIA			Type	Book Chapter
Year	2017	Publication	Domain Adaptation in Computer Vision Applications	Abbreviated Journal
Volume	12	Issue		Pages	227-241
Keywords	SYNTHIA; Virtual worlds; Autonomous Driving
Abstract	Vision-based semantic segmentation in urban scenarios is a key functionality for autonomous driving. Recent revolutionary results of deep convolutional neural networks (DCNNs) foreshadow the advent of reliable classifiers to perform such visual tasks. However, DCNNs require learning of many parameters from raw images; thus, having a sufficient amount of diverse images with class annotations is needed. These annotations are obtained via cumbersome, human labour which is particularly challenging for semantic segmentation since pixel-level annotations are required. In this chapter, we propose to use a combination of a virtual world to automatically generate realistic synthetic images with pixel-level annotations, and domain adaptation to transfer the models learnt to correctly operate in real scenarios. We address the question of how useful synthetic data can be for semantic segmentation – in particular, when using a DCNN paradigm. In order to answer this question we have generated a synthetic collection of diverse urban images, named SYNTHIA, with automatically generated class annotations and object identifiers. We use SYNTHIA in combination with publicly available real-world urban images with manually provided annotations. Then, we conduct experiments with DCNNs that show that combining SYNTHIA with simple domain adaptation techniques in the training stage significantly improves performance on semantic segmentation.
Address
Corporate Author				Thesis
Publisher	Springer	Place of Publication		Editor	Gabriela Csurka
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ADAS; 600.085; 600.082; 600.076; 600.118			Approved	no
Call Number	ADAS @ adas @ RSV2017			Serial	2882
Permanent link to this record



Author	Sergio Vera; Debora Gil; Agnes Borras; F. Javier Sanchez; Frederic Perez; Marius G. Linguraru; Miguel Angel Gonzalez Ballester
Title	Computation and Evaluation of Medial Surfaces for Shape Representation of Abdominal Organs			Type	Book Chapter
Year	2012	Publication	Workshop on Computational and Clinical Applications in Abdominal Imaging	Abbreviated Journal
Volume	7029	Issue		Pages	223–230
Keywords	medial manifolds; abdomen
Abstract	Medial representations are powerful tools for describing and parameterizing the volumetric shape of anatomical structures. Existing methods show excellent results when applied to 2D objects, but their quality drops across dimensions. This paper contributes to the computation of medial manifolds in two aspects. First, we provide a standard scheme for the computation of medial manifolds that avoid degenerated medial axis segments; second, we introduce an energy based method which performs independently of the dimension. We evaluate quantitatively the performance of our method with respect to existing approaches, by applying them to synthetic shapes of known medial geometry. Finally, we show results on shape representation of multiple abdominal organs, exploring the use of medial manifolds for the representation of multi-organ relations.
Address	Toronto, Canada
Corporate Author				Thesis
Publisher	Springer Link	Place of Publication	Berlin	Editor	H. Yoshida et al
Language	English	Summary Language	English	Original Title
Series Editor		Series Title	Lecture Notes in Computer Science	Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-28556-1	Medium
Area		Expedition		Conference	ABDI
Notes	IAM;MV			Approved	no
Call Number	IAM @ iam @ VGB2012			Serial	1834
Permanent link to this record