Publicacions CVC -- Query Results

[1–10] << 11 12 13 14 15 16 17 18 19 20 >> [21–30]

Details

Records
Author	Ciprian Corneanu; Sergio Escalera; Aleix M. Martinez
Title	Computing the Testing Error Without a Testing Set			Type	Conference Article
Year	2020	Publication	33rd IEEE Conference on Computer Vision and Pattern Recognition	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract	Oral. Paper award nominee. Deep Neural Networks (DNNs) have revolutionized computer vision. We now have DNNs that achieve top (performance) results in many problems, including object recognition, facial expression analysis, and semantic segmentation, to name but a few. The design of the DNNs that achieve top results is, however, non-trivial and mostly done by trailand-error. That is, typically, researchers will derive many DNN architectures (i.e., topologies) and then test them on multiple datasets. However, there are no guarantees that the selected DNN will perform well in the real world. One can use a testing set to estimate the performance gap between the training and testing sets, but avoiding overfitting-to-thetesting-data is almost impossible. Using a sequestered testing dataset may address this problem, but this requires a constant update of the dataset, a very expensive venture. Here, we derive an algorithm to estimate the performance gap between training and testing that does not require any testing dataset. Specifically, we derive a number of persistent topology measures that identify when a DNN is learning to generalize to unseen samples. This allows us to compute the DNN’s testing error on unseen samples, even when we do not have access to them. We provide extensive experimental validation on multiple networks and datasets to demonstrate the feasibility of the proposed approach.
Address	Virtual CVPR
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	CVPR
Notes	HuPBA; no proj			Approved	no
Call Number	Admin @ si @ CEM2020			Serial	3437
Permanent link to this record



Author	Swathikiran Sudhakaran; Sergio Escalera; Oswald Lanz
Title	Gate-Shift Networks for Video Action Recognition			Type	Conference Article
Year	2020	Publication	33rd IEEE Conference on Computer Vision and Pattern Recognition	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract	Deep 3D CNNs for video action recognition are designed to learn powerful representations in the joint spatio-temporal feature space. In practice however, because of the large number of parameters and computations involved, they may under-perform in the lack of sufficiently large datasets for training them at scale. In this paper we introduce spatial gating in spatial-temporal decomposition of 3D kernels. We implement this concept with Gate-Shift Module (GSM). GSM is lightweight and turns a 2D-CNN into a highly efficient spatio-temporal feature extractor. With GSM plugged in, a 2D-CNN learns to adaptively route features through time and combine them, at almost no additional parameters and computational overhead. We perform an extensive evaluation of the proposed module to study its effectiveness in video action recognition, achieving state-of-the-art results on Something Something-V1 and Diving48 datasets, and obtaining competitive results on EPIC-Kitchens with far less model complexity.
Address	Virtual CVPR
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	CVPR
Notes	HuPBA; no proj			Approved	no
Call Number	Admin @ si @ SEL2020			Serial	3438
Permanent link to this record



Author	Yaxing Wang; Joost Van de Weijer; Lu Yu; Shangling Jui
Title	Distilling GANs with Style-Mixed Triplets for X2I Translation with Limited Data			Type	Conference Article
Year	2022	Publication	10th International Conference on Learning Representations	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract	Conditional image synthesis is an integral part of many X2I translation systems, including image-to-image, text-to-image and audio-to-image translation systems. Training these large systems generally requires huge amounts of training data. Therefore, we investigate knowledge distillation to transfer knowledge from a high-quality unconditioned generative model (e.g., StyleGAN) to a conditioned synthetic image generation modules in a variety of systems. To initialize the conditional and reference branch (from a unconditional GAN) we exploit the style mixing characteristics of high-quality GANs to generate an infinite supply of style-mixed triplets to perform the knowledge distillation. Extensive experimental results in a number of image generation tasks (i.e., image-to-image, semantic segmentation-to-image, text-to-image and audio-to-image) demonstrate qualitatively and quantitatively that our method successfully transfers knowledge to the synthetic image generation modules, resulting in more realistic images than previous methods as confirmed by a significant drop in the FID.
Address	Virtual
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICLR
Notes	LAMP; 600.147			Approved	no
Call Number	Admin @ si @ WWY2022			Serial	3791
Permanent link to this record



Author	Eduardo Aguilar; Bhalaji Nagarajan; Rupali Khatun; Marc Bolaños; Petia Radeva
Title	Uncertainty Modeling and Deep Learning Applied to Food Image Analysis			Type	Conference Article
Year	2020	Publication	13th International Joint Conference on Biomedical Engineering Systems and Technologies	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract	Recently, computer vision approaches specially assisted by deep learning techniques have shown unexpected advancements that practically solve problems that never have been imagined to be automatized like face recognition or automated driving. However, food image recognition has received a little effort in the Computer Vision community. In this project, we review the field of food image analysis and focus on how to combine with two challenging research lines: deep learning and uncertainty modeling. After discussing our methodology to advance in this direction, we comment potential research, social and economic impact of the research on food image analysis.
Address	Villetta; Malta; February 2020
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	BIODEVICES
Notes	MILAB			Approved	no
Call Number	Admin @ si @ ANK2020			Serial	3526
Permanent link to this record



Author	Santiago Segui; Michal Drozdzal; Petia Radeva; Jordi Vitria
Title	An Integrated Approach to Contextual Face Detection			Type	Conference Article
Year	2012	Publication	1st International Conference on Pattern Recognition Applications and Methods	Abbreviated Journal
Volume		Issue		Pages	143-150
Keywords
Abstract	Face detection is, in general, based on content-based detectors. Nevertheless, the face is a non-rigid object with well defined relations with respect to the human body parts. In this paper, we propose to take benefit of the context information in order to improve content-based face detections. We propose a novel framework for integrating multiple content- and context-based detectors in a discriminative way. Moreover, we develop an integrated scoring procedure that measures the ’faceness’ of each hypothesis and is used to discriminate the detection results. Our approach detects a higher rate of faces while minimizing the number of false detections, giving an average increase of more than 10% in average precision when comparing it to state-of-the art face detectors
Address	Vilamoura, Algarve, Portugal
Corporate Author				Thesis
Publisher	Springer	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICPRAM
Notes	MILAB; OR;MV			Approved	no
Call Number	Admin @ si @ SDR2012			Serial	1895
Permanent link to this record



Author	David Masip; Jordi Vitria
Title	Real Time Face Detection and Verification for Uncontrolled Environments			Type	Miscellaneous
Year	2004	Publication	Second COST 275 Workshop Biometrics on the Internet: Fundamentals, Advances and Applications, 55–58.	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Vigo
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	OR;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ MaV2004a			Serial	446
Permanent link to this record



Author	L.Tarazon; D. Perez; N. Serrano; V. Alabau; Oriol Ramos Terrades; A. Sanchis; A. Juan
Title	Confidence Measures for Error Correction in Interactive Transcription of Handwritten Text			Type	Conference Article
Year	2009	Publication	15th International Conference on Image Analysis and Processing	Abbreviated Journal
Volume	5716	Issue		Pages	567-574
Keywords
Abstract	An effective approach to transcribe old text documents is to follow an interactive-predictive paradigm in which both, the system is guided by the human supervisor, and the supervisor is assisted by the system to complete the transcription task as efficiently as possible. In this paper, we focus on a particular system prototype called GIDOC, which can be seen as a first attempt to provide user-friendly, integrated support for interactive-predictive page layout analysis, text line detection and handwritten text transcription. More specifically, we focus on the handwriting recognition part of GIDOC, for which we propose the use of confidence measures to guide the human supervisor in locating possible system errors and deciding how to proceed. Empirical results are reported on two datasets showing that a word error rate not larger than a 10% can be achieved by only checking the 32% of words that are recognised with less confidence.
Address	Vietri sul Mare, Italy
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-04145-7	Medium
Area		Expedition		Conference	ICIAP
Notes	DAG			Approved	no
Call Number	Admin @ si @ TPS2009			Serial	1871
Permanent link to this record



Author	Andreas Fischer; Ching Y. Suen; Volkmar Frinken; Kaspar Riesen; Horst Bunke
Title	A Fast Matching Algorithm for Graph-Based Handwriting Recognition			Type	Conference Article
Year	2013	Publication	9th IAPR – TC15 Workshop on Graph-based Representation in Pattern Recognition	Abbreviated Journal
Volume	7877	Issue		Pages	194-203
Keywords
Abstract	The recognition of unconstrained handwriting images is usually based on vectorial representation and statistical classification. Despite their high representational power, graphs are rarely used in this field due to a lack of efficient graph-based recognition methods. Recently, graph similarity features have been proposed to bridge the gap between structural representation and statistical classification by means of vector space embedding. This approach has shown a high performance in terms of accuracy but had shortcomings in terms of computational speed. The time complexity of the Hungarian algorithm that is used to approximate the edit distance between two handwriting graphs is demanding for a real-world scenario. In this paper, we propose a faster graph matching algorithm which is derived from the Hausdorff distance. On the historical Parzival database it is demonstrated that the proposed method achieves a speedup factor of 12.9 without significant loss in recognition accuracy.
Address	Vienna; Austria; May 2013
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-38220-8	Medium
Area		Expedition		Conference	GBR
Notes	DAG; 600.045; 605.203			Approved	no
Call Number	Admin @ si @ FSF2013			Serial	2294
Permanent link to this record



Author	Manuel Carbonell; Mauricio Villegas; Alicia Fornes; Josep Llados
Title	Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model			Type	Conference Article
Year	2018	Publication	13th IAPR International Workshop on Document Analysis Systems	Abbreviated Journal
Volume		Issue		Pages	399-404
Keywords	Named entity recognition; Handwritten Text Recognition; neural networks
Abstract	When extracting information from handwritten documents, text transcription and named entity recognition are usually faced as separate subsequent tasks. This has the disadvantage that errors in the first module affect heavily the performance of the second module. In this work we propose to do both tasks jointly, using a single neural network with a common architecture used for plain text recognition. Experimentally, the work has been tested on a collection of historical marriage records. Results of experiments are presented to show the effect on the performance for different configurations: different ways of encoding the information, doing or not transfer learning and processing at text line or multi-line region level. The results are comparable to state of the art reported in the ICDAR 2017 Information Extraction competition, even though the proposed technique does not use any dictionaries, language modeling or post processing.
Address	Vienna; Austria; April 2018
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	DAS
Notes	DAG; 600.097; 603.057; 601.311; 600.121			Approved	no
Call Number	Admin @ si @ CVF2018			Serial	3170
Permanent link to this record



Author	Florin Popescu; Stephane Ayache; Sergio Escalera; Xavier Baro; Cecile Capponi; Patrick Panciatici; Isabelle Guyon
Title	From geospatial observations of ocean currents to causal predictors of spatio-economic activity using computer vision and machine learning			Type	Conference Article
Year	2016	Publication	European Geosciences Union General Assembly	Abbreviated Journal
Volume	18	Issue		Pages
Keywords
Abstract	The big data transformation currently revolutionizing science and industry forges novel possibilities in multimodal analysis scarcely imaginable only a decade ago. One of the important economic and industrial problems that stand to benefit from the recent expansion of data availability and computational prowess is the prediction of electricity demand and renewable energy generation. Both are correlates of human activity: spatiotemporal energy consumption patterns in society are a factor of both demand (weather dependent) and supply, which determine cost – a relation expected to strengthen along with increasing renewable energy dependence. One of the main drivers of European weather patterns is the activity of the Atlantic Ocean and in particular its dominant Northern Hemisphere current: the Gulf Stream. We choose this particular current as a test case in part due to larger amount of relevant data and scientific literature available for refinement of analysis techniques. This data richness is due not only to its economic importance but also to its size being clearly visible in radar and infrared satellite imagery, which makes it easier to detect using Computer Vision (CV). The power of CV techniques makes basic analysis thus developed scalable to other smaller and less known, but still influential, currents, which are not just curves on a map, but complex, evolving, moving branching trees in 3D projected onto a 2D image. We investigate means of extracting, from several image modalities (including recently available Copernicus radar and earlier Infrared satellites), a parameterized presentation of the state of the Gulf Stream and its environment that is useful as feature space representation in a machine learning context, in this case with the EC’s H2020-sponsored ‘See.4C’ project, in the context of which data scientists may find novel predictors of spatiotemporal energy flow. Although automated extractors of Gulf Stream position exist, they differ in methodology and result. We shall attempt to extract more complex feature representation including branching points, eddies and parameterized changes in transport and velocity. Other related predictive features will be similarly developed, such as inference of deep water flux long the current path and wider spatial scale features such as Hough transform, surface turbulence indicators and temperature gradient indexes along with multi-time scale analysis of ocean height and temperature dynamics. The geospatial imaging and ML community may therefore benefit from a baseline of open-source techniques useful and expandable to other related prediction and/or scientific analysis tasks.
Address	Vienna; Austria; April 2016
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	EGU
Notes	HuPBA;MV;			Approved	no
Call Number	Admin @ si @ PAE2016			Serial	2772
Permanent link to this record



Author	O. Rodriguez-Leor; E Fernandez-Nofrerias; J. Mauri; C. Garcia; R. Villuendas; V. Valle; Oriol Pujol; Petia Radeva
Title	Intravascular ultrasound segmentation using local binary patterns			Type	Journal
Year	2003	Publication	European Heart Journal (IF: 5.997), ESC Congress 2003	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Vienna (Austria)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	MILAB;HuPBA			Approved	no
Call Number	BCNPCL @ bcnpcl @ RFM2003a			Serial	407
Permanent link to this record



Author	O. Rodriguez-Leor; E Fernandez-Nofrerias; J. Mauri; C. Garcia; R. Villuendas; V. Valle; Misael Rosales; Petia Radeva
Title	Empirical simulation model of intravascular ultrasound			Type	Journal
Year	2003	Publication	European Heart Journal (IF: 5.997), ESC Congress 2003	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Vienna (Austria)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	MILAB			Approved	no
Call Number	BCNPCL @ bcnpcl @ RFM2003b			Serial	408
Permanent link to this record



Author	Antonio Lopez; Cristina Cañero; Joan Serrat; J. Saludes; Felipe Lumbreras; T. Graf
Title	Detection of lane markings based on ridgeness and RANSAC			Type	Miscellaneous
Year	2005	Publication	Proceedings of the 8th International IEEE Conference on Intelligent Transportation Systems, 733–738	Abbreviated Journal
Volume		Issue		Pages
Keywords	lane markings
Abstract
Address	Vienna (Austria)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ADAS			Approved	no
Call Number	ADAS @ adas @ LCS2005			Serial	588
Permanent link to this record



Author	Daniel Ponsa; Antonio Lopez; Felipe Lumbreras; Joan Serrat; T. Graf
Title	3D Vehicle Sensor based on Monocular Vision			Type	Miscellaneous
Year	2005	Publication	Proceedings of the 8th International IEEE Conference on Intelligent Transportation Systems, 1096–1101, ISBN:0–7803–9216–7	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Vienna (Austria)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ADAS			Approved	no
Call Number	ADAS @ adas @ PLL2005			Serial	614
Permanent link to this record



Author	Daniel Ponsa; Antonio Lopez; Joan Serrat; Felipe Lumbreras; T. Graf
Title	Multiple Vehicle 3D Tracking Using an Unscented Kalman Filter			Type	Miscellaneous
Year	2005	Publication	Proceedings of the 8th International IEEE Conference on Intelligent Transportation Systems, 1108–1113, ISBN:0–7803–9216–7	Abbreviated Journal
Volume		Issue		Pages
Keywords	vehicle detection
Abstract
Address	Vienna (Austria)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ADAS			Approved	no
Call Number	ADAS @ adas @ PLS2005			Serial	615
Permanent link to this record