|
Aura Hernandez-Sabate and Debora Gil. 2012. The Benefits of IVUS Dynamics for Retrieving Stable Models of Arteries. In Yasuhiro Honda, ed. Intravascular Ultrasound. Intech, 185–206.
|
|
|
Javier Marin, David Geronimo, David Vazquez and Antonio Lopez. 2012. Pedestrian Detection: Exploring Virtual Worlds. Handbook of Pattern Recognition: Methods and Application. iConcept Press, 145–162.
Abstract: Handbook of pattern recognition will include contributions from university educators and active research experts. This Handbook is intended to serve as a basic reference on methods and applications of pattern recognition. The primary aim of this handbook is providing the community of pattern recognition with a readable, easy to understand resource that covers introductory, intermediate and advanced topics with equal clarity. Therefore, the Handbook of pattern recognition can serve equally well as reference resource and as classroom textbook. Contributions cover all methods, techniques and applications of pattern recognition. A tentative list of relevant topics might include: 1- Statistical, structural, syntactic pattern recognition. 2- Neural networks, machine learning, data mining. 3- Discrete geometry, algebraic, graph-based techniques for pattern recognition. 4- Face recognition, Signal analysis, image coding and processing, shape and texture analysis. 5- Document processing, text and graphics recognition, digital libraries. 6- Speech recognition, music analysis, multimedia systems. 7- Natural language analysis, information retrieval. 8- Biometrics, biomedical pattern analysis and information systems. 9- Other scientific, engineering, social and economical applications of pattern recognition. 10- Special hardware architectures, software packages for pattern recognition.
Keywords: Virtual worlds; Pedestrian Detection; Domain Adaptation
|
|
|
David Vazquez. 2013. Domain Adaptation of Virtual and Real Worlds for Pedestrian Detection. (Ph.D. thesis, Ediciones Graficas Rey.)
Abstract: Pedestrian detection is of paramount interest for many applications, e.g. Advanced Driver Assistance Systems, Intelligent Video Surveillance and Multimedia systems. Most promising pedestrian detectors rely on appearance-based classifiers trained with annotated data. However, the required annotation step represents an intensive and subjective task for humans, what makes worth to minimize their intervention in this process by using computational tools like realistic virtual worlds. The reason to use these kind of tools relies in the fact that they allow the automatic generation of precise and rich annotations of visual information. Nevertheless, the use of this kind of data comes with the following question: can a pedestrian appearance model learnt with virtual-world data work successfully for pedestrian detection in real-world scenarios?. To answer this question, we conduct different experiments that suggest a positive answer. However, the pedestrian classifiers trained with virtual-world data can suffer the so called dataset shift problem as real-world based classifiers does. Accordingly, we have designed different domain adaptation techniques to face this problem, all of them integrated in a same framework (V-AYLA). We have explored different methods to train a domain adapted pedestrian classifiers by collecting a few pedestrian samples from the target domain (real world) and combining them with many samples of the source domain (virtual world). The extensive experiments we present show that pedestrian detectors developed within the V-AYLA framework do achieve domain adaptation. Ideally, we would like to adapt our system without any human intervention. Therefore, as a first proof of concept we also propose an unsupervised domain adaptation technique that avoids human intervention during the adaptation process. To the best of our knowledge, this Thesis work is the first demonstrating adaptation of virtual and real worlds for developing an object detector. Last but not least, we also assessed a different strategy to avoid the dataset shift that consists in collecting real-world samples and retrain with them in such a way that no bounding boxes of real-world pedestrians have to be provided. We show that the generated classifier is competitive with respect to the counterpart trained with samples collected by manually annotating pedestrian bounding boxes. The results presented on this Thesis not only end with a proposal for adapting a virtual-world pedestrian detector to the real world, but also it goes further by pointing out a new methodology that would allow the system to adapt to different situations, which we hope will provide the foundations for future research in this unexplored area.
Keywords: Pedestrian Detection; Domain Adaptation
|
|
|
David Vazquez, Antonio Lopez, Daniel Ponsa and David Geronimo. 2013. Interactive Training of Human Detectors. Multiodal Interaction in Image and Video Applications. Springer Berlin Heidelberg, 169–182.
Abstract: Image based human detection remains as a challenging problem. Most promising detectors rely on classifiers trained with labelled samples. However, labelling is a manual labor intensive step. To overcome this problem we propose to collect images of pedestrians from a virtual city, i.e., with automatic labels, and train a pedestrian detector with them, which works fine when such virtual-world data are similar to testing one, i.e., real-world pedestrians in urban areas. When testing data is acquired in different conditions than training one, e.g., human detection in personal photo albums, dataset shift appears. In previous work, we cast this problem as one of domain adaptation and solve it with an active learning procedure. In this work, we focus on the same problem but evaluating a different set of faster to compute features, i.e., Haar, EOH and their combination. In particular, we train a classifier with virtual-world data, using such features and Real AdaBoost as learning machine. This classifier is applied to real-world training images. Then, a human oracle interactively corrects the wrong detections, i.e., few miss detections are manually annotated and some false ones are pointed out too. A low amount of manual annotation is fixed as restriction. Real- and virtual-world difficult samples are combined within what we call cool world and we retrain the classifier with this data. Our experiments show that this adapted classifier is equivalent to the one trained with only real-world data but requiring 90% less manual annotations.
Keywords: Pedestrian Detection; Virtual World; AdaBoost; Domain Adaptation
|
|
|
Felipe Lumbreras, Ramon Baldrich, Maria Vanrell, Joan Serrat and Juan J. Villanueva. 1999. Multiresolution texture classification of ceramic tiles. Recent Research developments in optical engineering, Research Signpost, 2: 213–228.
|
|
|
Ricardo Toledo. 2001. Cardiac workstation and dynamic model to assist in coronary tree analysis. (Ph.D. thesis, .)
|
|
|
Antonio Lopez. 2000. Multilocal Methods for Ridge and Valley Delineation in Image Analysis. (Ph.D. thesis, .)
|
|
|
Felipe Lumbreras. 2001. Segmentation, classification and modelization of textures by means of multiresolution decomposition techniques..
|
|
|
Angel Sappa, Niki Aifanti, N. Grammalidis and Sotiris Malassiotis. 2004. Advances in Vision-Based Human Body Modeling. In N. Sarris and M. Strintzis., ed. 3D Modeling & Animation: Systhesis and Analysis Techniques for the Human Body.1–26.
|
|
|
Angel Sappa and Fadi Dornaika. 2006. An Edge-Based Approach to Motion Detection. 6th International Conference on Computational Science (ICCS´06), LNCS 3991:563–570.
|
|