|
David Vazquez, Antonio Lopez, Daniel Ponsa and Javier Marin. 2011. Virtual Worlds and Active Learning for Human Detection. 13th International Conference on Multimodal Interaction. New York, NY, USA, USA, ACM DL, 393–400.
Abstract: Image based human detection is of paramount interest due to its potential applications in fields such as advanced driving assistance, surveillance and media analysis. However, even detecting non-occluded standing humans remains a challenge of intensive research. The most promising human detectors rely on classifiers developed in the discriminative paradigm, i.e., trained with labelled samples. However, labeling is a manual intensive step, especially in cases like human detection where it is necessary to provide at least bounding boxes framing the humans for training. To overcome such problem, some authors have proposed the use of a virtual world where the labels of the different objects are obtained automatically. This means that the human models (classifiers) are learnt using the appearance of rendered images, i.e., using realistic computer graphics. Later, these models are used for human detection in images of the real world. The results of this technique are surprisingly good. However, these are not always as good as the classical approach of training and testing with data coming from the same camera, or similar ones. Accordingly, in this paper we address the challenge of using a virtual world for gathering (while playing a videogame) a large amount of automatically labelled samples (virtual humans and background) and then training a classifier that performs equal, in real-world images, than the one obtained by equally training from manually labelled real-world samples. For doing that, we cast the problem as one of domain adaptation. In doing so, we assume that a small amount of manually labelled samples from real-world images is required. To collect these labelled samples we propose a non-standard active learning technique. Therefore, ultimately our human model is learnt by the combination of virtual and real world labelled samples (Fig. 1), which has not been done before. We present quantitative results showing that this approach is valid.
Keywords: Pedestrian Detection; Human detection; Virtual; Domain Adaptation; Active Learning
|
|
|
Muhammad Anwer Rao, David Vazquez and Antonio Lopez. 2011. Opponent Colors for Human Detection. In J. Vitria, J.M. Sanches and M. Hernandez, eds. 5th Iberian Conference on Pattern Recognition and Image Analysis. Berlin Heidelberg, Springer, 363–370. (LNCS.)
Abstract: Human detection is a key component in fields such as advanced driving assistance and video surveillance. However, even detecting non-occluded standing humans remains a challenge of intensive research. Finding good features to build human models for further detection is probably one of the most important issues to face. Currently, shape, texture and motion features have deserve extensive attention in the literature. However, color-based features, which are important in other domains (e.g., image categorization), have received much less attention. In fact, the use of RGB color space has become a kind of choice by default. The focus has been put in developing first and second order features on top of RGB space (e.g., HOG and co-occurrence matrices, resp.). In this paper we evaluate the opponent colors (OPP) space as a biologically inspired alternative for human detection. In particular, by feeding OPP space in the baseline framework of Dalal et al. for human detection (based on RGB, HOG and linear SVM), we will obtain better detection performance than by using RGB space. This is a relevant result since, up to the best of our knowledge, OPP space has not been previously used for human detection. This suggests that in the future it could be worth to compute co-occurrence matrices, self-similarity features, etc., also on top of OPP space, i.e., as we have done with HOG in this paper.
Keywords: Pedestrian Detection; Color; Part Based Models
|
|
|
Javier Marin, David Vazquez, David Geronimo and Antonio Lopez. 2010. Learning Appearance in Virtual Scenarios for Pedestrian Detection. 23rd IEEE Conference on Computer Vision and Pattern Recognition.137–144.
Abstract: Detecting pedestrians in images is a key functionality to avoid vehicle-to-pedestrian collisions. The most promising detectors rely on appearance-based pedestrian classifiers trained with labelled samples. This paper addresses the following question: can a pedestrian appearance model learnt in virtual scenarios work successfully for pedestrian detection in real images? (Fig. 1). Our experiments suggest a positive answer, which is a new and relevant conclusion for research in pedestrian detection. More specifically, we record training sequences in virtual scenarios and then appearance-based pedestrian classifiers are learnt using HOG and linear SVM. We test such classifiers in a publicly available dataset provided by Daimler AG for pedestrian detection benchmarking. This dataset contains real world images acquired from a moving car. The obtained result is compared with the one given by a classifier learnt using samples coming from real images. The comparison reveals that, although virtual samples were not specially selected, both virtual and real based training give rise to classifiers of similar performance.
Keywords: Pedestrian Detection; Domain Adaptation
|
|
|
Muhammad Anwer Rao, David Vazquez and Antonio Lopez. 2011. Color Contribution to Part-Based Person Detection in Different Types of Scenarios. In P. Real, D.D., H. Molina, A. Berciano, W. Kropatsch, ed. 14th International Conference on Computer Analysis of Images and Patterns. Berlin Heidelberg, Springer, 463–470.
Abstract: Camera-based person detection is of paramount interest due to its potential applications. The task is diffcult because the great variety of backgrounds (scenarios, illumination) in which persons are present, as well as their intra-class variability (pose, clothe, occlusion). In fact, the class person is one of the included in the popular PASCAL visual object classes (VOC) challenge. A breakthrough for this challenge, regarding person detection, is due to Felzenszwalb et al. These authors proposed a part-based detector that relies on histograms of oriented gradients (HOG) and latent support vector machines (LatSVM) to learn a model of the whole human body and its constitutive parts, as well as their relative position. Since the approach of Felzenszwalb et al. appeared new variants have been proposed, usually giving rise to more complex models. In this paper, we focus on an issue that has not attracted suficient interest up to now. In particular, we refer to the fact that HOG is usually computed from RGB color space, but other possibilities exist and deserve the corresponding investigation. In this paper we challenge RGB space with the opponent color space (OPP), which is inspired in the human vision system.We will compute the HOG on top of OPP, then we train and test the part-based human classifer by Felzenszwalb et al. using PASCAL VOC challenge protocols and person database. Our experiments demonstrate that OPP outperforms RGB. We also investigate possible differences among types of scenarios: indoor, urban and countryside. Interestingly, our experiments suggest that the beneficts of OPP with respect to RGB mainly come for indoor and countryside scenarios, those in which the human visual system was designed by evolution.
Keywords: Pedestrian Detection; Color
|
|
|
A. Pujol, Javier Varona and Joan Serrat. 1997. A machine vision system for the inspection of industrial sieves. (SNRFAI’97) 7th Spanish National Symposium on Pattern Recognition and Image Analysis.
|
|
|
Craig Von Land, Ricardo Toledo and Juan J. Villanueva. 1997. TeleRegions: Application of Telematics in Cardiac Care. Computers In Cardiology.195–198.
|
|
|
W. Niessen, Antonio Lopez, W. Van Enk, P. Van Roermund, Bart M. Ter Haar Romeny and M. Viergever. 1997. Multiscale Trabecular Bone Orientation Analysis. (SNRFAI’97) 7th Spanish National Symposium on Pattern Recognition and Image Analysis.19–24.
|
|
|
W. Niessen, Antonio Lopez, W. Van Enk, P. Van Roermund, Bart M. Ter Haar Romeny and M. Viergever. 1997. In Vivo Analysis of Trabecular Bone Architecture. Information Processing in Medical Imaging. IMPI 1997.435–440. (LNCS.)
|
|
|
Antonio Lopez and Joan Serrat. 1996. Tracing crease curves by solving a system of differential equations. ECCV 1996. (LNCS.)
|
|
|
Craig Von Land, Ricardo Toledo and Juan J. Villanueva. 1996. CARE: Computer Assisted Radiology Environment. Tecnologia de Imagenes Medicas, Convencion Iberoamericana sobre la Salud en la Sociedad Global de la Informacion..
|
|