|
Alex Goldhoorn, Arnau Ramisa, Ramon Lopez de Mantaras and Ricardo Toledo. 2007. Using the Average Landmark Vector Method for Robot Homing. Artificial Intelligence Research and Development, Proceedings of the 10th International Conference of the ACIA.331–338.
|
|
|
Jaume Amores, N. Sebe and Petia Radeva. 2007. Class-Specific Binaryy Correlograms for Object Recognition. British Machine Vision Conference.
|
|
|
Karel Paleček, David Geronimo and Frederic Lerasle. 2012. Pre-attention cues for person detection. Cognitive Behavioural Systems, COST 2102 International Training School. Springer Berlin Heidelberg, 225–235. (LNCS.)
Abstract: Current state-of-the-art person detectors have been proven reliable and achieve very good detection rates. However, the performance is often far from real time, which limits their use to low resolution images only. In this paper, we deal with candidate window generation problem for person detection, i.e. we want to reduce the computational complexity of a person detector by reducing the number of regions that has to be evaluated. We base our work on Alexe’s paper [1], which introduced several pre-attention cues for generic object detection. We evaluate these cues in the context of person detection and show that their performance degrades rapidly for scenes containing multiple objects of interest such as pictures from urban environment. We extend this set by new cues, which better suits our class-specific task. The cues are designed to be simple and efficient, so that they can be used in the pre-attention phase of a more complex sliding window based person detector.
|
|
|
Angel Sappa, Rosa Herrero, Fadi Dornaika, David Geronimo and Antonio Lopez. 2007. Road Approximation in Euclidean and v-Disparity Space: A Comparative Study. Computer Aided Systems Theory,.1105–1112. (LNCS.)
Abstract: This paper presents a comparative study between two road approximation techniques—planar surfaces—from stereo vision data. The first approach is carried out in the v-disparity space and is based on a voting scheme, the Hough transform. The second one consists in computing the best fitting plane for the whole 3D road data points, directly in the Euclidean space, by using least squares fitting. The comparative study is initially performed over a set of different synthetic surfaces
(e.g., plane, quadratic surface, cubic surface) digitized by a virtual stereo head; then real data obtained with a commercial stereo head are used. The comparative study is intended to be used as a criterion for fining the best technique according to the road geometry. Additionally, it highlights common problems driven from a wrong assumption about the scene’s prior knowledge.
|
|
|
Angel Sappa, Fadi Dornaika, David Geronimo and Antonio Lopez. 2007. Efficient On-Board Stereo Vision Pose Estimation. Computer Aided Systems Theory, Selected paper from.1183–1190. (LNCS.)
Abstract: This paper presents an efficient technique for real time estimation of on-board stereo vision system pose. The whole process is performed in the Euclidean space and consists of two stages. Initially, a compact representation of the original 3D data points is computed. Then, a RANSAC based least squares approach is used for fitting a plane to the 3D road points. Fast RANSAC fitting is obtained by selecting points according to a probability distribution function that takes into account the density of points at a given depth. Finally, stereo camera position
and orientation—pose—is computed relative to the road plane. The proposed technique is intended to be used on driver assistance systems for applications such as obstacle or pedestrian detection. A real time performance is reached. Experimental results on several environments and comparisons with a previous work are presented.
|
|
|
Joan Serrat, Jordi Vitria and J. Pladellorens. 1991. Morphological Segmentation of Heart Scintigraphic image Sequences. Computer Assisted Radiology..
|
|
|
Yi Xiao, Felipe Codevilla, Christopher Pal and Antonio Lopez. 2020. Action-Based Representation Learning for Autonomous Driving. Conference on Robot Learning.
Abstract: Human drivers produce a vast amount of data which could, in principle, be used to improve autonomous driving systems. Unfortunately, seemingly straightforward approaches for creating end-to-end driving models that map sensor data directly into driving actions are problematic in terms of interpretability, and typically have significant difficulty dealing with spurious correlations. Alternatively, we propose to use this kind of action-based driving data for learning representations. Our experiments show that an affordance-based driving model pre-trained with this approach can leverage a relatively small amount of weakly annotated imagery and outperform pure end-to-end driving models, while being more interpretable. Further, we demonstrate how this strategy outperforms previous methods based on learning inverse dynamics models as well as other methods based on heavy human supervision (ImageNet).
|
|
|
J. Mauri and 14 others. 2000. Moviment del vas en l anàlisi d imatges d ecografia intracoronària: un model matemàtic. Congrés de la Societat Catalana de Cardiologia..
|
|
|
J. Mauri and 14 others. 2000. Avaluació del Conjunt Stent/Artèria mitjançant ecografia intracoronària: lentorn informàtic. Congrés de la Societat Catalana de Cardiologia..
|
|
|
David Vazquez, Jiaolong Xu, Sebastian Ramos, Antonio Lopez and Daniel Ponsa. 2013. Weakly Supervised Automatic Annotation of Pedestrian Bounding Boxes. CVPR Workshop on Ground Truth – What is a good dataset?. IEEE, 706–711.
Abstract: Among the components of a pedestrian detector, its trained pedestrian classifier is crucial for achieving the desired performance. The initial task of the training process consists in collecting samples of pedestrians and background, which involves tiresome manual annotation of pedestrian bounding boxes (BBs). Thus, recent works have assessed the use of automatically collected samples from photo-realistic virtual worlds. However, learning from virtual-world samples and testing in real-world images may suffer the dataset shift problem. Accordingly, in this paper we assess an strategy to collect samples from the real world and retrain with them, thus avoiding the dataset shift, but in such a way that no BBs of real-world pedestrians have to be provided. In particular, we train a pedestrian classifier based on virtual-world samples (no human annotation required). Then, using such a classifier we collect pedestrian samples from real-world images by detection. After, a human oracle rejects the false detections efficiently (weak annotation). Finally, a new classifier is trained with the accepted detections. We show that this classifier is competitive with respect to the counterpart trained with samples collected by manually annotating hundreds of pedestrian BBs.
Keywords: Pedestrian Detection; Domain Adaptation
|
|