|
Jaume Amores, David Geronimo and Antonio Lopez. 2010. Multiple instance and active learning for weakly-supervised object-class segmentation. 3rd IEEE International Conference on Machine Vision.
Abstract: In object-class segmentation, one of the most tedious tasks is to manually segment many object examples in order to learn a model of the object category. Yet, there has been little research on reducing the degree of manual annotation for
object-class segmentation. In this work we explore alternative strategies which do not require full manual segmentation of the object in the training set. In particular, we study the use of bounding boxes as a coarser and much cheaper form of segmentation and we perform a comparative study of several Multiple-Instance Learning techniques that allow to obtain a model with this type of weak annotation. We show that some of these methods can be competitive, when used with coarse
segmentations, with methods that require full manual segmentation of the objects. Furthermore, we show how to use active learning combined with this weakly supervised strategy.
As we see, this strategy permits to reduce the amount of annotation and optimize the number of examples that require full manual segmentation in the training set.
Keywords: Multiple Instance Learning; Active Learning; Object-class segmentation.
|
|
|
Daniel Ponsa and Antonio Lopez. 2007. Vehicle Trajectory Estimation based on Monocular Vision. 3rd Iberian Conference on Pattern Recognition and Image Analysis, LNCS 4477.587–594.
Keywords: vehicle detection
|
|
|
Daniel Ponsa and Antonio Lopez. 2007. Feature Selection Based on a New Formulation of the Minimal-Redundancy-Maximal-Relevance Criterion. 3rd Iberian Conference on Pattern Recognition and Image Analysis, LNCS 4477.47–54.
|
|
|
David Geronimo, Antonio Lopez, Daniel Ponsa and Angel Sappa. 2007. Haar Wavelets and Edge Orientation Histograms for On-Board Pedestrian Detection. In J. Marti et al., ed. 3rd Iberian Conference on Pattern Recognition and Image Analysis, LNCS 4477.418–425.
Keywords: Pedestrian detection
|
|
|
David Geronimo, Antonio Lopez and Angel Sappa. 2007. Computer Vision Approaches for Pedestrian Detection: Visible Spectrum Survey. In J. Marti et al., ed. 3rd Iberian Conference on Pattern Recognition and Image Analysis, LNCS 4477.547–554.
Abstract: Pedestrian detection from images of the visible spectrum is a high relevant area of research given its potential impact in the design of pedestrian protection systems. There are many proposals in the literature but they lack a comparative viewpoint. According to this, in this paper we first propose a common framework where we fit the different approaches, and second we use this framework to provide a comparative point of view of the details of such different approaches, pointing out also the main challenges to be solved in the future. In summary, we expect
this survey to be useful for both novel and experienced researchers in the field. In the first case, as a clarifying snapshot of the state of the art; in the second, as a way to unveil trends and to take conclusions from the comparative study.
Keywords: Pedestrian detection
|
|
|
Jose Manuel Alvarez, Antonio Lopez and Ramon Baldrich. 2007. Shadow Resistant Road Segmentation from a Mobile Monocular System. 3rd Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2007), J. Marti et al. (Eds.) LNCS 4477:9–16.
|
|
|
Joan Serrat, Ferran Diego, Felipe Lumbreras and Jose Manuel Alvarez. 2007. Synchronization of Video Sequences from Free-moving Cameras. In J. Marti et al., ed. 3rd Iberian Conference on Pattern Recognition and Image Analysis.620–627. (LNCS.)
|
|
|
Antonio Lopez, Joan Serrat, Cristina Cañero and Felipe Lumbreras. 2007. Robust Lane Lines Detection and Quantitative Assessment. In J. Marti et al, ed. 3rd Iberian Conference on Pattern Recognition and Image Analysis.274–281. (LNCS.)
|
|
|
Lorenzo Porzi, Markus Hofinger, Idoia Ruiz, Joan Serrat, Samuel Rota Bulo and Peter Kontschieder. 2020. Learning Multi-Object Tracking and Segmentation from Automatic Annotations. 33rd IEEE Conference on Computer Vision and Pattern Recognition.6845–6854.
Abstract: In this work we contribute a novel pipeline to automatically generate training data, and to improve over state-of-the-art multi-object tracking and segmentation (MOTS) methods. Our proposed track mining algorithm turns raw street-level videos into high-fidelity MOTS training data, is scalable and overcomes the need of expensive and time-consuming manual annotation approaches. We leverage state-of-the-art instance segmentation results in combination with optical flow predictions, also trained on automatically harvested training data. Our second major contribution is MOTSNet – a deep learning, tracking-by-detection architecture for MOTS – deploying a novel mask-pooling layer for improved object association over time. Training MOTSNet with our automatically extracted data leads to significantly improved sMOTSA scores on the novel KITTI MOTS dataset (+1.9%/+7.5% on cars/pedestrians), and MOTSNet improves by +4.1% over previously best methods on the MOTSChallenge dataset. Our most impressive finding is that we can improve over previous best-performing works, even in complete absence of manually annotated MOTS training data.
|
|
|
Marçal Rusiñol, David Aldavert, Dimosthenis Karatzas, Ricardo Toledo and Josep Llados. 2011. Interactive Trademark Image Retrieval by Fusing Semantic and Visual Content. Advances in Information Retrieval. In P. Clough and 6 others, eds. 33rd European Conference on Information Retrieval. Berlin, Springer, 314–325. (LNCS.)
Abstract: In this paper we propose an efficient queried-by-example retrieval system which is able to retrieve trademark images by similarity from patent and trademark offices' digital libraries. Logo images are described by both their semantic content, by means of the Vienna codes, and their visual contents, by using shape and color as visual cues. The trademark descriptors are then indexed by a locality-sensitive hashing data structure aiming to perform approximate k-NN search in high dimensional spaces in sub-linear time. The resulting ranked lists are combined by using the Condorcet method and a relevance feedback step helps to iteratively revise the query and refine the obtained results. The experiments demonstrate the effectiveness and efficiency of this system on a realistic and large dataset.
|
|