|
Arnau Ramisa, Ramon Lopez de Mantaras and Ricardo Toledo. 2007. Comparing Combinations of Feature Regions for Panoramic VSLAM. 4th International Conference on Informatics in Control, Automation and Robotics. 292–297.
|
|
|
Miguel Oliveira, Angel Sappa and V. Santos. 2012. Color Correction for Onboard Multi-camera Systems using 3D Gaussian Mixture Models. IEEE Intelligent Vehicles Symposium. IEEE Xplore, 299–303.
Abstract: The current paper proposes a novel color correction approach for onboard multi-camera systems. It works by segmenting the given images into several regions. A probabilistic segmentation framework, using 3D Gaussian Mixture Models, is proposed. Regions are used to compute local color correction functions, which are then combined to obtain the final corrected image. An image data set of road scenarios is used to establish a performance comparison of the proposed method with seven other well-known color correction algorithms. Results show that the proposed approach is the highest scoring color correction method. Also, the proposed single-step 3D color space probabilistic segmentation reduces processing time over similar approaches.
|
|
|
Marçal Rusiñol, David Aldavert, Dimosthenis Karatzas, Ricardo Toledo and Josep Llados. 2011. Interactive Trademark Image Retrieval by Fusing Semantic and Visual Content. Advances in Information Retrieval. In P. Clough and 6 others, eds. 33rd European Conference on Information Retrieval. Berlin, Springer, 314–325. (LNCS.)
Abstract: In this paper we propose an efficient query-by-example retrieval system which is able to retrieve trademark images by similarity from patent and trademark offices' digital libraries. Logo images are described by both their semantic content, by means of the Vienna codes, and their visual contents, by using shape and color as visual cues. The trademark descriptors are then indexed by a locality-sensitive hashing data structure aiming to perform approximate k-NN search in high-dimensional spaces in sub-linear time. The resulting ranked lists are combined by using the Condorcet method, and a relevance feedback step helps to iteratively revise the query and refine the obtained results. The experiments demonstrate the effectiveness and efficiency of this system on a realistic and large dataset.
|
|
|
Carme Julia, Angel Sappa, Felipe Lumbreras and Antonio Lopez. 2008. Recovery of Surface Normals and Reflectance from Different Lighting Conditions. 5th International Conference on Image Analysis and Recognition. 315–325. (LNCS.)
|
|
|
Fernando Barrera, Felipe Lumbreras and Angel Sappa. 2012. Evaluation of Similarity Functions in Multimodal Stereo. 9th International Conference on Image Analysis and Recognition. Springer Berlin Heidelberg, 320–329. (LNCS.)
Abstract: This paper presents an evaluation framework for multimodal stereo matching, which allows the performance of four similarity functions to be compared. Additionally, it presents details of a multimodal stereo head that supplies thermal infrared and color images, as well as aspects of its calibration and rectification. The pipeline includes a novel method for the disparity selection, which is suitable for evaluating the similarity functions. Finally, a benchmark for comparing different initializations of the proposed framework is presented. Similarity functions are based on mutual information, gradient orientation and scale space representations. Their evaluation is performed using two metrics: i) disparity error, and ii) number of correct matches on planar regions. In addition to the proposed evaluation, the current paper also shows that 3D sparse representations can be recovered from such a multimodal stereo head.
Keywords: Aveiro, Portugal
|
|
|
David Aldavert, Arnau Ramisa, Ramon Lopez de Mantaras and Ricardo Toledo. 2010. Real-time Object Segmentation using a Bag of Features Approach. In R. Alquezar, A.M. and J. Aguilar, eds. 13th International Conference of the Catalan Association for Artificial Intelligence. IOS Press Amsterdam, 321–329.
Abstract: In this paper, we propose an object segmentation framework, based on the popular bag of features (BoF), which can process several images per second while achieving good segmentation accuracy, assigning an object category to every pixel of the image. We propose an efficient color descriptor to complement the information obtained by a typical gradient-based local descriptor. Results show that color proves to be a useful cue to increase the segmentation accuracy, especially in large homogeneous regions. Then, we extend the Hierarchical K-Means codebook using the recently proposed Vector of Locally Aggregated Descriptors method. Finally, we show that the BoF method can be easily parallelized since it is applied locally, so the time necessary to process an image is further reduced. The performance of the proposed method is evaluated on the standard PASCAL 2007 Segmentation Challenge object segmentation dataset.
Keywords: Object Segmentation; Bag Of Features; Feature Quantization; Densely sampled descriptors
|
|
|
Diego Cheda, Daniel Ponsa and Antonio Lopez. 2012. Monocular Depth-based Background Estimation. 7th International Conference on Computer Vision Theory and Applications. 323–328.
Abstract: In this paper, we address the problem of reconstructing the background of a scene from a video sequence with occluding objects. The images are taken by hand-held cameras. Our method composes the background by selecting the appropriate pixels from previously aligned input images. To do that, we minimize a cost function that penalizes the deviations from the following assumptions: background represents objects whose distance to the camera is maximal, and background objects are stationary. Distance information is roughly obtained by a supervised learning approach that allows us to distinguish between close and distant image regions. Moving foreground objects are filtered out by using stationariness and motion boundary constancy measurements. The cost function is minimized by a graph cuts method. We demonstrate the applicability of our approach to recover an occlusion-free background in a set of sequences.
|
|
|
Dennis G. Romero, Anselmo Frizera, Angel Sappa, Boris X. Vintimilla and Teodiano F. Bastos. 2015. A predictive model for human activity recognition by observing actions and context. Advanced Concepts for Intelligent Vision Systems, Proceedings of 16th International Conference, ACIVS 2015. Springer International Publishing, 323–333. (LNCS.)
Abstract: This paper presents a novel model to estimate human activities, where a human activity is defined by a set of human actions. The proposed approach is based on the usage of Recurrent Neural Networks (RNN) and Bayesian inference through the continuous monitoring of human actions and their surrounding environment. In the current work human activities are inferred considering not only visual analysis but also additional resources; external sources of information, such as context information, are incorporated to contribute to the activity estimation. The novelty of the proposed approach lies in the way the information is encoded, so that it can be later associated according to a predefined semantic structure. Hence, a pattern representing a given activity can be defined by a set of actions, plus contextual information or other kinds of information that could be relevant to describe the activity. Experimental results with real data are provided showing the validity of the proposed approach.
|
|
|
Angel Sappa, Niki Aifanti, Sotiris Malassiotis and Michael G. Strintzis. 2003. Monocular 3D Human Body Reconstruction Towards Depth Augmentation of Television Sequences. IEEE International Conference on Image Processing, Barcelona, Spain, September 2003. 325–328.
|
|
|
Alex Goldhoorn, Arnau Ramisa, Ramon Lopez de Mantaras and Ricardo Toledo. 2007. Using the Average Landmark Vector Method for Robot Homing. Artificial Intelligence Research and Development, Proceedings of the 10th International Conference of the ACIA. 331–338.
|
|