Records |
Author |
Yainuvis Socarras |
Title |
Image segmentation for improving pedestrian detection |
Type |
Report |
Year |
2011 |
Publication |
CVC Technical Report |
Abbreviated Journal |
|
Volume |
167 |
Issue |
|
Pages |
|
Keywords |
|
Abstract |
|
Address |
Bellaterra (Spain) |
Corporate Author |
Computer Vision Center |
Thesis |
Master's thesis |
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
ADAS; |
Approved |
no |
Call Number |
Admin @ si @ Soc2011 |
Serial |
1933 |
Permanent link to this record |
|
|
|
Author |
Maria del Camp Davesa |
Title |
Human action categorization in image sequences |
Type |
Report |
Year |
2011 |
Publication |
CVC Technical Report |
Abbreviated Journal |
|
Volume |
169 |
Issue |
|
Pages |
|
Keywords |
|
Abstract |
|
Address |
Bellaterra (Spain) |
Corporate Author |
Computer Vision Center |
Thesis |
Master's thesis |
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
CiC;CIC |
Approved |
no |
Call Number |
Admin @ si @ Dav2011 |
Serial |
1934 |
Permanent link to this record |
|
|
|
Author |
Marçal Rusiñol; R.Roset; Josep Llados; C.Montaner |
Title |
Automatic Index Generation of Digitized Map Series by Coordinate Extraction and Interpretation |
Type |
Conference Article |
Year |
2011 |
Publication |
In Proceedings of the Sixth International Workshop on Digital Technologies in Cartographic Heritage |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
|
Keywords |
|
Abstract |
|
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
CartoHerit |
Notes |
DAG |
Approved |
no |
Call Number |
Admin @ si @ RRL2011b |
Serial |
1978 |
Permanent link to this record |
|
|
|
Author |
Sergio Vera; Debora Gil; Agnes Borras; F. Javier Sanchez; Frederic Perez; Marius G. Linguraru |
Title |
Computation and Evaluation of Medial Surfaces for Shape Representation of Abdominal Organs |
Type |
Conference Article |
Year |
2011 |
Publication |
Workshop on Computational and Clinical Applications in Abdominal Imaging |
Abbreviated Journal |
|
Volume |
7029 |
Issue |
|
Pages |
223-230 |
Keywords |
|
Abstract |
Medial representations are powerful tools for describing and parameterizing the volumetric shape of anatomical structures. Existing methods show excellent results when applied to 2D objects, but their quality drops across dimensions. This paper contributes to the computation of medial manifolds in two aspects. First, we provide a standard scheme for the computation of medial manifolds that avoids degenerate medial axis segments; second, we introduce an energy-based method which performs independently of the dimension. We quantitatively evaluate the performance of our method with respect to existing approaches by applying them to synthetic shapes of known medial geometry. Finally, we show results on shape representation of multiple abdominal organs, exploring the use of medial manifolds for the representation of multi-organ relations. |
Address |
Nice, France |
Corporate Author |
|
Thesis |
|
Publisher |
Springer Berlin Heidelberg |
Place of Publication |
|
Editor |
In H. Yoshida et al |
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
ABDI |
Notes |
IAM; MV |
Approved |
no |
Call Number |
VGB2011 |
Serial |
2036 |
Permanent link to this record |
|
|
|
Author |
Jaime Moreno; Xavier Otazu |
Title |
Image coder based on Hilbert scanning of embedded quadTrees |
Type |
Conference Article |
Year |
2011 |
Publication |
Data Compression Conference |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
470-470 |
Keywords |
|
Abstract |
In this work we present an effective and computationally simple algorithm for image compression based on Hilbert Scanning of Embedded quadTrees (Hi-SET). It represents an image as an embedded bitstream along a fractal function. Embedding is an important feature of modern image compression algorithms; Salomon [1, pg. 614] notes that another feature, and perhaps a unique one, is that it achieves the best quality for the number of bits input by the decoder at any point during decoding. Hi-SET also possesses this latter feature. Furthermore, the coder is based on a quadtree partition strategy which, applied to image transformation structures such as the discrete cosine or wavelet transform, yields energy clustering in both frequency and space. The coding algorithm is composed of three general steps and uses just a list of significant pixels. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
DCC |
Notes |
CIC |
Approved |
no |
Call Number |
Admin @ si @ MoO2011b |
Serial |
2177 |
Permanent link to this record |
|
|
|
Author |
Mirko Arnold; Stephan Ameling; Anarta Ghosh; Gerard Lacey |
Title |
Quality Improvement of Endoscopy Videos |
Type |
Conference Article |
Year |
2011 |
Publication |
Proceedings of the 8th IASTED International Conference on Biomedical Engineering |
Abbreviated Journal |
|
Volume |
723 |
Issue |
|
Pages |
|
Keywords |
|
Abstract |
|
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
800 |
Expedition |
|
Conference |
|
Notes |
MV |
Approved |
no |
Call Number |
fernando @ fernando @ |
Serial |
2426 |
Permanent link to this record |
|
|
|
Author |
Victor Ponce; Mario Gorga; Xavier Baro; Petia Radeva; Sergio Escalera |
Title |
Análisis de la expresión oral y gestual en proyectos fin de carrera vía un sistema de visión artificial |
Type |
Journal Article |
Year |
2011 |
Publication |
ReVisión |
Abbreviated Journal |
|
Volume |
4 |
Issue |
1 |
Pages |
|
Keywords |
|
Abstract |
Oral communication and expression is a competence of special relevance in the European Higher Education Area (EHEA). However, in many higher-education programmes the practice of this competence has been relegated mainly to the presentation of final-year projects. Within a teaching innovation project, a software tool has been developed to extract objective information for analysing the oral and gestural expression of students. The goal is to give students feedback that allows them to improve the quality of their presentations. The initial prototype presented in this work automatically extracts audiovisual information and analyses it using learning techniques. The system has been applied to 15 final-year projects and 15 presentations within a fourth-year course. The results obtained show the viability of the system for suggesting factors that help both in the success of the communication and in the evaluation criteria. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
1989-1199 |
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
HuPBA; MILAB;MV |
Approved |
no |
Call Number |
Admin @ si @ PGB2011d |
Serial |
2514 |
Permanent link to this record |
|
|
|
Author |
Jon Almazan; Alicia Fornes; Ernest Valveny |
Title |
A Non-Rigid Feature Extraction Method for Shape Recognition |
Type |
Conference Article |
Year |
2011 |
Publication |
11th International Conference on Document Analysis and Recognition |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
987-991 |
Keywords |
|
Abstract |
This paper presents a methodology for shape recognition that focuses on dealing with the difficult problem of large deformations. The proposed methodology consists of a novel feature extraction technique, which uses a non-rigid representation adaptable to the shape. This technique employs a deformable grid based on the computation of geometrical centroids that follows a region partitioning algorithm. Then, a feature vector is extracted by computing pixel density measures around these geometrical centroids. The result is a shape descriptor that adapts its representation to the given shape and encodes the pixel density distribution. The validity of the method when dealing with large deformations has been experimentally shown over datasets composed of handwritten shapes. It has been applied to signature verification and shape recognition tasks, demonstrating high accuracy and low computational cost. |
Address |
Beijing; China; September 2011 |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
978-0-7695-4520-2 |
Medium |
|
Area |
|
Expedition |
|
Conference |
ICDAR |
Notes |
DAG |
Approved |
no |
Call Number |
Admin @ si @ AFV2011 |
Serial |
1763 |
Permanent link to this record |
|
|
|
Author |
Lluis Pere de las Heras; Joan Mas; Gemma Sanchez; Ernest Valveny |
Title |
Wall Patch-Based Segmentation in Architectural Floorplans |
Type |
Conference Article |
Year |
2011 |
Publication |
11th International Conference on Document Analysis and Recognition |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
1270-1274 |
Keywords |
|
Abstract |
Segmentation of architectural floor plans is a challenging task, mainly because of the large variability in the notation between different plans. In general, traditional techniques, usually based on analyzing and grouping structural primitives obtained by vectorization, are only able to handle a reduced range of similar notations. In this paper we propose an alternative patch-based segmentation approach working at pixel level, without the need for vectorization. The image is divided into a set of patches and a set of features is extracted for every patch. Then, each patch is assigned to a visual word of a previously learned vocabulary and given a probability of belonging to each class of objects. Finally, a post-process assigns the final label for every pixel. This approach has been applied to the detection of walls on two datasets of architectural floor plans with different notations, achieving high accuracy rates. |
Address |
Beijing, China |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
1520-5363 |
ISBN |
978-0-7695-4520-2 |
Medium |
|
Area |
|
Expedition |
|
Conference |
ICDAR |
Notes |
DAG |
Approved |
no |
Call Number |
Admin @ si @ HMS2011a |
Serial |
1792 |
Permanent link to this record |
|
|
|
Author |
Alicia Fornes; Anjan Dutta; Albert Gordo; Josep Llados |
Title |
The ICDAR 2011 Music Scores Competition: Staff Removal and Writer Identification |
Type |
Conference Article |
Year |
2011 |
Publication |
11th International Conference on Document Analysis and Recognition |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
1511-1515 |
Keywords |
|
Abstract |
In recent years, there has been a growing interest in the analysis of handwritten music scores. Our goal has been to foster this interest by proposing two different competitions: Staff Removal and Writer Identification. Both competitions have been tested on the CVC-MUSCIMA database: a ground-truth of handwritten music score images. This paper describes the competition details, including the dataset and ground-truth, the evaluation metrics, and a short description of the participants, their methods, and the obtained results. |
Address |
Beijing, China |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
978-0-7695-4520-2 |
Medium |
|
Area |
|
Expedition |
|
Conference |
ICDAR |
Notes |
DAG |
Approved |
no |
Call Number |
Admin @ si @ FDG2011b |
Serial |
1794 |
Permanent link to this record |
|
|
|
Author |
Nataliya Shapovalova; Carles Fernandez; Xavier Roca; Jordi Gonzalez |
Title |
Semantics of Human Behavior in Image Sequences |
Type |
Book Chapter |
Year |
2011 |
Publication |
Computer Analysis of Human Behavior |
Abbreviated Journal |
|
Volume |
|
Issue |
7 |
Pages |
151-182 |
Keywords |
|
Abstract |
Human behavior is contextualized and understanding the scene of an action is crucial for giving proper semantics to behavior. In this chapter we present a novel approach for scene understanding. The emphasis of this work is on the particular case of Human Event Understanding. We introduce a new taxonomy to organize the different semantic levels of the Human Event Understanding framework proposed. Such a framework particularly contributes to the scene understanding domain by (i) extracting behavioral patterns from the integrative analysis of spatial, temporal, and contextual evidence and (ii) integrative analysis of bottom-up and top-down approaches in Human Event Understanding. We will explore how the information about interactions between humans and their environment influences the performance of activity recognition, and how this can be extrapolated to the temporal domain in order to extract higher inferences from human events observed in sequences of images. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
Springer London |
Place of Publication |
|
Editor |
Albert Ali Salah; |
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
978-0-85729-993-2 |
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
ISE |
Approved |
no |
Call Number |
Admin @ si @ SFR2011 |
Serial |
1810 |
Permanent link to this record |
|
|
|
Author |
Murad Al Haj; Carles Fernandez; Zhanwu Xiong; Ivan Huerta; Jordi Gonzalez; Xavier Roca |
Title |
Beyond the Static Camera: Issues and Trends in Active Vision |
Type |
Book Chapter |
Year |
2011 |
Publication |
Visual Analysis of Humans: Looking at People |
Abbreviated Journal |
|
Volume |
|
Issue |
2 |
Pages |
11-30 |
Keywords |
|
Abstract |
Maximizing both the area coverage and the resolution per target is highly desirable in many applications of computer vision. However, with a limited number of cameras viewing a scene, the two objectives are contradictory. This chapter is dedicated to active vision systems, trying to achieve a trade-off between these two aims and examining the use of high-level reasoning in such scenarios. The chapter starts by introducing different approaches to active camera configurations. Later, a single active camera system to track a moving object is developed, offering the reader first-hand understanding of the issues involved. Another section discusses practical considerations in building an active vision platform, taking as an example a multi-camera system developed for a European project. The last section of the chapter reflects upon the future trends of using semantic factors to drive smartly coordinated active systems. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
Springer London |
Place of Publication |
|
Editor |
Th.B. Moeslund; A. Hilton; V. Krüger; L. Sigal |
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
978-0-85729-996-3 |
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
ISE |
Approved |
no |
Call Number |
Admin @ si @ AFX2011 |
Serial |
1814 |
Permanent link to this record |
|
|
|
Author |
Mario Rojas; David Masip; Jordi Vitria |
Title |
Predicting Dominance Judgements Automatically: A Machine Learning Approach. |
Type |
Conference Article |
Year |
2011 |
Publication |
IEEE International Workshop on Social Behavior Analysis |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
939-944 |
Keywords |
|
Abstract |
The number of multimodal devices that surround us is growing every day. In this context, human interaction and communication have become a focus of attention and a hot topic of research. A crucial element in human relations is the evaluation of individuals with respect to facial traits, which is called a first impression. Studies based on appearance have suggested that personality can be expressed by appearance and the observer may use such information to form judgments. In the context of rapid facial evaluation, certain personality traits seem to have a more pronounced effect on the relations and perceptions inside groups. The perception of dominance has been shown to be an active part of social roles at different stages of life, and even to play a part in mate selection. The aim of this paper is to study to what extent this information is learnable from the point of view of computer science. Specifically, we intend to determine if judgments of dominance can be learned by machine learning techniques. We implement two different descriptors in order to assess this. The first is the histogram of oriented gradients (HOG), and the second is a probabilistic appearance descriptor based on the frequencies of grouped binary tests. State-of-the-art classification rules validate the performance of both descriptors with respect to the prediction task. Experimental results show that machine learning techniques can predict judgments of dominance rather accurately (accuracies up to 90%) and that the HOG descriptor may appropriately characterize the information necessary for such a task. |
Address |
Santa Barbara, CA |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
978-1-4244-9140-7 |
Medium |
|
Area |
|
Expedition |
|
Conference |
SBA |
Notes |
OR;MV |
Approved |
no |
Call Number |
Admin @ si @ RMV2011b |
Serial |
1760 |
Permanent link to this record |
|
|
|
Author |
Sergio Escalera; Xavier Baro; Oriol Pujol; Jordi Vitria; Petia Radeva |
Title |
Traffic-Sign Recognition Systems |
Type |
Book Whole |
Year |
2011 |
Publication |
SpringerBriefs in Computer Science |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
5-13 |
Keywords |
|
Abstract |
|
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
Springer London |
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
978-1-4471-2244-9 |
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
MILAB; OR;HuPBA;MV |
Approved |
no |
Call Number |
Admin @ si @ EBP2011 |
Serial |
1801 |
Permanent link to this record |
|
|
|
Author |
David Vazquez; Antonio Lopez; Daniel Ponsa; Javier Marin |
Title |
Virtual Worlds and Active Learning for Human Detection |
Type |
Conference Article |
Year |
2011 |
Publication |
13th International Conference on Multimodal Interaction |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
393-400 |
Keywords |
Pedestrian Detection; Human detection; Virtual; Domain Adaptation; Active Learning |
Abstract |
Image-based human detection is of paramount interest due to its potential applications in fields such as advanced driving assistance, surveillance and media analysis. However, even detecting non-occluded standing humans remains a challenge of intensive research. The most promising human detectors rely on classifiers developed in the discriminative paradigm, i.e., trained with labelled samples. However, labelling is a manually intensive step, especially in cases like human detection where it is necessary to provide at least bounding boxes framing the humans for training. To overcome this problem, some authors have proposed the use of a virtual world where the labels of the different objects are obtained automatically. This means that the human models (classifiers) are learnt using the appearance of rendered images, i.e., using realistic computer graphics. Later, these models are used for human detection in images of the real world. The results of this technique are surprisingly good. However, these are not always as good as the classical approach of training and testing with data coming from the same camera, or similar ones. Accordingly, in this paper we address the challenge of using a virtual world for gathering (while playing a videogame) a large amount of automatically labelled samples (virtual humans and background) and then training a classifier that performs, in real-world images, as well as the one obtained by equally training from manually labelled real-world samples. To do that, we cast the problem as one of domain adaptation. In doing so, we assume that a small amount of manually labelled samples from real-world images is required. To collect these labelled samples we propose a non-standard active learning technique. Therefore, ultimately our human model is learnt by the combination of virtual and real-world labelled samples (Fig. 1), which has not been done before. We present quantitative results showing that this approach is valid. |
Address |
Alicante, Spain |
Corporate Author |
|
Thesis |
|
Publisher |
ACM DL |
Place of Publication |
New York, NY, USA |
Editor |
|
Language |
English |
Summary Language |
English |
Original Title |
Virtual Worlds and Active Learning for Human Detection |
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
978-1-4503-0641-6 |
Medium |
|
Area |
|
Expedition |
|
Conference |
ICMI |
Notes |
ADAS |
Approved |
yes |
Call Number |
ADAS @ adas @ VLP2011a |
Serial |
1683 |
Permanent link to this record |