Publicacions CVC -- Query Results

Joan Mas, Gemma Sanchez, & Josep Llados. (2006). An Incremental Parser to Recognize Diagram Symbols and Gestures represented by Adjacency Grammars. http://refbase.cvc.uab.es/show.php?record=711
Oriol Ramos Terrades. (2006). Linear Combination of Multiresolution Descriptors: Application to Graphics Recognition (Salvatore Antoine Tabbone, & Ernest Valveny, Eds.). Ph.D. thesis, , . http://refbase.cvc.uab.es/show.php?record=713
Fernando Vilariño. (2006). A Machine Learning Approach for Intestinal Motility Assessment with Capsule Endoscopy (Petia Radeva, Ed.). Ph.D. thesis, , . Abstract: Intestinal motility assessment with video capsule endoscopy arises as a novel and challenging clinical fieldwork. This technique is based on the analysis of the patterns of intestinal contractions obtained by labelling all the motility events present in a video provided by a capsule with a wireless micro-camera, which is ingested by the patient. However, the visual analysis of these video sequences presents several im- portant drawbacks, mainly related to both the large amount of time needed for the visualization process, and the low prevalence of intestinal contractions in video. In this work we propose a machine learning system to automatically detect the intestinal contractions in video capsule endoscopy, driving a very useful but not fea- sible clinical routine into a feasible clinical procedure. Our proposal is divided into two different parts: The first part tackles the problem of the automatic detection of phasic contractions in capsule endoscopy videos. Phasic contractions are dynamic events spanning about 4-5 seconds, which show visual patterns with a high variability. Our proposal is based on a sequential design which involves the analysis of textural, color and blob features with powerful classifiers such as SVM. This approach appears to cope with two basic aims: the reduction of the imbalance rate of the data set, and the modular construction of the system, which adds the capability of including domain knowledge as new stages in the cascade. The second part of the current work tackles the problem of the automatic detection of tonic contractions. Tonic contrac- tions manifest in capsule endoscopy as a sustained pattern of the folds and wrinkles of the intestine, which may be prolonged for an undetermined span of time. Our proposal is based on the analysis of the wrinkle patterns, presenting a comparative study of diverse features and classification methods, and providing a set of appro- priate descriptors for their characterization. We provide a detailed analysis of the performance achieved by our system both in a qualitative and a quantitative way. http://refbase.cvc.uab.es/show.php?record=738
Josep Llados. (2006). Computer Vision: Progress of Research and Development ( J. Llados(ed.), Ed.). http://refbase.cvc.uab.es/show.php?record=766
F. Pla, Petia Radeva, & Jordi Vitria. (2006). Pattern Recognition: Progress, Directions and Applications. http://refbase.cvc.uab.es/show.php?record=771
Josep Llados, W. Liu, & Jean-Marc Ogier. (2007). Seventh IAPR International Workshop on Graphics Recognition GREC 2007. http://refbase.cvc.uab.es/show.php?record=835
Jordi Gonzalez, & Thomas B. Moeslund. (2008). Tracking Humans for the Evaluation of their Motion in Image Sequences. http://refbase.cvc.uab.es/show.php?record=1002
Juan J. Villanueva. (2008). Visualization, Imaging, and Image Processing,. http://refbase.cvc.uab.es/show.php?record=1003
Aymen Azaza. (2018). Context, Motion and Semantic Information for Computational Saliency (Joost Van de Weijer, & Ali Douik, Eds.). Ph.D. thesis, Ediciones Graficas Rey, . Abstract: The main objective of this thesis is to highlight the salient object in an image or in a video sequence. We address three important—but in our opinion insufficiently investigated—aspects of saliency detection. Firstly, we start by extending previous research on saliency which explicitly models the information provided from the context. Then, we show the importance of explicit context modelling for saliency estimation. Several important works in saliency are based on the usage of object proposals. However, these methods focus on the saliency of the object proposal itself and ignore the context. To introduce context in such saliency approaches, we couple every object proposal with its direct context. This allows us to evaluate the importance of the immediate surround (context) for its saliency. We propose several saliency features which are computed from the context proposals including features based on omni-directional and horizontal context continuity. Secondly, we investigate the usage of top-downmethods (high-level semantic information) for the task of saliency prediction since most computational methods are bottom-up or only include few semantic classes. We propose to consider a wider group of object classes. These objects represent important semantic information which we will exploit in our saliency prediction approach. Thirdly, we develop a method to detect video saliency by computing saliency from supervoxels and optical flow. In addition, we apply the context features developed in this thesis for video saliency detection. The method combines shape and motion features with our proposed context features. To summarize, we prove that extending object proposals with their direct context improves the task of saliency detection in both image and video data. Also the importance of the semantic information in saliency estimation is evaluated. Finally, we propose a newmotion feature to detect saliency in video data. The three proposed novelties are evaluated on standard saliency benchmark datasets and are shown to improve with respect to state-of-the-art. http://refbase.cvc.uab.es/show.php?record=3218
Alfons Juan-Ciscar, & Gemma Sanchez. (2008). PRIS 2008. Pattern Recognition in Information Systems. Proceedings of the 8th international Workshop on Pattern Recognition in Information systems – PRIS 2008, in conjunction with ICEIS 2008. http://refbase.cvc.uab.es/show.php?record=1054
Miquel Ferrer. (2008). Theory and Algorithms on the Median Graph. Application to Graph-based Classification and Clustering (Francesc Serratosa Casanelles, & Ernest Valveny, Eds.). Ph.D. thesis, , . http://refbase.cvc.uab.es/show.php?record=1105
Daniel Ponsa. (2007). Model-Based Visual Localisation of Contours and Vehicles (Antonio Lopez, & Xavier Roca, Eds.). Ph.D. thesis, Ediciones Graficas Rey, . Keywords: Phd Thesis http://refbase.cvc.uab.es/show.php?record=1107
Robert Benavente. (2007). A Parametric Model for Computational Colour Naming (Maria Vanrell, Ed.). Ph.D. thesis, Ediciones Graficas Rey, . Keywords: PhD Thesis http://refbase.cvc.uab.es/show.php?record=1108
Robert Benavente, Laura Igual, & Fernando Vilariño. (2008). Current Challenges in Computer Vision. http://refbase.cvc.uab.es/show.php?record=1110
Pau Baiget. (2009). Modeling Human Behavior for Image Sequence Understanding and Generation (Jordi Gonzalez, & Xavier Roca, Eds.). Ph.D. thesis, Ediciones Graficas Rey, . Abstract: The comprehension of animal behavior, especially human behavior, is one of the most ancient and studied problems since the beginning of civilization. The big list of factors that interact to determine a person action require the collaboration of different disciplines, such as psichology, biology, or sociology. In the last years the analysis of human behavior has received great attention also from the computer vision community, given the latest advances in the acquisition of human motion data from image sequences. Despite the increasing availability of that data, there still exists a gap towards obtaining a conceptual representation of the obtained observations. Human behavior analysis is based on a qualitative interpretation of the results, and therefore the assignment of concepts to quantitative data is linked to a certain ambiguity. This Thesis tackles the problem of obtaining a proper representation of human behavior in the contexts of computer vision and animation. On the one hand, a good behavior model should permit the recognition and explanation the observed activity in image sequences. On the other hand, such a model must allow the generation of new synthetic instances, which model the behavior of virtual agents. First, we propose methods to automatically learn the models from observations. Given a set of quantitative results output by a vision system, a normal behavior model is learnt. This results provides a tool to determine the normality or abnormality of future observations. However, machine learning methods are unable to provide a richer description of the observations. We confront this problem by means of a new method that incorporates prior knowledge about the enviornment and about the expected behaviors. This framework, formed by the reasoning engine FMTL and the modeling tool SGT allows the generation of conceptual descriptions of activity in new image sequences. Finally, we demonstrate the suitability of the proposed framework to simulate behavior of virtual agents, which are introduced into real image sequences and interact with observed real agents, thereby easing the generation of augmented reality sequences. The set of approaches presented in this Thesis has a growing set of potential applications. The analysis and description of behavior in image sequences has its principal application in the domain of smart video--surveillance, in order to detect suspicious or dangerous behaviors. Other applications include automatic sport commentaries, elderly monitoring, road traffic analysis, and the development of semantic video search engines. Alternatively, behavioral virtual agents allow to simulate accurate real situations, such as fires or crowds. Moreover, the inclusion of virtual agents into real image sequences has been widely deployed in the games and cinema industries. http://refbase.cvc.uab.es/show.php?record=1210

Joan Mas, Gemma Sanchez, & Josep Llados. (2006). An Incremental Parser to Recognize Diagram Symbols and Gestures represented by Adjacency Grammars.

Oriol Ramos Terrades. (2006). Linear Combination of Multiresolution Descriptors: Application to Graphics Recognition (Salvatore Antoine Tabbone, & Ernest Valveny, Eds.). Ph.D. thesis, , .

Fernando Vilariño. (2006). A Machine Learning Approach for Intestinal Motility Assessment with Capsule Endoscopy (Petia Radeva, Ed.). Ph.D. thesis, , .

Abstract: Intestinal motility assessment with video capsule endoscopy arises as a novel and challenging clinical fieldwork. This technique is based on the analysis of the patterns of intestinal contractions obtained by labelling all the motility events present in a video provided by a capsule with a wireless micro-camera, which is ingested by the patient. However, the visual analysis of these video sequences presents several im- portant drawbacks, mainly related to both the large amount of time needed for the visualization process, and the low prevalence of intestinal contractions in video.
In this work we propose a machine learning system to automatically detect the intestinal contractions in video capsule endoscopy, driving a very useful but not fea- sible clinical routine into a feasible clinical procedure. Our proposal is divided into two different parts: The first part tackles the problem of the automatic detection of phasic contractions in capsule endoscopy videos. Phasic contractions are dynamic events spanning about 4-5 seconds, which show visual patterns with a high variability. Our proposal is based on a sequential design which involves the analysis of textural, color and blob features with powerful classifiers such as SVM. This approach appears to cope with two basic aims: the reduction of the imbalance rate of the data set, and the modular construction of the system, which adds the capability of including domain knowledge as new stages in the cascade. The second part of the current work tackles the problem of the automatic detection of tonic contractions. Tonic contrac- tions manifest in capsule endoscopy as a sustained pattern of the folds and wrinkles of the intestine, which may be prolonged for an undetermined span of time. Our proposal is based on the analysis of the wrinkle patterns, presenting a comparative study of diverse features and classification methods, and providing a set of appro- priate descriptors for their characterization. We provide a detailed analysis of the performance achieved by our system both in a qualitative and a quantitative way.

http://refbase.cvc.uab.es/show.php?record=738

Josep Llados. (2006). Computer Vision: Progress of Research and Development ( J. Llados(ed.), Ed.).

F. Pla, Petia Radeva, & Jordi Vitria. (2006). Pattern Recognition: Progress, Directions and Applications.

Josep Llados, W. Liu, & Jean-Marc Ogier. (2007). Seventh IAPR International Workshop on Graphics Recognition GREC 2007.

Jordi Gonzalez, & Thomas B. Moeslund. (2008). Tracking Humans for the Evaluation of their Motion in Image Sequences.

Juan J. Villanueva. (2008). Visualization, Imaging, and Image Processing,.

Aymen Azaza. (2018). Context, Motion and Semantic Information for Computational Saliency (Joost Van de Weijer, & Ali Douik, Eds.). Ph.D. thesis, Ediciones Graficas Rey, .

Abstract: The main objective of this thesis is to highlight the salient object in an image or in a video sequence. We address three important—but in our opinion
insufficiently investigated—aspects of saliency detection. Firstly, we start
by extending previous research on saliency which explicitly models the information provided from the context. Then, we show the importance of
explicit context modelling for saliency estimation. Several important works
in saliency are based on the usage of object proposals. However, these methods
focus on the saliency of the object proposal itself and ignore the context.
To introduce context in such saliency approaches, we couple every object
proposal with its direct context. This allows us to evaluate the importance
of the immediate surround (context) for its saliency. We propose several
saliency features which are computed from the context proposals including
features based on omni-directional and horizontal context continuity. Secondly,
we investigate the usage of top-downmethods (high-level semantic
information) for the task of saliency prediction since most computational
methods are bottom-up or only include few semantic classes. We propose
to consider a wider group of object classes. These objects represent important
semantic information which we will exploit in our saliency prediction
approach. Thirdly, we develop a method to detect video saliency by computing
saliency from supervoxels and optical flow. In addition, we apply the
context features developed in this thesis for video saliency detection. The
method combines shape and motion features with our proposed context
features. To summarize, we prove that extending object proposals with their
direct context improves the task of saliency detection in both image and
video data. Also the importance of the semantic information in saliency
estimation is evaluated. Finally, we propose a newmotion feature to detect
saliency in video data. The three proposed novelties are evaluated on standard
saliency benchmark datasets and are shown to improve with respect to
state-of-the-art.

http://refbase.cvc.uab.es/show.php?record=3218

Alfons Juan-Ciscar, & Gemma Sanchez. (2008). PRIS 2008. Pattern Recognition in Information Systems. Proceedings of the 8th international Workshop on Pattern Recognition in Information systems – PRIS 2008, in conjunction with ICEIS 2008.

Miquel Ferrer. (2008). Theory and Algorithms on the Median Graph. Application to Graph-based Classification and Clustering (Francesc Serratosa Casanelles, & Ernest Valveny, Eds.). Ph.D. thesis, , .

Daniel Ponsa. (2007). Model-Based Visual Localisation of Contours and Vehicles (Antonio Lopez, & Xavier Roca, Eds.). Ph.D. thesis, Ediciones Graficas Rey, .

Robert Benavente. (2007). A Parametric Model for Computational Colour Naming (Maria Vanrell, Ed.). Ph.D. thesis, Ediciones Graficas Rey, .

Robert Benavente, Laura Igual, & Fernando Vilariño. (2008). Current Challenges in Computer Vision.

Pau Baiget. (2009). Modeling Human Behavior for Image Sequence Understanding and Generation (Jordi Gonzalez, & Xavier Roca, Eds.). Ph.D. thesis, Ediciones Graficas Rey, .

Abstract: The comprehension of animal behavior, especially human behavior, is one of the most ancient and studied problems since the beginning of civilization. The big list of factors that interact to determine a person action require the collaboration of different disciplines, such as psichology, biology, or sociology. In the last years the analysis of human behavior has received great attention also from the computer vision community, given the latest advances in the acquisition of human motion data from image sequences.
Despite the increasing availability of that data, there still exists a gap towards obtaining a conceptual representation of the obtained observations. Human behavior analysis is based on a qualitative interpretation of the results, and therefore the assignment of concepts to quantitative data is linked to a certain ambiguity.
This Thesis tackles the problem of obtaining a proper representation of human behavior in the contexts of computer vision and animation. On the one hand, a good behavior model should permit the recognition and explanation the observed activity in image sequences. On the other hand, such a model must allow the generation of new synthetic instances, which model the behavior of virtual agents.
First, we propose methods to automatically learn the models from observations. Given a set of quantitative results output by a vision system, a normal behavior model is learnt. This results provides a tool to determine the normality or abnormality of future observations. However, machine learning methods are unable to provide a richer description of the observations. We confront this problem by means of a new method that incorporates prior knowledge about the enviornment and about the expected behaviors. This framework, formed by the reasoning engine FMTL and the modeling tool SGT allows the generation of conceptual descriptions of activity in new image sequences. Finally, we demonstrate the suitability of the proposed framework to simulate behavior of virtual agents, which are introduced into real image sequences and interact with observed real agents, thereby easing the generation of augmented reality sequences.
The set of approaches presented in this Thesis has a growing set of potential applications. The analysis and description of behavior in image sequences has its principal application in the domain of smart video--surveillance, in order to detect suspicious or dangerous behaviors. Other applications include automatic sport commentaries, elderly monitoring, road traffic analysis, and the development of semantic video search engines. Alternatively, behavioral virtual agents allow to simulate accurate real situations, such as fires or crowds. Moreover, the inclusion of virtual agents into real image sequences has been widely deployed in the games and cinema industries.

http://refbase.cvc.uab.es/show.php?record=1210