Publicacions CVC -- Query Results

Marc Bolaños, Mariella Dimiccoli, & Petia Radeva. (2017). Towards Storytelling from Visual Lifelogging: An Overview. THMS - IEEE Transactions on Human-Machine Systems, 47(1), 77–90. Abstract: Visual lifelogging consists of acquiring images that capture the daily experiences of the user by wearing a camera over a long period of time. The pictures taken offer considerable potential for knowledge mining concerning how people live their lives, hence, they open up new opportunities for many potential applications in fields including healthcare, security, leisure and the quantified self. However, automatically building a story from a huge collection of unstructured egocentric data presents major challenges. This paper provides a thorough review of advances made so far in egocentric data analysis, and in view of the current state of the art, indicates new lines of research to move us towards storytelling from visual lifelogging. http://refbase.cvc.uab.es/show.php?record=2712
Partha Pratim Roy, Eduard Vazquez, Josep Llados, Ramon Baldrich, & Umapada Pal. (2007). A System to Retrieve Text/Symbols from Color Maps using Connected Component and Skeleton Analysis. In J.M. Ogier W. L. J. Llados (Ed.), Seventh IAPR International Workshop on Graphics Recognition (79–78). http://refbase.cvc.uab.es/show.php?record=836
Ognjen Rudovic, & Xavier Roca. (2008). Building Temporale Templates for Human Behaviour Classification. In First International Workshop on Tracking Humans for the Evaluation of their Motion in Image Sequences BMVC 2008, (79–88). http://refbase.cvc.uab.es/show.php?record=1009
Jaume Garcia, Debora Gil, Sandra Pujades, & Francesc Carreras. (2008). Valoracion de la Funcion del Ventriculo Izquierdo mediante Modelos Regionales Hiperparametricos. Revista Española de Cardiologia, 61(3), 79. Abstract: La mayoría de la enfermedades cardiovasculares afectan a las propiedades contráctiles de la banda ventricular helicoidal. Esto se refleja en una variación del comportamiento normal de la función ventricular. Parámetros locales tales como los strains, o la deformación experimentada por el tejido, son indicadores capaces de detectar anomalías funcionales en territorios específicos. A menudo, dichos parámetros son considerados de forma separada. En este trabajo presentamos un marco computacional (el Dominio Paramétrico Normalizado, DPN) que permite integrarlos en hiperparámetros funcionales y estudiar sus rangos de normalidad. Dichos rangos permiten valorar de forma objetiva la función regional de cualquier nuevo paciente. Para ello, consideramos secuencias de resonancia magnética etiquetada a nivel basal, medio y apical. Los hiperparámetros se obtienen a partir del movimiento intramural del VI estimado mediante el método Harmonic Phase Flow. El DPN se define a partir de en una parametrización del Ventrículo Izquierdo (VI) en sus coordenadas radiales y circunferencial basada en criterios anatómicos. El paso de los hiperparámetros al DPN hace posible la comparación entre distintos pacientes. Los rangos de normalidad se definen mediante análisis estadístico de valores de voluntarios sanos en 45 regiones del DPN a lo largo de 9 fases sistólicas. Se ha usado un conjunto de 19 (14 H; E: 30.7±7.5) voluntarios sanos para crear los patrones de normalidad y se han validado usando 2 controles sanos y 3 pacientes afectados de contractilidad global reducida. Para los controles los resultados regionales se han ajustado dentro de la normalidad, mientras que para los pacientes se han obtenido valores anormales en las zonas descritas, localizando y cuantificando así el diagnóstico empírico. http://refbase.cvc.uab.es/show.php?record=1032
Enric Marti, Jaume Rocarias, Debora Gil, Marc Vivet, & Carme Julia. (2008). Uso de recursos virtuales en Aprendizaje Basado en Proyectos. Una experiencia en la asignatura de Graficos por Computador. Abstract: En esta comunicación presentamos una experiencia en Aprendizaje Basado en Proyectos (Project Based Learning – PBL) realizada los últimos cuatro años (cursos del 2004-05 al 2007-08) en Gráficos por Computador 2, asignatura optativa de tercer curso de Ingeniería Informática, titulación impartida en la Escuela Técnica Superior de Ingeniería (ETSE) de la Universidad Autónoma de Barcelona (UAB). Fruto de la constante voluntad de mejora de la organización ABP de nuestra asignatura nos decidimos a utilizar una herramienta LMS (Learning Management System) basada en Moodle y adaptada por nosotros llamada Caronte para poder gestionar la documentación generada en ABP, y añadir una componente semipresencial a la asignatura. En primer lugar se presenta la organización de nuestra asignatura, basada proponer al alumno dos itinerarios para cursarla: el itinerario ABP y el itinerario basado en clases magistrales i examen que llamaremos TPPE (Teoría, Problemas, Prácticas, Examen). La dinámica ABP nos genera una cantidad importante de documentación entre los grupos y el profesor, aparte de el feedback que el profesor genera a los alumnos. En la segunda parte del artículo presentamos los espacios docentes electrónicos de ambos itinerarios, con los que trabajan los alumnos. Finalmente, mostramos los resultados obtenidos de alumnos matriculados y de encuestas de valoración realizados por los alumnos para finalmente exponer las conclusiones de estos cuatro años de experiencia en ABP y en el uso de recursos virtuales en ABP, así como plantear mejoras y temas de discusión sobre ABP. http://refbase.cvc.uab.es/show.php?record=1101
David Roche, Debora Gil, & Jesus Giraldo. (2012). Assessing agonist efficacy in an uncertain Em world. In A. Christopoulus and M. Bouvier (Ed.), 40th Keystone Symposia on mollecular and celular biology (79). Keystone Symposia. Abstract: The operational model of agonism has been widely used for the analysis of agonist action since its formulation in 1983. The model includes the Em parameter, which is defined as the maximum response of the system. The methods for Em estimation provide Em values not significantly higher than the maximum responses achieved by full agonists. However, it has been found that that some classes of compounds as, for instance, superagonists and positive allosteric modulators can increase the full agonist maximum response, implying upper limits for Em and thereby posing doubts on the validity of Em estimates. Because of the correlation between Em and operational efficacy, τ, wrong Em estimates will yield wrong τ estimates. In this presentation, the operational model of agonism and various methods for the simulation of allosteric modulation will be analyzed. Alternatives for curve fitting will be presented and discussed. http://refbase.cvc.uab.es/show.php?record=1855
T.O. Nguyen, Salvatore Tabbone, Oriol Ramos Terrades, & A.T. Thierry. (2008). Proposition d'un descripteur de formes et du modèle vectoriel pour la recherche de symboles. In Colloque International Francophone sur l'Ecrit et le Document (pp. 79–84). http://refbase.cvc.uab.es/show.php?record=1875
Lluis Pere de las Heras, Joan Mas, Gemma Sanchez, & Ernest Valveny. (2013). Notation-invariant patch-based wall detector in architectural floor plans. In Graphics Recognition. New Trends and Challenges (Vol. 7423, pp. 79–88). LNCS. Springer Berlin Heidelberg. Abstract: Architectural floor plans exhibit a large variability in notation. Therefore, segmenting and identifying the elements of any kind of plan becomes a challenging task for approaches based on grouping structural primitives obtained by vectorization. Recently, a patch-based segmentation method working at pixel level and relying on the construction of a visual vocabulary has been proposed in [1], showing its adaptability to different notations by automatically learning the visual appearance of the elements in each different notation. This paper presents an evolution of that previous work, after analyzing and testing several alternatives for each of the different steps of the method: Firstly, an automatic plan-size normalization process is done. Secondly we evaluate different features to obtain the description of every patch. Thirdly, we train an SVM classifier to obtain the category of every patch instead of constructing a visual vocabulary. These variations of the method have been tested for wall detection on two datasets of architectural floor plans with different notations. After studying in deep each of the steps in the process pipeline, we are able to find the best system configuration, which highly outperforms the results on wall segmentation obtained by the original paper. http://refbase.cvc.uab.es/show.php?record=2322
Jorge Charco, Angel Sappa, Boris X. Vintimilla, & Henry Velesaca. (2022). Human Body Pose Estimation in Multi-view Environments. In ICT Applications for Smart Cities. Intelligent Systems Reference Library (Vol. 224, pp. 79–99). ISRL. Springer. Abstract: This chapter tackles the challenging problem of human pose estimation in multi-view environments to handle scenes with self-occlusions. The proposed approach starts by first estimating the camera pose—extrinsic parameters—in multi-view scenarios; due to few real image datasets, different virtual scenes are generated by using a special simulator, for training and testing the proposed convolutional neural network based approaches. Then, these extrinsic parameters are used to establish the relation between different cameras into the multi-view scheme, which captures the pose of the person from different points of view at the same time. The proposed multi-view scheme allows to robustly estimate human body joints’ position even in situations where they are occluded. This would help to avoid possible false alarms in behavioral analysis systems of smart cities, as well as applications for physical therapy, safe moving assistance for the elderly among other. The chapter concludes by presenting experimental results in real scenes by using state-of-the-art and the proposed multi-view approaches. http://refbase.cvc.uab.es/show.php?record=3810
Asma Bensalah, Pau Riba, Alicia Fornes, & Josep Llados. (2019). Shoot less and Sketch more: An Efficient Sketch Classification via Joining Graph Neural Networks and Few-shot Learning. In 13th IAPR International Workshop on Graphics Recognition (pp. 80–85). Abstract: With the emergence of the touchpad devices and drawing tablets, a new era of sketching started afresh. However, the recognition of sketches is still a tough task due to the variability of the drawing styles. Moreover, in some application scenarios there is few labelled data available for training, which imposes a limitation for deep learning architectures. In addition, in many cases there is a need to generate models able to adapt to new classes. In order to cope with these limitations, we propose a method based on few-shot learning and graph neural networks for classifying sketches aiming for an efficient neural model. We test our approach with several databases of sketches, showing promising results. Keywords: Sketch classification; Convolutional Neural Network; Graph Neural Network; Few-shot learning http://refbase.cvc.uab.es/show.php?record=3354
Jaume Amores. (2013). Multiple Instance Classification: review, taxonomy and comparative study. AI - Artificial Intelligence, 201, 81–105. Abstract: Multiple Instance Learning (MIL) has become an important topic in the pattern recognition community, and many solutions to this problemhave been proposed until now. Despite this fact, there is a lack of comparative studies that shed light into the characteristics and behavior of the different methods. In this work we provide such an analysis focused on the classification task (i.e.,leaving out other learning tasks such as regression). In order to perform our study, we implemented fourteen methods grouped into three different families. We analyze the performance of the approaches across a variety of well-known databases, and we also study their behavior in synthetic scenarios in order to highlight their characteristics. As a result of this analysis, we conclude that methods that extract global bag-level information show a clearly superior performance in general. In this sense, the analysis permits us to understand why some types of methods are more successful than others, and it permits us to establish guidelines in the design of new MIL methods. Keywords: Multi-instance learning; Codebook; Bag-of-Words http://refbase.cvc.uab.es/show.php?record=2273
Hamdi Dibeklioglu, M.O. Hortas, I. Kosunen, P. Zuzánek, Albert Ali Salah, & Theo Gevers. (2011). Design and implementation of an affect-responsive interactive photo frame. JMUI - Journal on Multimodal User Interfaces, 81–95. Abstract: This paper describes an affect-responsive interactive photo-frame application that offers its user a different experience with every use. It relies on visual analysis of activity levels and facial expressions of its users to select responses from a database of short video segments. This ever-growing database is automatically prepared by an offline analysis of user-uploaded videos. The resulting system matches its user’s affect along dimensions of valence and arousal, and gradually adapts its response to each specific user. In an extended mode, two such systems are coupled and feed each other with visual content. The strengths and weaknesses of the system are assessed through a usability study, where a Wizard-of-Oz response logic is contrasted with the fully automatic system that uses affective and activity-based features, either alone, or in tandem. http://refbase.cvc.uab.es/show.php?record=1842
Arnau Baro, Pau Riba, Jorge Calvo-Zaragoza, & Alicia Fornes. (2018). Optical Music Recognition by Long Short-Term Memory Networks. In B. L. A. Fornes (Ed.), Graphics Recognition. Current Trends and Evolutions (Vol. 11009, pp. 81–95). LNCS. Springer. Abstract: Optical Music Recognition refers to the task of transcribing the image of a music score into a machine-readable format. Many music scores are written in a single staff, and therefore, they could be treated as a sequence. Therefore, this work explores the use of Long Short-Term Memory (LSTM) Recurrent Neural Networks for reading the music score sequentially, where the LSTM helps in keeping the context. For training, we have used a synthetic dataset of more than 40000 images, labeled at primitive level. The experimental results are promising, showing the benefits of our approach. Keywords: Optical Music Recognition; Recurrent Neural Network; Long ShortTerm Memory http://refbase.cvc.uab.es/show.php?record=3227
Ernest Valveny, Salvatore Tabbone, Oriol Ramos Terrades, & Emilie Jean-Marie Odile. (2007). Performance Characterization of Shape Descriptors for Symbol Representation. In Seventh IAPR International Workshop on Graphics Recognition (82–83). http://refbase.cvc.uab.es/show.php?record=889
Jose Antonio Rodriguez, & Florent Perronnin. (2008). Score Normalization for Hmm-based Word Spotting Using Universal Background Model. In International Conference on Frontiers in Handwriting Recognition (82–87). http://refbase.cvc.uab.es/show.php?record=1067

Marc Bolaños, Mariella Dimiccoli, & Petia Radeva. (2017). Towards Storytelling from Visual Lifelogging: An Overview. THMS - IEEE Transactions on Human-Machine Systems, 47(1), 77–90.

Partha Pratim Roy, Eduard Vazquez, Josep Llados, Ramon Baldrich, & Umapada Pal. (2007). A System to Retrieve Text/Symbols from Color Maps using Connected Component and Skeleton Analysis. In J.M. Ogier W. L. J. Llados (Ed.), Seventh IAPR International Workshop on Graphics Recognition (79–78).

Ognjen Rudovic, & Xavier Roca. (2008). Building Temporale Templates for Human Behaviour Classification. In First International Workshop on Tracking Humans for the Evaluation of their Motion in Image Sequences BMVC 2008, (79–88).

Jaume Garcia, Debora Gil, Sandra Pujades, & Francesc Carreras. (2008). Valoracion de la Funcion del Ventriculo Izquierdo mediante Modelos Regionales Hiperparametricos. Revista Española de Cardiologia, 61(3), 79.

Enric Marti, Jaume Rocarias, Debora Gil, Marc Vivet, & Carme Julia. (2008). Uso de recursos virtuales en Aprendizaje Basado en Proyectos. Una experiencia en la asignatura de Graficos por Computador.

David Roche, Debora Gil, & Jesus Giraldo. (2012). Assessing agonist efficacy in an uncertain Em world. In A. Christopoulus and M. Bouvier (Ed.), 40th Keystone Symposia on mollecular and celular biology (79). Keystone Symposia.

T.O. Nguyen, Salvatore Tabbone, Oriol Ramos Terrades, & A.T. Thierry. (2008). Proposition d'un descripteur de formes et du modèle vectoriel pour la recherche de symboles. In Colloque International Francophone sur l'Ecrit et le Document (pp. 79–84).

Lluis Pere de las Heras, Joan Mas, Gemma Sanchez, & Ernest Valveny. (2013). Notation-invariant patch-based wall detector in architectural floor plans. In Graphics Recognition. New Trends and Challenges (Vol. 7423, pp. 79–88). LNCS. Springer Berlin Heidelberg.

Jorge Charco, Angel Sappa, Boris X. Vintimilla, & Henry Velesaca. (2022). Human Body Pose Estimation in Multi-view Environments. In ICT Applications for Smart Cities. Intelligent Systems Reference Library (Vol. 224, pp. 79–99). ISRL. Springer.

Asma Bensalah, Pau Riba, Alicia Fornes, & Josep Llados. (2019). Shoot less and Sketch more: An Efficient Sketch Classification via Joining Graph Neural Networks and Few-shot Learning. In 13th IAPR International Workshop on Graphics Recognition (pp. 80–85).

Jaume Amores. (2013). Multiple Instance Classification: review, taxonomy and comparative study. AI - Artificial Intelligence, 201, 81–105.

Hamdi Dibeklioglu, M.O. Hortas, I. Kosunen, P. Zuzánek, Albert Ali Salah, & Theo Gevers. (2011). Design and implementation of an affect-responsive interactive photo frame. JMUI - Journal on Multimodal User Interfaces, 81–95.

Arnau Baro, Pau Riba, Jorge Calvo-Zaragoza, & Alicia Fornes. (2018). Optical Music Recognition by Long Short-Term Memory Networks. In B. L. A. Fornes (Ed.), Graphics Recognition. Current Trends and Evolutions (Vol. 11009, pp. 81–95). LNCS. Springer.

Ernest Valveny, Salvatore Tabbone, Oriol Ramos Terrades, & Emilie Jean-Marie Odile. (2007). Performance Characterization of Shape Descriptors for Symbol Representation. In Seventh IAPR International Workshop on Graphics Recognition (82–83).

Jose Antonio Rodriguez, & Florent Perronnin. (2008). Score Normalization for Hmm-based Word Spotting Using Universal Background Model. In International Conference on Frontiers in Handwriting Recognition (82–87).