|
N. Serrano, L. Tarazon, D. Perez, Oriol Ramos Terrades, & S. Juan. (2010). The GIDOC Prototype. In 10th International Workshop on Pattern Recognition in Information Systems (pp. 82–89).
Abstract: Transcription of handwritten text in (old) documents is an important, time-consuming task for digital libraries. It might be carried out by first processing all document images off-line, and then manually supervising system transcriptions to edit incorrect parts. However, current techniques for automatic page layout analysis, text line detection and handwriting recognition are still far from perfect, and thus post-editing system output is not clearly better than simply ignoring it.
A more effective approach to transcribing old text documents is to follow an interactive-predictive paradigm in which the system is guided by the user and the user is assisted by the system, so that the transcription task is completed as efficiently as possible. Following this approach, a system prototype called GIDOC (Gimp-based Interactive transcription of old text DOCuments) has been developed to provide user-friendly, integrated support for interactive-predictive layout analysis, line detection and handwriting transcription.
GIDOC is designed to work with (large) collections of homogeneous documents, that is, of similar structure and writing styles. They are annotated sequentially, by (partially) supervising hypotheses drawn from statistical models that are constantly updated with an increasing number of available annotated documents. And this is done at different annotation levels. For instance, at the level of page layout analysis, GIDOC uses a novel text block detection method in which conventional, memoryless techniques are improved with a “history” model of text block positions. Similarly, at the level of text line image transcription, GIDOC includes a handwriting recognizer which is steadily improved with a growing number of (partially) supervised transcriptions.
|
|
|
Mohamed Ilyes Lakhal, Hakan Cevikalp, & Sergio Escalera. (2018). CRN: End-to-end Convolutional Recurrent Network Structure Applied to Vehicle Classification. In 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (Vol. 5, pp. 137–144).
Abstract: Vehicle type classification is considered to be a central part of Intelligent Traffic Systems. In recent years, deep learning methods have emerged as the state of the art in many computer vision tasks. In this paper, we present a novel yet simple deep learning framework for the vehicle type classification problem. We propose an end-to-end trainable system that combines a convolutional neural network for feature extraction and a recurrent neural network as a classifier. The recurrent network structure is used to handle various types of feature inputs and, at the same time, allows the system to produce either a single class prediction or a set of class predictions. In order to assess the effectiveness of our solution, we have conducted a set of experiments on two public datasets, obtaining state-of-the-art results. In addition, we also report results on the newly released MIO-TCD dataset.
Keywords: Vehicle Classification; Deep Learning; End-to-end Learning
|
|
|
Judit Martinez, Eva Costa, P. Herreros, Antonio Lopez, & Juan J. Villanueva. (2003). TV-Screen Quality Inspection by Artificial Vision. In Proceedings SPIE 5132, Sixth International Conference on Quality Control by Artificial Vision (QCAV 2003).
Abstract: A real-time vision system for TV screen quality inspection is introduced. The whole system consists of eight cameras and one processor per camera. It acquires and processes 112 images in 6 seconds. The defects to be inspected can be grouped into four main categories (bubble, line-out, line reduction and landing), although there is large variability within each particular type of defect. The complexity of the whole inspection process has been reduced by dividing images into smaller ones and grouping the defects into frequency-relevant and intensity-relevant ones. Tools such as mathematical morphology, Fourier transform, profile analysis and classification have been used. The performance of the system has been successfully tested against human operators in normal production conditions.
|
|
|
Fadi Dornaika, & Franck Davoine. (2005). Facial expression recognition in continuous videos using dynamic programming.
|
|
|
Fadi Dornaika, & Franck Davoine. (2005). SFM for planar scenes using image derivatives.
|
|
|
Carles Fernandez, & Jordi Gonzalez. (2007). Ontology for Semantic Integration in a Cognitive Surveillance System. In Semantic Multimedia, 2nd International Conference on Semantics and Digital Media Technologies (Vol. 4816, 263–263). LNCS.
|
|
|
Fernando Lopez, J.M. Valiente, Ramon Baldrich, & Maria Vanrell. (2005). Fast surface grading using color statistics in the CIELab space. In Pattern Recognition and Image Analysis. IbPRIA 2005 (Vol. LNCS 3523, pp. 666–673).
|
|
|
Eduard Vazquez, Ramon Baldrich, Javier Vazquez, & Maria Vanrell. (2007). Topological histogram reduction towards colour segmentation. In 3rd Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2007), J. Marti et al. (Eds.) LNCS 4477:55–62.
|
|
|
Jose Manuel Alvarez, Antonio Lopez, & Ramon Baldrich. (2007). Shadow Resistant Road Segmentation from a Mobile Monocular System. In 3rd Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2007), J. Marti et al. (Eds.) LNCS 4477:9–16.
|
|
|
Robert Benavente, M.C. Olive, Maria Vanrell, & Ramon Baldrich. (1999). Colour Perception: A Simple Method for Colour Naming.
|
|
|
Francesco Ciompi, Oriol Pujol, E Fernandez-Nofrerias, J. Mauri, & Petia Radeva. (2010). Conditional Random Fields for image segmentation in Intravascular Ultrasound. In Medical Image Computing in Catalunya: Graduate Student Workshop (13–14).
Abstract: We present a Conditional Random Fields based approach for segmenting Intravascular Ultrasound (IVUS) images. The presented method uses a contextual discriminative graphical model to deal with the presence of distortions and artifacts in IVUS images, which make the segmentation of regions of interest a difficult task. An accurate lumen segmentation on IVUS longitudinal images is achieved.
|
|
|
Pierluigi Casale, Oriol Pujol, & Petia Radeva. (2010). Classifying Agitation in Sedated ICU Patients. In Medical Image Computing in Catalunya: Graduate Student Workshop (19–20).
Abstract: Agitation is a serious problem in sedated intensive care unit (ICU) patients. In this work, standard machine learning techniques applied to wearable accelerometer data have been used to classify agitation levels, achieving very good classification performance.
|
|
|
Antonio Hernandez, Carlo Gatta, Petia Radeva, Laura Igual, R. Letaz, & Sergio Escalera. (2010). Automatic Vessel Segmentation For Angiography and CT Registration. In Medical Image Computing in Catalunya: Graduate Student Workshop (1–2).
|
|
|
Miguel Reyes, Jordi Vitria, Petia Radeva, & Sergio Escalera. (2010). Real-time Activity Monitoring of Inpatients. In Medical Image Computing in Catalunya: Graduate Student Workshop (35–36).
Abstract: In this paper, we present the development of an application capable of monitoring a set of patient vital signs in real time. The application has been designed to support the medical staff of a hospital. Preliminary results show the suitability of the system to prevent the injury produced by the agitation of the patients.
|
|
|
Enric Marti, Ferran Poveda, Antoni Gurgui, Jaume Rocarias, & Debora Gil. (2013). Una propuesta de seguimiento, tutorías on line y evaluación en la metodología de Aprendizaje Basado en Proyectos.
|
|