Records | |||||
---|---|---|---|---|---|
Author | David Fernandez; Jon Almazan; Nuria Cirera; Alicia Fornes; Josep Llados | ||||
Title | BH2M: the Barcelona Historical Handwritten Marriages database | Type | Conference Article | ||
Year | 2014 | Publication | 22nd International Conference on Pattern Recognition | Abbreviated Journal | |
Volume | Issue | Pages | 256 - 261 | ||
Keywords | |||||
Abstract | This paper presents an image database of historical handwritten marriage records stored in the archives of Barcelona Cathedral, and the corresponding metadata intended for evaluating the performance of document analysis algorithms. The contribution of this paper is twofold. First, it presents a complete ground truth which covers the whole pipeline of handwriting recognition research, from layout analysis to recognition and understanding. Second, it is the first dataset in the emerging area of genealogical document analysis, where documents are manuscripts pseudo-structured with specific lexicons and the interest goes beyond pure transcription and is context dependent. | ||||
Address | Crete Island; Greece; September 2014 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | 1051-4651 | ISBN | Medium | ||
Area | Expedition | Conference | ICPR | ||
Notes | DAG; 600.056; 600.061; 602.006; 600.077 | Approved | no | ||
Call Number | Admin @ si @ FAC2014 | Serial | 2461 | ||
Permanent link to this record | |||||
Author | Lluis Gomez; Dimosthenis Karatzas | ||||
Title | MSER-based Real-Time Text Detection and Tracking | Type | Conference Article | ||
Year | 2014 | Publication | 22nd International Conference on Pattern Recognition | Abbreviated Journal | |
Volume | Issue | Pages | 3110 - 3115 | ||
Keywords | |||||
Abstract | We present a hybrid algorithm for detection and tracking of text in natural scenes that goes beyond the full-detection approaches in terms of time performance optimization. A state-of-the-art scene text detection module based on Maximally Stable Extremal Regions (MSER) is used to detect text asynchronously, while on a separate thread detected text objects are tracked by MSER propagation. The cooperation of these two modules yields real time video processing at high frame rates even on low-resource devices. | ||||
Address | Stockholm; August 2014 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | 1051-4651 | ISBN | Medium | ||
Area | Expedition | Conference | ICPR | ||
Notes | DAG; 600.056; 601.158; 601.197; 600.077 | Approved | no | ||
Call Number | Admin @ si @ GoK2014a | Serial | 2492 | ||
Permanent link to this record | |||||
Author | Hongxing Gao; Marçal Rusiñol; Dimosthenis Karatzas; Josep Llados | ||||
Title | Embedding Document Structure to Bag-of-Words through Pair-wise Stable Key-regions | Type | Conference Article | ||
Year | 2014 | Publication | 22nd International Conference on Pattern Recognition | Abbreviated Journal | |
Volume | Issue | Pages | 2903 - 2908 | ||
Keywords | |||||
Abstract | Since the document structure carries valuable discriminative information, plenty of efforts have been made for extracting and understanding document structure among which layout analysis approaches are the most commonly used. In this paper, Distance Transform based MSER (DTMSER) is employed to efficiently extract the document structure as a dendrogram of key-regions which roughly correspond to structural elements such as characters, words and paragraphs. Inspired by the Bag of Words (BoW) framework, we propose an efficient method for structural document matching by representing the document image as a histogram of key-region pairs encoding structural relationships. Applied to the scenario of document image retrieval, experimental results demonstrate a remarkable improvement when comparing the proposed method with typical BoW and pyramidal BoW methods. | ||||
Address | Stockholm; Sweden; August 2014 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | ICPR | ||
Notes | DAG; 600.056; 600.061; 600.077 | Approved | no | ||
Call Number | Admin @ si @ GRK2014b | Serial | 2497 | ||
Permanent link to this record | |||||
Author | P. Wang; V. Eglin; C. Garcia; C. Largeron; Josep Llados; Alicia Fornes | ||||
Title | A Coarse-to-Fine Word Spotting Approach for Historical Handwritten Documents Based on Graph Embedding and Graph Edit Distance | Type | Conference Article | ||
Year | 2014 | Publication | 22nd International Conference on Pattern Recognition | Abbreviated Journal | |
Volume | Issue | Pages | 3074 - 3079 | ||
Keywords | word spotting; coarse-to-fine mechanism; graph-based representation; graph embedding; graph edit distance | ||||
Abstract | Effective information retrieval on handwritten document images has always been a challenging task, especially on historical ones. In this paper, we propose a coarse-to-fine handwritten word spotting approach based on graph representation. The presented model comprises both the topological and morphological signatures of the handwriting. Skeleton-based graphs with Shape Context labelled vertices are established for connected components. Each word image is represented as a sequence of graphs. Aiming at developing a practical and efficient word spotting approach for large-scale historical handwritten documents, a fast and coarse comparison is first applied to prune the regions that are not similar to the query based on the graph embedding methodology. Afterwards, the query and regions of interest are compared by graph edit distance based on the Dynamic Time Warping alignment. The proposed approach is evaluated on a public dataset containing 50 pages of historical marriage license records. The results show that the proposed approach achieves a compromise between efficiency and accuracy. | ||||
Address | Stockholm; Sweden; August 2014 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | 1051-4651 | ISBN | Medium | ||
Area | Expedition | Conference | ICPR | ||
Notes | DAG; 600.061; 602.006; 600.077 | Approved | no | ||
Call Number | Admin @ si @ WEG2014a | Serial | 2515 | ||
Permanent link to this record | |||||
Author | Claudio Baecchi; Francesco Turchini; Lorenzo Seidenari; Andrew Bagdanov; Alberto del Bimbo | ||||
Title | Fisher vectors over random density forest for object recognition | Type | Conference Article | ||
Year | 2014 | Publication | 22nd International Conference on Pattern Recognition | Abbreviated Journal | |
Volume | Issue | Pages | 4328-4333 | ||
Keywords | |||||
Abstract | |||||
Address | Stockholm; Sweden; August 2014 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | ICPR | ||
Notes | LAMP; 600.079 | Approved | no | ||
Call Number | Admin @ si @ BTS2014 | Serial | 2518 | ||
Permanent link to this record | |||||
Author | Federico Bartoli; Giuseppe Lisanti; Svebor Karaman; Andrew Bagdanov; Alberto del Bimbo | ||||
Title | Unsupervised scene adaptation for faster multi-scale pedestrian detection | Type | Conference Article | ||
Year | 2014 | Publication | 22nd International Conference on Pattern Recognition | Abbreviated Journal | |
Volume | Issue | Pages | 3534 - 3539 | ||
Keywords | |||||
Abstract | |||||
Address | Stockholm; Sweden; August 2014 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | ICPR | ||
Notes | LAMP; 600.079 | Approved | no | ||
Call Number | Admin @ si @ BLK2014 | Serial | 2519 | ||
Permanent link to this record | |||||
Author | Francisco Cruz; Oriol Ramos Terrades | ||||
Title | EM-Based Layout Analysis Method for Structured Documents | Type | Conference Article | ||
Year | 2014 | Publication | 22nd International Conference on Pattern Recognition | Abbreviated Journal | |
Volume | Issue | Pages | 315-320 | ||
Keywords | |||||
Abstract | In this paper we present a method to perform layout analysis in structured documents. We propose an EM-based algorithm to fit a set of Gaussian mixtures to the different regions according to the logical distribution along the page. After convergence, we estimate the final shape of the regions according to the parameters computed for each component of the mixture. We evaluated our method in the task of record detection in a collection of historical structured documents and performed a comparison with previous works in this task. | ||||
Address | |||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | 1051-4651 | ISBN | Medium | ||
Area | Expedition | Conference | ICPR | ||
Notes | DAG; 602.006; 600.061; 600.077 | Approved | no | ||
Call Number | Admin @ si @ CrR2014 | Serial | 2530 | ||
Permanent link to this record | |||||
Author | Victor Ponce; Mario Gorga; Xavier Baro; Sergio Escalera | ||||
Title | Human Behavior Analysis from Video Data Using Bag-of-Gestures | Type | Conference Article | ||
Year | 2011 | Publication | 22nd International Joint Conference on Artificial Intelligence | Abbreviated Journal | |
Volume | 3 | Issue | Pages | 2836-2837 | |
Keywords | |||||
Abstract | Human Behavior Analysis in Uncontrolled Environments can be divided into two main challenges: 1) feature extraction and 2) behavior analysis from a vocabulary of body language. In this work, we present our achievements characterizing some simple behaviors from visual data on different real applications and discuss our plan for future work: low level vocabulary definition from bag-of-gesture units and high level modelling and inference of human behaviors. | ||||
Address | Barcelona | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | 978-1-57735-516-8 | Medium | ||
Area | Expedition | Conference | IJCAI | ||
Notes | HuPBA;MV | Approved | no | ||
Call Number | Admin @ si @ PGB2011b | Serial | 1770 | ||
Permanent link to this record | |||||
Author | Cristhian A. Aguilera-Carrasco; Angel Sappa; Ricardo Toledo | ||||
Title | LGHD: a Feature Descriptor for Matching Across Non-Linear Intensity Variations | Type | Conference Article | ||
Year | 2015 | Publication | 22nd IEEE International Conference on Image Processing | Abbreviated Journal | |
Volume | Issue | Pages | 178 - 181 | ||
Keywords | |||||
Abstract | |||||
Address | Quebec; Canada; September 2015 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | ICIP | ||
Notes | ADAS; 600.076 | Approved | no | ||
Call Number | Admin @ si @ AST2015 | Serial | 2630 | ||
Permanent link to this record | |||||
Author | Javier M. Olaso; Alain Vazquez; Leila Ben Letaifa; Mikel de Velasco; Aymen Mtibaa; Mohamed Amine Hmani; Dijana Petrovska-Delacretaz; Gerard Chollet; Cesar Montenegro; Asier Lopez-Zorrilla; Raquel Justo; Roberto Santana; Jofre Tenorio-Laranga; Eduardo Gonzalez-Fraile; Begoña Fernandez-Ruanova; Gennaro Cordasco; Anna Esposito; Kristin Beck Gjellesvik; Anna Torp Johansen; Maria Stylianou Kornes; Colin Pickard; Cornelius Glackin; Gary Cahalane; Pau Buch; Cristina Palmero; Sergio Escalera; Olga Gordeeva; Olivier Deroo; Anaïs Fernandez; Daria Kyslitska; Jose Antonio Lozano; Maria Ines Torres; Stephan Schlogl | ||||
Title | The EMPATHIC Virtual Coach: a demo | Type | Conference Article | ||
Year | 2021 | Publication | 23rd ACM International Conference on Multimodal Interaction | Abbreviated Journal | |
Volume | Issue | Pages | 848-851 | ||
Keywords | |||||
Abstract | The main objective of the EMPATHIC project has been the design and development of a virtual coach to engage the healthy-senior user and to enhance well-being through awareness of personal status. The EMPATHIC approach addresses this objective through multimodal interactions supported by the GROW coaching model. The paper summarizes the main components of the EMPATHIC Virtual Coach (EMPATHIC-VC) and introduces a demonstration of the coaching sessions in selected scenarios. | ||||
Address | Virtual; October 2021 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | ICMI | ||
Notes | HUPBA; no proj | Approved | no | ||
Call Number | Admin @ si @ OVB2021 | Serial | 3644 | ||
Permanent link to this record | |||||
Author | Jon Almazan; Albert Gordo; Alicia Fornes; Ernest Valveny | ||||
Title | Efficient Exemplar Word Spotting | Type | Conference Article | ||
Year | 2012 | Publication | 23rd British Machine Vision Conference | Abbreviated Journal | |
Volume | Issue | Pages | 67.1 - 67.11 | ||
Keywords | |||||
Abstract | In this paper we propose an unsupervised segmentation-free method for word spotting in document images. Documents are represented with a grid of HOG descriptors, and a sliding window approach is used to locate the document regions that are most similar to the query. We use the exemplar SVM framework to produce a better representation of the query in an unsupervised way. Finally, the document descriptors are precomputed and compressed with Product Quantization. This offers two advantages: first, a large number of documents can be kept in RAM memory at the same time. Second, the sliding window becomes significantly faster since distances between quantized HOG descriptors can be precomputed. Our results significantly outperform other segmentation-free methods in the literature, both in accuracy and in speed and memory usage. | ||||
Address | |||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | 1-901725-46-4 | Medium | ||
Area | Expedition | Conference | BMVC | ||
Notes | DAG | Approved | no | ||
Call Number | DAG @ dag @ AGF2012 | Serial | 1984 | ||
Permanent link to this record | |||||
Author | Naila Murray; Luca Marchesotti; Florent Perronnin | ||||
Title | Learning to Rank Images using Semantic and Aesthetic Labels | Type | Conference Article | ||
Year | 2012 | Publication | 23rd British Machine Vision Conference | Abbreviated Journal | |
Volume | Issue | Pages | 110.1-110.10 | ||
Keywords | |||||
Abstract | Most works on image retrieval from text queries have addressed the problem of retrieving semantically relevant images. However, the ability to assess the aesthetic quality of an image is an increasingly important differentiating factor for search engines. In this work, given a semantic query, we are interested in retrieving images which are semantically relevant and score highly in terms of aesthetics/visual quality. We use large-margin classifiers and rankers to learn statistical models capable of ordering images based on the aesthetic and semantic information. In particular, we compare two families of approaches: while the first one attempts to learn a single ranker which takes into account both semantic and aesthetic information, the second one learns separate semantic and aesthetic models. We carry out a quantitative and qualitative evaluation on a recently-published large-scale dataset and we show that the second family of techniques significantly outperforms the first one. | ||||
Address | Guildford, London | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | 1-901725-46-4 | Medium | ||
Area | Expedition | Conference | BMVC | ||
Notes | CIC | Approved | no | ||
Call Number | Admin @ si @ MMP2012b | Serial | 2027 | ||
Permanent link to this record | |||||
Author | Pedro Martins; Paulo Carvalho; Carlo Gatta | ||||
Title | Context Aware Keypoint Extraction for Robust Image Representation | Type | Conference Article | ||
Year | 2012 | Publication | 23rd British Machine Vision Conference | Abbreviated Journal | |
Volume | Issue | Pages | 100.1 - 100.12 | ||
Keywords | |||||
Abstract | |||||
Address | |||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | BMVC | ||
Notes | MILAB | Approved | no | ||
Call Number | Admin @ si @ MCG2012a | Serial | 2140 | ||
Permanent link to this record | |||||
Author | Mario Rojas; David Masip; A. Todorov; Jordi Vitria | ||||
Title | Automatic Point-based Facial Trait Judgments Evaluation | Type | Conference Article | ||
Year | 2010 | Publication | 23rd IEEE Conference on Computer Vision and Pattern Recognition | Abbreviated Journal | |
Volume | Issue | Pages | 2715–2720 | ||
Keywords | |||||
Abstract | Humans constantly evaluate the personalities of other people using their faces. Facial trait judgments have been studied in the psychological field, and have been determined to influence important social outcomes of our lives, such as election outcomes and social relationships. Recent work on textual descriptions of faces has shown that trait judgments are highly correlated. Further, behavioral studies suggest that two orthogonal dimensions, valence and dominance, can describe the basis of the human judgments from faces. In this paper, we used a corpus of behavioral data of judgments on different trait dimensions to automatically learn a trait predictor from facial pixel images. We study whether trait evaluations performed by humans can be learned using machine learning classifiers, and used later in automatic evaluations of new facial images. The experiments performed using local point-based descriptors show promising results in the evaluation of the main traits. | ||||
Address | San Francisco CA, USA | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | 1063-6919 | ISBN | 978-1-4244-6984-0 | Medium | |
Area | Expedition | Conference | CVPR | ||
Notes | OR;MV | Approved | no | ||
Call Number | BCNPCL @ bcnpcl @ RMT2010 | Serial | 1282 | ||
Permanent link to this record | |||||
Author | Josep M. Gonfaus; Xavier Boix; Joost Van de Weijer; Andrew Bagdanov; Joan Serrat; Jordi Gonzalez | ||||
Title | Harmony Potentials for Joint Classification and Segmentation | Type | Conference Article | ||
Year | 2010 | Publication | 23rd IEEE Conference on Computer Vision and Pattern Recognition | Abbreviated Journal | |
Volume | Issue | Pages | 3280–3287 | ||
Keywords | |||||
Abstract | Hierarchical conditional random fields have been successfully applied to object segmentation. One reason is their ability to incorporate contextual information at different scales. However, these models do not allow multiple labels to be assigned to a single node. At higher scales in the image, this yields an oversimplified model, since multiple classes can be reasonably expected to appear within one region. This simplified model especially limits the impact that observations at larger scales may have on the CRF model. Neglecting the information at larger scales is undesirable since class-label estimates based on these scales are more reliable than at smaller, noisier scales. To address this problem, we propose a new potential, called harmony potential, which can encode any possible combination of class labels. We propose an effective sampling strategy that renders tractable the underlying optimization problem. Results show that our approach obtains state-of-the-art results on two challenging datasets: Pascal VOC 2009 and MSRC-21. | ||||
Address | San Francisco CA, USA | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | 1063-6919 | ISBN | 978-1-4244-6984-0 | Medium | |
Area | Expedition | Conference | CVPR | ||
Notes | ADAS;CIC;ISE | Approved | no | ||
Call Number | ADAS @ adas @ GBW2010 | Serial | 1296 | ||
Permanent link to this record |