TY  - CONF
AU  - David Aldavert
AU  - Marçal Rusiñol
AU  - Ricardo Toledo
AU  - Josep Llados
A2  - ICDAR
PY  - 2013//
TI  - Integrating Visual and Textual Cues for Query-by-String Word Spotting
BT  - 12th International Conference on Document Analysis and Recognition
SP  - 511
EP  - 515
N2  - In this paper, we present a word spotting framework that follows the query-by-string paradigm where word images are represented both by textual and visual representations. The textual representation is formulated in terms of character $n$-grams while the visual one is based on the bag-of-visual-words scheme. These two representations are merged together and projected to a sub-vector space. This transform allows to, given a textual query, retrieve word instances that were only represented by the visual modality. Moreover, this statistical representation can be used together with state-of-the-art indexation structures in order to deal with large-scale scenarios. The proposed method is evaluated using a collection of historical documents outperforming state-of-the-art performances.
SN  - 1520-5363
L1  - http://refbase.cvc.uab.es/files/ART2013.pdf
UR  - http://dx.doi.org/10.1109/ICDAR.2013.108
N1  - DAG; ADAS; 600.045; 600.055; 600.061
ID  - David Aldavert2013
ER  -