TY  - JOUR
AU  - Pau Riba
AU  - Josep Llados
AU  - Alicia Fornes
AU  - Anjan Dutta
PY  - 2017//
TI  - Large-scale graph indexing using binary embeddings of node contexts for information spotting in document image databases
T2  - PRL
JO  - Pattern Recognition Letters
SP  - 203
EP  - 211
VL  - 87
N2  - Graph-based representations are experiencing a growing usage in visual recognition and retrieval due to their representational power in front of classical appearance-based representations. However, retrieving a query graph from a large dataset of graphs implies a high computational complexity. The most important property for a large-scale retrieval is the search time complexity to be sub-linear in the number of database examples. With this aim, in this paper we propose a graph indexation formalism applied to visual retrieval. A binary embedding is defined as hashing keys for graph nodes. Given a database of labeled graphs, graph nodes are complemented with vectors of attributes representing their local context. Then, each attribute vector is converted to a binary code applying a binary-valued hash function. Therefore, graph retrieval is formulated in terms of finding target graphs in the database whose nodes have a small Hamming distance from the query nodes, easily computed with bitwise logical operators. As an application example, we validate the performance of the proposed methods in different real scenarios such as handwritten word spotting in images of historical documents or symbol spotting in architectural floor plans.
UR  - https://doi.org/10.1016/j.patrec.2016.06.015
N1  - DAG; 600.097; 602.006; 603.053; 600.121
ID  - Pau Riba2017
ER  - 