TY  - CONF
AU  - Marçal Rusiñol
AU  - T. Benkhelfallah
AU  - V. Poulain d'Andecy
A2  - ICDAR
PY  - 2013//
TI  - Field Extraction from Administrative Documents by Incremental Structural Templates
BT  - 12th International Conference on Document Analysis and Recognition
SP  - 1100
EP  - 1104
N2  - In this paper we present an incremental framework aimed at extracting field information from administrative document images in the context of a Digital Mail-room scenario. Given a single training sample in which the user has marked which fields have to be extracted from a particular document class, a document model representing structural relationships among words is built. This model is incrementally refined as the system processes more and more documents from the same class. A reformulation of the tf-idf statistic scheme allows adjusting the importance weights of the structural relationships among words. We report in the experimental section our results obtained with a large dataset of real invoices.
SN  - 1520-5363
L1  - http://refbase.cvc.uab.es/files/RBP2013.pdf
UR  - http://dx.doi.org/10.1109/ICDAR.2013.223
N1  - DAG; 600.56; 600.045; 605.203; 602.101
ID  - Marçal Rusiñol2013
ER  - 