Home | << 1 >> |
Record | |||||
---|---|---|---|---|---|
Author | Marçal Rusiñol; T.Benkhelfallah; V. Poulain d'Andecy | ||||
Title | Field Extraction from Administrative Documents by Incremental Structural Templates | Type | Conference Article | ||
Year | 2013 | Publication | 12th International Conference on Document Analysis and Recognition | Abbreviated Journal | |
Volume | Issue | Pages | 1100 - 1104 | ||
Keywords | |||||
Abstract | In this paper we present an incremental framework aimed at extracting field information from administrative document images in the context of a Digital Mail-room scenario. Given a single training sample in which the user has marked which fields have to be extracted from a particular document class, a document model representing structural relationships among words is built. This model is incrementally refined as the system processes more and more documents from the same class. A reformulation of the tf-idf statistic scheme allows to adjust the importance weights of the structural relationships among words. We report in the experimental section our results obtained with a large dataset of real invoices. | ||||
Address | Washington; USA; August 2013 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | 1520-5363 | ISBN | Medium | ||
Area | Expedition | Conference | ICDAR | ||
Notes | DAG; 600.56; 600.045; 605.203; 602.101 | Approved | no | ||
Call Number | Admin @ si @ RBP2013 | Serial | 2346 | ||
Permanent link to this record |