Home | << 1 >> |
Record | |||||
---|---|---|---|---|---|
Author | V. Poulain d'Andecy; Emmanuel Hartmann; Marçal Rusiñol | ||||
Title | Field Extraction by hybrid incremental and a-priori structural templates | Type | Conference Article | ||
Year | 2018 | Publication | 13th IAPR International Workshop on Document Analysis Systems | Abbreviated Journal | |
Volume | Issue | Pages | 251 - 256 | ||
Keywords | Layout Analysis; information extraction; incremental learning | ||||
Abstract | In this paper, we present an incremental framework for extracting information fields from administrative documents. First, we demonstrate some limits of the existing state-of-the-art methods such as the delay of the system efficiency. This is a concern in industrial context when we have only few samples of each document class. Based on this analysis, we propose a hybrid system combining incremental learning by means of itf-df statistics and a-priori generic
models. We report in the experimental section our results obtained with a dataset of real invoices. |
||||
Address | Viena; Austria; April 2018 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | DAS | ||
Notes | DAG; 600.084; 600.129; 600.121 | Approved | no | ||
Call Number | Admin @ si @ PHR2018 | Serial | 3106 | ||
Permanent link to this record |