Publicacions CVC -- Query Results

	V. Poulain d'Andecy, Emmanuel Hartmann, & Marçal Rusiñol. (2018). Field Extraction by hybrid incremental and a-priori structural templates. In 13th IAPR International Workshop on Document Analysis Systems (pp. 251–256). Abstract: In this paper, we present an incremental framework for extracting information fields from administrative documents. First, we demonstrate some limits of the existing state-of-the-art methods such as the delay of the system efficiency. This is a concern in industrial context when we have only few samples of each document class. Based on this analysis, we propose a hybrid system combining incremental learning by means of itf-df statistics and a-priori generic models. We report in the experimental section our results obtained with a dataset of real invoices. Keywords: Layout Analysis; information extraction; incremental learning Permanent link \| Save citation: RTF PDF LaTeX \| Export record: BibTeX Endnote ISI RIS Atom XML MODS XML ODF XML Word XML

Abstract: In this paper, we present an incremental framework for extracting information fields from administrative documents. First, we demonstrate some limits of the existing state-of-the-art methods such as the delay of the system efficiency. This is a concern in industrial context when we have only few samples of each document class. Based on this analysis, we propose a hybrid system combining incremental learning by means of itf-df statistics and a-priori generic
models. We report in the experimental section our results obtained with a dataset of real invoices.

Keywords: Layout Analysis; information extraction; incremental learning

Permanent link

| Save citation: RTF PDF LaTeX

| Export record: BibTeX Endnote ISI RIS Atom XML MODS XML ODF XML Word XML

	Publicacions CVC Home \| Show All \| Simple Search \| Advanced Search \| Add Record \| Import	Login Quick Search: Field: contains: ...
	1–1 of 1 record found matching your query (RSS):