|
Stepan Simsa and 10 others. 2023. Overview of DocILE 2023: Document Information Localization and Extraction. International Conference of the Cross-Language Evaluation Forum for European Languages. 276–293. (LNCS.)
Abstract: This paper provides an overview of the DocILE 2023 Competition, its tasks, participant submissions, the competition results and possible future research directions. This first edition of the competition focused on two Information Extraction tasks, Key Information Localization and Extraction (KILE) and Line Item Recognition (LIR). Both of these tasks require detection of pre-defined categories of information in business documents. The second task additionally requires correctly grouping the information into tuples, capturing the structure laid out in the document. The competition used the recently published DocILE dataset and benchmark that stays open to new submissions. The diversity of the participant solutions indicates the potential of the dataset as the submissions included pure Computer Vision, pure Natural Language Processing, as well as multi-modal solutions and utilized all of the parts of the dataset, including the annotated, synthetic and unlabeled subsets.
Keywords: Information Extraction; Computer Vision; Natural Language Processing; Optical Character Recognition; Document Understanding
|
|
|
Sergio Escalera, Alicia Fornes, Oriol Pujol, Josep Llados and Petia Radeva. 2007. Multi-class Binary Object Categorization using Blurred Shape Models. Progress in Pattern Recognition, Image Analysis and Applications, 12th Iberoamerican Congress on Pattern. 773–782. (LNCS.)
|
|
|
Josep Llados, J. Lopez-Krahe and Enric Marti. 1999. A Hough-based method for hatched pattern detection in maps and diagrams.
|
|
|
Josep Llados, Felipe Lumbreras and X. Varona. 1999. A multidocument platform for automatic reading of identity cards.
|
|
|
A. Pujol and 6 others. 1999. Real time pharmaceutical product recognition using color and shape indexing. Proceedings of the 2nd International Workshop on European Scientific and Industrial Collaboration (WESIC'99), Promoting Advanced Technologies in Manufacturing.
|
|
|
Josep Llados, Gemma Sanchez and Enric Marti. 1997. A String-Based Method to Recognize Symbols and Structural Textures in Architectural Plans.
|
|
|
Jordi Vitria and 6 others. 1999. Real time recognition of pharmaceutical products by subspace methods.
|
|
|
V. Chapaprieta and Ernest Valveny. 2001. Handwritten Digit Recognition Using Point Distribution Models.
|
|
|
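Josep Llados. 1996. Interpretacio de dibuixos linials fets a ma alçada mitjançant isomorfisme entre subgrafs i transformacio de Hough [Interpretation of freehand line drawings by means of subgraph isomorphism and the Hough transform].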
|
|
|
Josep Llados, Felipe Lumbreras, V. Chapaprieta and J. Queralt. 2001. ICAR: Identity Card Automatic Reader.
|
|