|
Minesh Mathew, Dimosthenis Karatzas and C.V. Jawahar. 2021. DocVQA: A Dataset for VQA on Document Images. IEEE Winter Conference on Applications of Computer Vision.2200–2209.
Abstract: We present a new dataset for Visual Question Answering (VQA) on document images called DocVQA. The dataset consists of 50,000 questions defined on 12,000+ document images. Detailed analysis of the dataset in comparison with similar datasets for VQA and reading comprehension is presented. We report several baseline results by adopting existing VQA and reading comprehension models. Although the existing models perform reasonably well on certain types of questions, there is large performance gap compared to human performance (94.36% accuracy). The models need to improve specifically on questions where understanding structure of the document is crucial. The dataset, code and leaderboard are available at docvqa. org
|
|
|
Mickael Coustaty and Alicia Fornes. 2023. Document Analysis and Recognition – ICDAR 2023 Workshops. (LNCS.)
|
|
|
Mathieu Nicolas Delalandre, Tony Pridmore, Ernest Valveny, Herve Locteau and Eric Trupin. 2008. Building Synthetic Graphical Documents for Performance Evaluation. In W. Liu, J.L., J.M. Ogier, ed. Graphics Recognition: Recent Advances and New Opportunities.288–298. (LNCS.)
|
|
|
Mathieu Nicolas Delalandre, Tony Pridmore, Ernest Valveny, Eric Trupin and Herve Locteau. 2007. Building Synthetic Graphical Documents for Performance Evaluation. Seventh IAPR International Workshop on Graphics Recognition.84–87.
|
|
|
Mathieu Nicolas Delalandre, Jean-Yves Ramel, Ernest Valveny and Muhammad Muzzamil Luqman. 2009. A Performance Characterization Algorithm for Symbol Localization. 8th IAPR International Workshop on Graphics Recognition. Springer, 3–11.
Abstract: In this paper we present an algorithm for performance characterization of symbol localization systems. This algorithm is aimed to be a more “reliable” and “open” solution to characterize the performance. To achieve that, it exploits only single points as the result of localization and offers the possibility to reconsider the localization results provided by a system. We use the information about context in groundtruth, and overall localization results, to detect the ambiguous localization results. A probability score is computed for each matching between a localization point and a groundtruth region, depending on the spatial distribution of the other regions in the groundtruth. Final characterization is given with detection rate/probability score plots, describing the sets of possible interpretations of the localization results, according to a given confidence rate. We present experimentation details along with the results for the symbol localization system of [1], exploiting a synthetic dataset of architectural floorplans and electrical diagrams (composed of 200 images and 3861 symbols).
|
|
|
Mathieu Nicolas Delalandre, Jean-Yves Ramel, Ernest Valveny and Muhammad Muzzamil Luqman. 2010. A Performance Characterization Algorithm for Symbol Localization. Graphics Recognition. Achievements, Challenges, and Evolution. 8th International Workshop, GREC 2009. Selected Papers. Springer Berlin Heidelberg, 260–271. (LNCS.)
Abstract: In this paper we present an algorithm for performance characterization of symbol localization systems. This algorithm is aimed to be a more “reliable” and “open” solution to characterize the performance. To achieve that, it exploits only single points as the result of localization and offers the possibility to reconsider the localization results provided by a system. We use the information about context in groundtruth, and overall localization results, to detect the ambiguous localization results. A probability score is computed for each matching between a localization point and a groundtruth region, depending on the spatial distribution of the other regions in the groundtruth. Final characterization is given with detection rate/probability score plots, describing the sets of possible interpretations of the localization results, according to a given confidence rate. We present experimentation details along with the results for the symbol localization system of [1], exploiting a synthetic dataset of architectural floorplans and electrical diagrams (composed of 200 images and 3861 symbols).
|
|
|
Mathieu Nicolas Delalandre, Jean-Marc Ogier and Josep Llados. 2007. A Fast System for the Retrieval of Ornamental Letter Image. Seventh IAPR International Workshop on Graphics Recognition.51–54.
|
|
|
Mathieu Nicolas Delalandre, Jean-Marc Ogier and Josep Llados. 2008. A Fast Cbir System of Old Ornamental Letter. In W. Liu, J.L., J.M. Ogier, ed. Graphics Reognition: Recent Advances and New Opportunities.135–144. (LNCS.)
|
|
|
Mathieu Nicolas Delalandre, Ernest Valveny, Tony Pridmore and Dimosthenis Karatzas. 2010. Generation of Synthetic Documents for Performance Evaluation of Symbol Recognition & Spotting Systems. IJDAR, 13(3), 187–207.
Abstract: This paper deals with the topic of performance evaluation of symbol recognition & spotting systems. We propose here a new approach to the generation of synthetic graphics documents containing non-isolated symbols in a real context. This approach is based on the definition of a set of constraints that permit us to place the symbols on a pre-defined background according to the properties of a particular domain (architecture, electronics, engineering, etc.). In this way, we can obtain a large amount of images resembling real documents by simply defining the set of constraints and providing a few pre-defined backgrounds. As documents are synthetically generated, the groundtruth (the location and the label of every symbol) becomes automatically available. We have applied this approach to the generation of a large database of architectural drawings and electronic diagrams, which shows the flexibility of the system. Performance evaluation experiments of a symbol localization system show that our approach permits to generate documents with different features that are reflected in variation of localization results.
|
|
|
Mathieu Nicolas Delalandre, Ernest Valveny and Josep Llados. 2008. Performance Evaluation of Symbol Recognition and Spotting Systems: An Overview.
|
|