|
Ayan Banerjee, Sanket Biswas, Josep Llados and Umapada Pal. 2024. GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation.
Abstract: Object detection in documents is a key step to automate the structural elements identification process in a digital or scanned document through understanding the hierarchical structure and relationships between different elements. Large and complex models, while achieving high accuracy, can be computationally expensive and memory-intensive, making them impractical for deployment on resource constrained devices. Knowledge distillation allows us to create small and more efficient models that retain much of the performance of their larger counterparts. Here we present a graph-based knowledge distillation framework to correctly identify and localize the document objects in a document image. Here, we design a structured graph with nodes containing proposal-level features and edges representing the relationship between the different proposal regions. Also, to reduce text bias an adaptive node sampling strategy is designed to prune the weight distribution and put more weightage on non-text nodes. We encode the complete graph as a knowledge representation and transfer it from the teacher to the student through the proposed distillation loss by effectively capturing both local and global information concurrently. Extensive experimentation on competitive benchmarks demonstrates that the proposed framework outperforms the current state-of-the-art approaches. The code will be available at: this https URL.
|
|
|
Sergio Escalera, Alicia Fornes, Oriol Pujol, Josep Llados and Petia Radeva. 2007. Multi-class Binary Object Categorization using Blurred Shape Models. Progress in Pattern Recognition, Image Analysis and Applications, 12th Iberoamerican Congress on Pattern.773–782. (LCNS.)
|
|
|
Josep Llados, Gemma Sanchez and Enric Marti. 1997. A String-Based Method to Recognize Symbols and Structural Textures in Architectural Plans. Graphics Recognition Algorithms and Systems. GREC 1997..91–103. (LNCS.)
|
|
|
Agnes Borras, Francesc Tous, Josep Llados and Maria Vanrell. 2003. High-Level Clothes Description Based on Colour-Texture and Structural Features. 1rst. Iberian Conference on Pattern Recognition and Image Analysis IbPRIA 2003.108–116. (LNCS.)
Abstract: ecture Notes in Computer Science 2652 108–116
|
|
|
Salim Jouili, Salvatore Tabbone and Ernest Valveny. 2010. Comparing Graph Similarity Measures for Graphical Recognition. Graphics Recognition. Achievements, Challenges, and Evolution. 8th International Workshop, GREC 2009. Selected Papers. Springer Berlin Heidelberg, 37–48. (LNCS.)
Abstract: In this paper we evaluate four graph distance measures. The analysis is performed for document retrieval tasks. For this aim, different kind of documents are used including line drawings (symbols), ancient documents (ornamental letters), shapes and trademark-logos. The experimental results show that the performance of each graph distance measure depends on the kind of data and the graph representation technique.
|
|
|
W. Liu and Josep Llados. 2006. Graphics Recognition. Ten Years Review and Future Perspectives. (LNCS.)
|
|
|
Marçal Rusiñol and Josep Llados. 2008. A Region-Based Hashing Approach for Symbol Spotting in Technical Documents. In W. Lius, J.L., J.M. Ogier, ed. Graphics Recognition: Recent Advances and New Opportunities.104–113. (LNCS.)
|
|
|
Ernest Valveny, Salvatore Tabbone and Oriol Ramos Terrades. 2008. Performance Characterization of Shape Descriptors for Symbol Representation. In W. Liu, J.L., J.M. Ogier, ed. Graphics Recognition: Recent Advances and New Opportunities.278–287. (LNCS.)
|
|
|
Ernest Valveny, Philippe Dosch and Alicia Fornes. 2008. Report on the Third Contest on Symbol Recognition. In W. Liu, J.L., J.M. Ogier, ed. Graphics Recognition: Recent Advances and New Opportunities.321–328. (LNCS.)
|
|
|
Mathieu Nicolas Delalandre, Jean-Marc Ogier and Josep Llados. 2008. A Fast Cbir System of Old Ornamental Letter. In W. Liu, J.L., J.M. Ogier, ed. Graphics Reognition: Recent Advances and New Opportunities.135–144. (LNCS.)
|
|