toggle visibility Search & Display Options

Select All    Deselect All
 | 
Citations
 | 

2023

Ruben Tito, Dimosthenis Karatzas and Ernest Valveny. 2023. Hierarchical multimodal transformers for Multipage DocVQA.
toggle visibility
Sergi Garcia Bordils, Dimosthenis Karatzas and Marçal Rusiñol. 2023. Accelerating Transformer-Based Scene Text Detection and Recognition via Token Pruning. 17th International Conference on Document Analysis and Recognition.106–121. (LNCS.)
toggle visibility
Souhail Bakkali and 6 others. 2023. TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language.
toggle visibility
Souhail Bakkali, Zuheng Ming, Mickael Coustaty, Marçal Rusiñol and Oriol Ramos Terrades. 2023. VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification. PR, 139, 109419.
toggle visibility
Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas and CV Jawahar. 2023. Understanding Video Scenes Through Text: Insights from Text-Based Video Question Answering. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops.
toggle visibility
Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas and CV Jawahar. 2023. Watching the News: Towards VideoQA Models that can Read. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer.
toggle visibility
Stepan Simsa and 10 others. 2023. Overview of DocILE 2023: Document Information Localization and Extraction. International Conference of the Cross-Language Evaluation Forum for European Languages.276–293. (LNCS.)
toggle visibility
Stepan Simsa and 10 others. 2023. DocILE Benchmark for Document Information Localization and Extraction. 17th International Conference on Document Analysis and Recognition.147–166. (LNCS.)
toggle visibility
Subhajit Maity and 6 others. 2023. SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation. 17th International Conference on Doccument Analysis and Recognition.342–360.
toggle visibility
Weijia Wu and 7 others. 2023. ICDAR 2023 Competition on Video Text Reading for Dense and Small Text. 17th International Conference on Document Analysis and Recognition.405–419. (LNCS.)
toggle visibility
Select All    Deselect All
 | 
Citations
 |