toggle visibility Search & Display Options

Select All    Deselect All
List View
 |   | 
  Author Title Year (down) Publication Volume Pages Links
Ruben Tito; Dimosthenis Karatzas; Ernest Valveny Hierarchical multimodal transformers for Multipage DocVQA 2023 arxiv details   pdf url
Sergi Garcia Bordils; Dimosthenis Karatzas; Marçal Rusiñol Accelerating Transformer-Based Scene Text Detection and Recognition via Token Pruning 2023 17th International Conference on Document Analysis and Recognition 14192 106-121 details   url
Souhail Bakkali; Sanket Biswas; Zuheng Ming; Mickael Coustaty; Marçal Rusiñol; Oriol Ramos Terrades; Josep Llados TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language 2023 Arxiv details   pdf url
Souhail Bakkali; Zuheng Ming; Mickael Coustaty; Marçal Rusiñol; Oriol Ramos Terrades VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification 2023 Pattern Recognition 139 109419 details   pdf doi
Soumya Jahagirdar; Minesh Mathew; Dimosthenis Karatzas; CV Jawahar Understanding Video Scenes Through Text: Insights from Text-Based Video Question Answering 2023 Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops details   pdf url
Soumya Jahagirdar; Minesh Mathew; Dimosthenis Karatzas; CV Jawahar Watching the News: Towards VideoQA Models that can Read 2023 Proceedings of the IEEE/CVF Winter Conference on Applications of Computer details   pdf url
Stepan Simsa; Michal Uricar; Milan Sulc; Yash Patel; Ahmed Hamdi; Matej Kocian; Matyas Skalicky; Jiri Matas; Antoine Doucet; Mickael Coustaty; Dimosthenis Karatzas Overview of DocILE 2023: Document Information Localization and Extraction 2023 International Conference of the Cross-Language Evaluation Forum for European Languages 14163 276–293 details   doi
Stepan Simsa; Milan Sulc; Michal Uricar; Yash Patel; Ahmed Hamdi; Matej Kocian; Matyas Skalicky; Jiri Matas; Antoine Doucet; Mickael Coustaty; Dimosthenis Karatzas DocILE Benchmark for Document Information Localization and Extraction 2023 17th International Conference on Document Analysis and Recognition 14188 147–166 details   pdf url
Subhajit Maity; Sanket Biswas; Siladittya Manna; Ayan Banerjee; Josep Llados; Saumik Bhattacharya; Umapada Pal SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation 2023 17th International Conference on Doccument Analysis and Recognition 14187 342–360 details   pdf doi
Weijia Wu; Yuzhong Zhao; Zhuang Li; Jiahong Li; Mike Zheng Shou; Umapada Pal; Dimosthenis Karatzas; Xiang Bai ICDAR 2023 Competition on Video Text Reading for Dense and Small Text 2023 17th International Conference on Document Analysis and Recognition 14188 405–419 details   pdf url
Select All    Deselect All
List View
 |   | 

Save Citations:
Export Records: