Search & Display Options
Search within Results:
Field:
author
title
year
keywords
abstract
type
contains:
...
Exclude matches
Display Options:
Field:
author
title
year
keywords
abstract
type
records per page
Select All
Deselect All
<<
1
2
3
4
5
6
7
8
9
10
>>
[11–20]
List View
|
Citations
|
Details
Author
Title
Year
Publication
Volume
Pages
Links
Pau Torras; Mohamed Ali Souibgui; Sanket Biswas; Alicia Fornes
Segmentation-Free Alignment of Arbitrary Symbol Transcripts to Images
2023
Document Analysis and Recognition – ICDAR 2023 Workshops
14193
83-93
Ruben Perez Tito
Exploring the role of Text in Visual Question Answering on Natural Scenes and Documents
2023
PhD Thesis, Universitat Autonoma de Barcelona-CVC
Ruben Tito; Dimosthenis Karatzas; Ernest Valveny
Hierarchical multimodal transformers for Multi-Page DocVQA
2023
Pattern Recognition
144
109834
Ruben Tito; Dimosthenis Karatzas; Ernest Valveny
Hierarchical multimodal transformers for Multipage DocVQA
2023
Pattern Recognition
144
Ruben Tito; Khanh Nguyen; Marlon Tobaben; Raouf Kerkouche; Mohamed Ali Souibgui; Kangsoo Jung; Lei Kang; Ernest Valveny; Antti Honkela; Mario Fritz; Dimosthenis Karatzas
Privacy-Aware Document Visual Question Answering
2023
Arxiv
Sergi Garcia Bordils; Dimosthenis Karatzas; Marçal Rusiñol
Accelerating Transformer-Based Scene Text Detection and Recognition via Token Pruning
2023
17th International Conference on Document Analysis and Recognition
14192
106-121
Souhail Bakkali; Sanket Biswas; Zuheng Ming; Mickael Coustaty; Marçal Rusiñol; Oriol Ramos Terrades; Josep Llados
TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language
2023
Arxiv
Souhail Bakkali; Zuheng Ming; Mickael Coustaty; Marçal Rusiñol; Oriol Ramos Terrades
VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification
2023
Pattern Recognition
139
109419
Soumya Jahagirdar; Minesh Mathew; Dimosthenis Karatzas; CV Jawahar
Understanding Video Scenes Through Text: Insights from Text-Based Video Question Answering
2023
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops
Soumya Jahagirdar; Minesh Mathew; Dimosthenis Karatzas; CV Jawahar
Watching the News: Towards VideoQA Models that can Read
2023
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer
Select All
Deselect All
<<
1
2
3
4
5
6
7
8
9
10
>>
[11–20]
List View
|
Citations
|
Details