Search & Display Options
Search within Results:
Field:
author
title
year
keywords
abstract
type
contains:
...
Exclude matches
Display Options:
Field:
author
title
year
keywords
abstract
type
records per page
Select All
Deselect All
[11–20]
<<
21
22
23
24
25
26
27
28
29
30
>>
[31–40]
List View
|
Citations
|
Details
Author
Title
Year
Publication
Volume
Pages
Links
Ruben Tito; Dimosthenis Karatzas; Ernest Valveny
Hierarchical multimodal transformers for Multipage DocVQA
2023
Pattern Recognition
144
Mohamed Ali Souibgui; Sanket Biswas; Andres Mafla; Ali Furkan Biten; Alicia Fornes; Yousri Kessentini; Josep Llados; Lluis Gomez; Dimosthenis Karatzas
Text-DIAE: a self-supervised degradation invariant autoencoder for text recognition and document enhancement
2023
Proceedings of the 37th AAAI Conference on Artificial Intelligence
37
Marwa Dhiaf; Mohamed Ali Souibgui; Kai Wang; Yuyang Liu; Yousri Kessentini; Alicia Fornes; Ahmed Cheikh Rouhou
CSSL-MHTR: Continual Self-Supervised Learning for Scalable Multi-script Handwritten Text Recognition
2023
Arxiv
Mickael Coustaty; Alicia Fornes
Document Analysis and Recognition – ICDAR 2023 Workshops
2023
Document Analysis and Recognition – ICDAR 2023 Workshops
14194
Ayan Banerjee; Sanket Biswas; Josep Llados; Umapada Pal
SemiDocSeg: Harnessing Semi-Supervised Learning for Document Layout Analysis
2024
International Journal on Document Analysis and Recognition
Soumya Jahagirdar; Minesh Mathew; Dimosthenis Karatzas; CV Jawahar
Watching the News: Towards VideoQA Models that can Read
2023
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer
Soumya Jahagirdar; Minesh Mathew; Dimosthenis Karatzas; CV Jawahar
Understanding Video Scenes Through Text: Insights from Text-Based Video Question Answering
2023
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops
Ruben Perez Tito
Exploring the role of Text in Visual Question Answering on Natural Scenes and Documents
2023
PhD Thesis, Universitat Autonoma de Barcelona-CVC
Alloy Das; Sanket Biswas; Umapada Pal; Josep Llados
Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes
2024
IEEE International Conference on Robotics and Automation in PACIFICO
Souhail Bakkali; Sanket Biswas; Zuheng Ming; Mickael Coustaty; Marçal Rusiñol; Oriol Ramos Terrades; Josep Llados
TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language
2023
Arxiv
Select All
Deselect All
[11–20]
<<
21
22
23
24
25
26
27
28
29
30
>>
[31–40]
List View
|
Citations
|
Details