Arnau Baro. 2022. Reading Music Systems: From Deep Optical Music Recognition to Contextual Methods. (Ph.D. thesis, IMPRIMA.)
Ali Furkan Biten. 2022. A Bitter-Sweet Symphony on Vision and Language: Bias and World Knowledge. (Ph.D. thesis, IMPRIMA.)
Andres Mafla. 2022. Leveraging Scene Text Information for Image Interpretation. (Ph.D. thesis, IMPRIMA.)
Mohamed Ali Souibgui. 2022. Document Image Enhancement and Recognition in Low Resource Scenarios: Application to Ciphers and Handwritten Text. (Ph.D. thesis, IMPRIMA.)
Andrea Gemelli, Sanket Biswas, Enrico Civitelli, Josep Llados and Simone Marinai. 2022. Doc2Graph: A Task Agnostic Document Understanding Framework Based on Graph Neural Networks. 17th European Conference on Computer Vision Workshops. 329–344. (LNCS.)
Kunal Biswas, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Michel Blumenstein and Josep Llados. 2023. Classification of aesthetic natural scene images using statistical and semantic features. MTAP, 82(9), 13507–13532.
Ali Furkan Biten, Ruben Tito, Lluis Gomez, Ernest Valveny and Dimosthenis Karatzas. 2022. OCR-IDL: OCR Annotations for Industry Document Library Dataset. ECCV Workshop on Text in Everything.
Ruben Tito, Dimosthenis Karatzas and Ernest Valveny. 2023. Hierarchical multimodal transformers for Multi-Page DocVQA. PR, 144, 109834.
Souhail Bakkali, Zuheng Ming, Mickael Coustaty, Marçal Rusiñol and Oriol Ramos Terrades. 2023. VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification. PR, 139, 109419.