Author |
Title |
Year |
Publication |
Volume |
Pages |
Soumya Jahagirdar; Minesh Mathew; Dimosthenis Karatzas; CV Jawahar |
Understanding Video Scenes Through Text: Insights from Text-Based Video Question Answering |
2023 |
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops |
|
|
Jordy Van Landeghem; Ruben Tito; Lukasz Borchmann; Michal Pietruszka; Pawel Joziak; Rafal Powalski; Dawid Jurkiewicz; Mickael Coustaty; Bertrand Anckaert; Ernest Valveny; Matthew Blaschko; Sien Moens; Tomasz Stanislawek |
Document Understanding Dataset and Evaluation (DUDE) |
2023 |
20th IEEE International Conference on Computer Vision |
|
19528-19540 |
Ruben Perez Tito |
Exploring the role of Text in Visual Question Answering on Natural Scenes and Documents |
2023 |
PhD Thesis, Universitat Autonoma de Barcelona-CVC |
|
|
Subhajit Maity; Sanket Biswas; Siladittya Manna; Ayan Banerjee; Josep Llados; Saumik Bhattacharya; Umapada Pal |
SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation |
2023 |
17th International Conference on Doccument Analysis and Recognition |
14187 |
342–360 |
Souhail Bakkali; Sanket Biswas; Zuheng Ming; Mickael Coustaty; Marçal Rusiñol; Oriol Ramos Terrades; Josep Llados |
TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language |
2023 |
Arxiv |
|
|
Ruben Tito; Khanh Nguyen; Marlon Tobaben; Raouf Kerkouche; Mohamed Ali Souibgui; Kangsoo Jung; Lei Kang; Ernest Valveny; Antti Honkela; Mario Fritz; Dimosthenis Karatzas |
Privacy-Aware Document Visual Question Answering |
2023 |
Arxiv |
|
|
Mohamed Ali Souibgui; Asma Bensalah; Jialuo Chen; Alicia Fornes; Michelle Waldispühl |
A User Perspective on HTR methods for the Automatic Transcription of Rare Scripts: The Case of Codex Runicus Just Accepted |
2023 |
ACM Journal on Computing and Cultural Heritage |
15 |
1-18 |
Souhail Bakkali; Zuheng Ming; Mickael Coustaty; Marçal Rusiñol; Oriol Ramos Terrades |
VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification |
2023 |
Pattern Recognition |
139 |
109419 |
Ruben Tito; Dimosthenis Karatzas; Ernest Valveny |
Hierarchical multimodal transformers for Multi-Page DocVQA |
2023 |
Pattern Recognition |
144 |
109834 |
Reuben Dorent; Aaron Kujawa; Marina Ivory; Spyridon Bakas; Nikola Rieke; Samuel Joutard; Ben Glocker; Jorge Cardoso; Marc Modat; Kayhan Batmanghelich; Arseniy Belkov; Maria Baldeon Calisto; Jae Won Choi; Benoit M. Dawant; Hexin Dong; Sergio Escalera; Yubo Fan; Lasse Hansen; Mattias P. Heinrich; Smriti Joshi; Victoriya Kashtanova; Hyeon Gyu Kim; Satoshi Kondo; Christian N. Kruse; Susana K. Lai-Yuen; Hao Li; Han Liu; Buntheng Ly; Ipek Oguz; Hyungseob Shin; Boris Shirokikh; Zixian Su; Guotai Wang; Jianghao Wu; Yanwu Xu; Kai Yao; Li Zhang; Sebastien Ourselin, |
CrossMoDA 2021 challenge: Benchmark of Cross-Modality Domain Adaptation techniques for Vestibular Schwannoma and Cochlea Segmentation |
2023 |
Medical Image Analysis |
83 |
102628 |
David Pujol Perich; Albert Clapes; Sergio Escalera |
SADA: Semantic adversarial unsupervised domain adaptation for Temporal Action Localization |
2023 |
Arxiv |
|
|
Lei Li; Fuping Wu; Sihan Wang; Xinzhe Luo; Carlos Martin-Isla; Shuwei Zhai; Jianpeng Zhang; Yanfei Liu; Zhen Zhang; Markus J. Ankenbrand; Haochuan Jiang; Xiaoran Zhang; Linhong Wang; Tewodros Weldebirhan Arega; Elif Altunok; Zhou Zhao; Feiyan Li; Jun Ma; Xiaoping Yang; Elodie Puybareau; Ilkay Oksuz; Stephanie Bricq; Weisheng Li;Kumaradevan Punithakumar; Sotirios A. Tsaftaris; Laura M. Schreiber; Mingjing Yang; Guocai Liu; Yong Xia; Guotai Wang; Sergio Escalera; Xiahai Zhuag |
MyoPS: A benchmark of myocardial pathology segmentation combining three-sequence cardiac magnetic resonance images |
2023 |
Medical Image Analysis |
87 |
102808 |
Razieh Rastgoo; Kourosh Kiani; Sergio Escalera |
ZS-GR: zero-shot gesture recognition from RGB-D videos |
2023 |
Multimedia Tools and Applications |
82 |
43781-43796 |
Carlos Martin-Isla; Victor M Campello; Cristian Izquierdo; Kaisar Kushibar; Carla Sendra Balcells; Polyxeni Gkontra; Alireza Sojoudi; Mitchell J Fulton; Tewodros Weldebirhan Arega; Kumaradevan Punithakumar; Lei Li; Xiaowu Sun; Yasmina Al Khalil; Di Liu; Sana Jabbar; Sandro Queiros; Francesco Galati; Moona Mazher; Zheyao Gao; Marcel Beetz; Lennart Tautz; Christoforos Galazis; Marta Varela; Markus Hullebrand; Vicente Grau; Xiahai Zhuang; Domenec Puig; Maria A Zuluaga; Hassan Mohy Ud Din; Dimitris Metaxas; Marcel Breeuwer; Rob J van der Geest; Michelle Noga; Stephanie Bricq; Mark E Rentschler; Andrea Guala; Steffen E Petersen; Sergio Escalera; Jose F Rodriguez Palomares; Karim Lekadir |
Deep Learning Segmentation of the Right Ventricle in Cardiac MRI: The M&ms Challenge |
2023 |
IEEE Journal of Biomedical and Health Informatics |
27 |
3302-3313 |
Razieh Rastgoo; Kourosh Kiani; Sergio Escalera |
A deep co-attentive hand-based video question answering framework using multi-view skeleton |
2023 |
Multimedia Tools and Applications |
82 |
1401–1429 |