Publicacions CVC -- Query Results

Search & Display Options

Select All Deselect All

<< 1 2 3 4 5 6 7 8 9 10 >> [11–20]

List View

|

|

	Author	Title	Year	Publication	Volume	Pages	Links
	Pau Torras; Mohamed Ali Souibgui; Sanket Biswas; Alicia Fornes	Segmentation-Free Alignment of Arbitrary Symbol Transcripts to Images	2023	Document Analysis and Recognition – ICDAR 2023 Workshops	14193	83-93
	Ruben Perez Tito	Exploring the role of Text in Visual Question Answering on Natural Scenes and Documents	2023	PhD Thesis, Universitat Autonoma de Barcelona-CVC
	Ruben Tito; Dimosthenis Karatzas; Ernest Valveny	Hierarchical multimodal transformers for Multi-Page DocVQA	2023	Pattern Recognition	144	109834
	Ruben Tito; Dimosthenis Karatzas; Ernest Valveny	Hierarchical multimodal transformers for Multipage DocVQA	2023	Pattern Recognition	144
	Ruben Tito; Khanh Nguyen; Marlon Tobaben; Raouf Kerkouche; Mohamed Ali Souibgui; Kangsoo Jung; Lei Kang; Ernest Valveny; Antti Honkela; Mario Fritz; Dimosthenis Karatzas	Privacy-Aware Document Visual Question Answering	2023	Arxiv
	Sergi Garcia Bordils; Dimosthenis Karatzas; Marçal Rusiñol	Accelerating Transformer-Based Scene Text Detection and Recognition via Token Pruning	2023	17th International Conference on Document Analysis and Recognition	14192	106-121
	Souhail Bakkali; Sanket Biswas; Zuheng Ming; Mickael Coustaty; Marçal Rusiñol; Oriol Ramos Terrades; Josep Llados	TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language	2023	Arxiv
	Souhail Bakkali; Zuheng Ming; Mickael Coustaty; Marçal Rusiñol; Oriol Ramos Terrades	VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification	2023	Pattern Recognition	139	109419
	Soumya Jahagirdar; Minesh Mathew; Dimosthenis Karatzas; CV Jawahar	Understanding Video Scenes Through Text: Insights from Text-Based Video Question Answering	2023	Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops
	Soumya Jahagirdar; Minesh Mathew; Dimosthenis Karatzas; CV Jawahar	Watching the News: Towards VideoQA Models that can Read	2023	Proceedings of the IEEE/CVF Winter Conference on Applications of Computer

Select All Deselect All

<< 1 2 3 4 5 6 7 8 9 10 >> [11–20]

List View

|

|

All Found Records Selected Records:

Save Citations: Format:

Export Records: Format: