Author | Title | Year | Publication | Volume | Pages |
Lluis Gomez; Ali Furkan Biten; Ruben Tito; Andres Mafla; Marçal Rusiñol; Ernest Valveny; Dimosthenis Karatzas | Multimodal grid features and cell pointers for scene text visual question answering | 2021 | Pattern Recognition Letters | 150 | 242-249 |
Maedeh Aghaei; Mariella Dimiccoli; Petia Radeva | Multi-face tracking by extended bag-of-tracklets in egocentric photo-streams | 2016 | Computer Vision and Image Understanding | 149 | 146-156 |
Gerard Canal; Sergio Escalera; Cecilio Angulo | A Real-time Human-Robot Interaction system based on gestures for assistive scenarios | 2016 | Computer Vision and Image Understanding | 149 | 65-77 |
Arka Ujjal Dey; Suman Ghosh; Ernest Valveny; Gaurav Harit | Beyond Visual Semantics: Exploring the Role of Scene Text in Image Understanding | 2021 | Pattern Recognition Letters | 149 | 164-171 |
Iban Berganzo-Besga; Hector A. Orengo; Felipe Lumbreras; Paloma Aliende; Monica N. Ramsey | Automated detection and classification of multi-cell Phytoliths using Deep Learning-Based Algorithms | 2022 | Journal of Archaeological Science | 148 | 105654 |
Mohammad Momeny; Ali Asghar Neshat; Ahmad Jahanbakhshi; Majid Mahmoudi; Yiannis Ampatzidis; Petia Radeva | Grading and fraud detection of saffron via learning-to-augment incorporated Inception-v4 CNN | 2023 | Food Control | 147 | 109554 |
Katerine Diaz; Francesc J. Ferri; Aura Hernandez-Sabate | An overview of incremental feature extraction methods based on linear subspaces | 2018 | Knowledge-Based Systems | 145 | 219-235 |
Joakim Bruslund Haurum; Meysam Madadi; Sergio Escalera; Thomas B. Moeslund | Multi-scale hybrid vision transformer and Sinkhorn tokenizer for sewer defect classification | 2022 | Automation in Construction | 144 | 104614 |
Ruben Tito; Dimosthenis Karatzas; Ernest Valveny | Hierarchical multimodal transformers for Multi-Page DocVQA | 2023 | Pattern Recognition | 144 | 109834 |
Ruben Tito; Dimosthenis Karatzas; Ernest Valveny | Hierarchical multimodal transformers for Multipage DocVQA | 2023 | Pattern Recognition | 144 | |
Souhail Bakkali; Zuheng Ming; Mickael Coustaty; Marçal Rusiñol; Oriol Ramos Terrades | VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification | 2023 | Pattern Recognition | 139 | 109419 |
Xavier Soria; Angel Sappa; Patricio Humanante; Arash Akbarinia | Dense extreme inception network for edge detection | 2023 | Pattern Recognition | 139 | 109461 |
Josep M. Gonfaus; Marco Pedersoli; Jordi Gonzalez; Andrea Vedaldi; Xavier Roca | Factorized appearances for object detection | 2015 | Computer Vision and Image Understanding | 138 | 92-101 |
Muhammad Anwer Rao; Fahad Shahbaz Khan; Joost Van de Weijer; Matthieu Molinier; Jorma Laaksonen | Binary patterns encoded convolutional neural networks for texture recognition and remote sensing scene classification | 2018 | ISPRS Journal of Photogrammetry and Remote Sensing | 138 | 74-85 |
Giuseppe Pezzano; Oliver Diaz; Vicent Ribas Ripoll; Petia Radeva | CoLe-CNN+: Context learning – Convolutional neural network for COVID-19-Ground-Glass-Opacities detection and segmentation | 2021 | Computers in Biology and Medicine | 136 | 104689 |