Records |
Author |
Josep Brugues Pujolras; Lluis Gomez; Dimosthenis Karatzas |
Title |
A Multilingual Approach to Scene Text Visual Question Answering |
Type |
Conference Article |
Year |
2022 |
Publication |
Document Analysis Systems.15th IAPR International Workshop, (DAS2022) |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
65-79 |
Keywords |
Scene text; Visual question answering; Multilingual word embeddings; Vision and language; Deep learning |
Abstract |
Scene Text Visual Question Answering (ST-VQA) has recently emerged as a hot research topic in Computer Vision. Current ST-VQA models have a big potential for many types of applications but lack the ability to perform well on more than one language at a time due to the lack of multilingual data, as well as the use of monolingual word embeddings for training. In this work, we explore the possibility to obtain bilingual and multilingual VQA models. In that regard, we use an already established VQA model that uses monolingual word embeddings as part of its pipeline and substitute them by FastText and BPEmb multilingual word embeddings that have been aligned to English. Our experiments demonstrate that it is possible to obtain bilingual and multilingual VQA models with a minimal loss in performance in languages not used during training, as well as a multilingual model trained in multiple languages that match the performance of the respective monolingual baselines. |
Address |
La Rochelle, France; May 22–25, 2022 |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
DAS |
Notes |
DAG; 611.004; 600.155; 601.002 |
Approved |
no |
Call Number |
Admin @ si @ BGK2022b |
Serial |
3695 |
Permanent link to this record |
|
|
|
Author |
Josep Llados; Felipe Lumbreras; X. Varona |
Title |
A multidocument platform for automatic reading of identity cards. |
Type |
Miscellaneous |
Year |
1999 |
Publication |
Proceedings of the VIII Symposium Nacional de Reconocimiento de Formas y Analisis de Imagenes. |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
|
Keywords |
|
Abstract |
|
Address |
Bilbao |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
ADAS;DAG |
Approved |
no |
Call Number |
ADAS @ adas @ LLV1999 |
Serial |
7 |
Permanent link to this record |
|
|
|
Author |
Maria Vanrell; Jordi Vitria; Xavier Roca |
Title |
A multidimensional scaling approach to explore the behavior of a texture perception algorithm. |
Type |
Journal Article |
Year |
1997 |
Publication |
Machine Vision and Applications |
Abbreviated Journal |
|
Volume |
9 |
Issue |
|
Pages |
262–271 |
Keywords |
|
Abstract |
|
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
OR;ISE;CIC;MV |
Approved |
no |
Call Number |
BCNPCL @ bcnpcl @ VVR1997 |
Serial |
35 |
Permanent link to this record |
|
|
|
Author |
Debora Gil; Guillermo Torres |
Title |
A multi-shape loss function with adaptive class balancing for the segmentation of lung structures |
Type |
Conference Article |
Year |
2020 |
Publication |
34th International Congress and Exhibition on Computer Assisted Radiology & Surgery |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
|
Keywords |
|
Abstract |
|
Address |
Virtual; June 2020 |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
CARS |
Notes |
IAM; 600.139; 600.145 |
Approved |
no |
Call Number |
Admin @ si @ GiT2020 |
Serial |
3472 |
Permanent link to this record |
|
|
|
Author |
Guillermo Torres; Debora Gil |
Title |
A multi-shape loss function with adaptive class balancing for the segmentation of lung structures |
Type |
Journal Article |
Year |
2020 |
Publication |
International Journal of Computer Assisted Radiology and Surgery |
Abbreviated Journal |
IJCAR |
Volume |
15 |
Issue |
1 |
Pages |
S154-55 |
Keywords |
|
Abstract |
|
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
IAM |
Approved |
no |
Call Number |
Admin @ si @ ToG2020 |
Serial |
3590 |
Permanent link to this record |
|
|
|
Author |
Agnes Borras; Josep Llados |
Title |
A Multi-Scale Layout Descriptor Based on Delaunay Triangulation for Image Retrieval |
Type |
Conference Article |
Year |
2008 |
Publication |
3rd International Conference on Computer Vision Theory and Applications VISAPP (2) 2008 |
Abbreviated Journal |
|
Volume |
2 |
Issue |
|
Pages |
139-144 |
Keywords |
|
Abstract |
|
Address |
Funchal, Madeira (Portugal) |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
DAG |
Approved |
no |
Call Number |
DAG @ dag @ BoL2008 |
Serial |
981 |
Permanent link to this record |
|
|
|
Author |
Judit Martinez; Eva Costa; P. Herreros; F. Javier Sanchez; Ramon Baldrich |
Title |
A Modular and Scalable Architecture for PC-Based Real-Time Vision Systems |
Type |
Journal Article |
Year |
2003 |
Publication |
Real–Time Imaging, (IF: 0.512) |
Abbreviated Journal |
|
Volume |
9 |
Issue |
|
Pages |
99-112 |
Keywords |
|
Abstract |
|
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
CIC |
Approved |
no |
Call Number |
CAT @ cat @ MCH2003b |
Serial |
394 |
Permanent link to this record |
|
|
|
Author |
Marçal Rusiñol |
Title |
A Model of Vectorial Signatures in Terms of Expressive Sub-Shapes: Symbol Indexation in Technical Documents |
Type |
Report |
Year |
2006 |
Publication |
CVC Technical Report #94 |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
|
Keywords |
|
Abstract |
|
Address |
CVC (UAB) |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
DAG |
Approved |
no |
Call Number |
DAG @ dag @ Rus2006 |
Serial |
668 |
Permanent link to this record |
|
|
|
Author |
Ernest Valveny; Enric Marti |
Title |
A model for image generation and symbol recognition through the deformation of lineal shapes |
Type |
Journal Article |
Year |
2003 |
Publication |
Pattern Recognition Letters |
Abbreviated Journal |
PRL |
Volume |
24 |
Issue |
15 |
Pages |
2857-2867 |
Keywords |
|
Abstract |
We describe a general framework for the recognition of distorted images of lineal shapes, which relies on three items: a model to represent lineal shapes and their deformations, a model for the generation of distorted binary images and the combination of both models in a common probabilistic framework, where the generation of deformations is related to an internal energy, and the generation of binary images to an external energy. Then, recognition consists in the minimization of a global energy function, performed by using the EM algorithm. This general framework has been applied to the recognition of hand-drawn lineal symbols in graphic documents. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
Elsevier Science Inc. |
Place of Publication |
New York, NY, USA |
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
0167-8655 |
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
DAG; IAM |
Approved |
no |
Call Number |
IAM @ iam @ VAM2003 |
Serial |
1653 |
Permanent link to this record |
|
|
|
Author |
Daniel Ponsa |
Title |
A model based pedestrian tracking review |
Type |
Report |
Year |
2001 |
Publication |
CVC Technical Report #69 |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
|
Keywords |
|
Abstract |
|
Address |
CVC (UAB) |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
invisible;ADAS |
Approved |
no |
Call Number |
ADAS @ adas @ Pon2001 |
Serial |
522 |
Permanent link to this record |
|
|
|
Author |
V. Valev; Petia Radeva |
Title |
A Method of Solving Pattern or image Recognition Problems by Learning Boolean Formulas. |
Type |
Miscellaneous |
Year |
1992 |
Publication |
Proc. of 11th IAPR International Conference on Pattern Recognition, Hague, Netherlands, IEEE Computer Society Press, vol. II, pp. 359–362. |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
|
Keywords |
|
Abstract |
|
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
MILAB |
Approved |
no |
Call Number |
BCNPCL @ bcnpcl @ VaR1992b |
Serial |
253 |
Permanent link to this record |
|
|
|
Author |
Francesco Ciompi; Oriol Pujol; Petia Radeva |
Title |
A meta-learning approach to Conditional Random Fields using Error-Correcting Output Codes |
Type |
Conference Article |
Year |
2010 |
Publication |
20th International Conference on Pattern Recognition |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
710–713 |
Keywords |
|
Abstract |
We present a meta-learning framework for the design of potential functions for Conditional Random Fields. The design of both node potential and edge potential is formulated as a classification problem where margin classifiers are used. The set of state transitions for the edge potential is treated as a set of different classes, thus defining a multi-class learning problem. The Error-Correcting Output Codes (ECOC) technique is used to deal with the multi-class problem. Furthermore, the point defined by the combination of margin classifiers in the ECOC space is interpreted in a probabilistic manner, and the obtained distance values are then converted into potential values. The proposed model exhibits very promising results when applied to two real detection problems. |
Address |
Istanbul;Turkey |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
1051-4651 |
ISBN |
978-1-4244-7542-1 |
Medium |
|
Area |
|
Expedition |
|
Conference |
ICPR |
Notes |
MILAB;HUPBA |
Approved |
no |
Call Number |
BCNPCL @ bcnpcl @ CPR2010a |
Serial |
1365 |
Permanent link to this record |
|
|
|
Author |
Sergio Vera; Miguel Angel Gonzalez Ballester; Debora Gil |
Title |
A medial map capturing the essential geometry of organs |
Type |
Conference Article |
Year |
2012 |
Publication |
ISBI Workshop on Open Source Medical Image Analysis software |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
1691 - 1694 |
Keywords |
Medial Surface Representation, Volume Reconstruction,Geometry , Image reconstruction , Liver , Manifolds , Shape , Surface morphology , Surface reconstruction |
Abstract |
Medial representations are powerful tools for describing and parameterizing the volumetric shape of anatomical structures. Accurate computation of one pixel wide medial surfaces is mandatory. Those surfaces must represent faithfully the geometry of the volume. Although morphological methods produce excellent results in 2D, their complexity and quality drops across dimensions, due to a more complex description of pixel neighborhoods. This paper introduces a continuous operator for accurate and efficient computation of medial structures of arbitrary dimension. Our experiments show its higher performance for medical imaging applications in terms of simplicity of medial structures and capability for reconstructing the anatomical volume |
Address |
Barcelona,Spain |
Corporate Author |
|
Thesis |
|
Publisher |
IEEE |
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
1945-7928 |
ISBN |
978-1-4577-1857-1 |
Medium |
|
Area |
|
Expedition |
|
Conference |
ISBI |
Notes |
IAM |
Approved |
no |
Call Number |
IAM @ iam @ VGG2012a |
Serial |
1989 |
Permanent link to this record |
|
|
|
Author |
Olivier Penacchio; Xavier Otazu; Arnold J Wilkings; Sara M. Haigh |
Title |
A mechanistic account of visual discomfort |
Type |
Journal Article |
Year |
2023 |
Publication |
Frontiers in Neuroscience |
Abbreviated Journal |
FN |
Volume |
17 |
Issue |
|
Pages |
|
Keywords |
|
Abstract |
Much of the neural machinery of the early visual cortex, from the extraction of local orientations to contextual modulations through lateral interactions, is thought to have developed to provide a sparse encoding of contour in natural scenes, allowing the brain to process efficiently most of the visual scenes we are exposed to. Certain visual stimuli, however, cause visual stress, a set of adverse effects ranging from simple discomfort to migraine attacks, and epileptic seizures in the extreme, all phenomena linked with an excessive metabolic demand. The theory of efficient coding suggests a link between excessive metabolic demand and images that deviate from natural statistics. Yet, the mechanisms linking energy demand and image spatial content in discomfort remain elusive. Here, we used theories of visual coding that link image spatial structure and brain activation to characterize the response to images observers reported as uncomfortable in a biologically based neurodynamic model of the early visual cortex that included excitatory and inhibitory layers to implement contextual influences. We found three clear markers of aversive images: a larger overall activation in the model, a less sparse response, and a more unbalanced distribution of activity across spatial orientations. When the ratio of excitation over inhibition was increased in the model, a phenomenon hypothesised to underlie interindividual differences in susceptibility to visual discomfort, the three markers of discomfort progressively shifted toward values typical of the response to uncomfortable stimuli. Overall, these findings propose a unifying mechanistic explanation for why there are differences between images and between observers, suggesting how visual input and idiosyncratic hyperexcitability give rise to abnormal brain responses that result in visual stress. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
NEUROBIT |
Approved |
no |
Call Number |
Admin @ si @ POW2023 |
Serial |
3886 |
Permanent link to this record |
|
|
|
Author |
Gemma Sanchez; Josep Llados; K. Tombre |
Title |
A mean string algorithm to compute the average among a set of 2D shapes |
Type |
Journal Article |
Year |
2002 |
Publication |
Pattern Recognition Letters |
Abbreviated Journal |
PRL |
Volume |
23 |
Issue |
1-3 |
Pages |
203–214 |
Keywords |
|
Abstract |
|
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
DAG; IF: 0.409 |
Approved |
no |
Call Number |
DAG @ dag @ SLT2002 |
Serial |
275 |
Permanent link to this record |