Publicacions CVC -- Query Results

Author	David Masip; Jordi Vitria
Title	Shared Feature Extraction for Nearest Neighbor Face Recognition			Type	Journal
Year	2008	Publication	IEEE Transactions on Neural Networks	Abbreviated Journal
Volume	19	Issue	4	Pages	586–595
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	OR;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ MaV2008			Serial	944



Author	M. Campos-Taberner; Adriana Romero; Carlo Gatta; Gustavo Camps-Valls
Title	Shared feature representations of LiDAR and optical images: Trading sparsity for semantic discrimination			Type	Conference Article
Year	2015	Publication	IEEE International Geoscience and Remote Sensing Symposium IGARSS2015	Abbreviated Journal
Volume		Issue		Pages	4169–4172
Keywords
Abstract	This paper studies the level of complementary information conveyed by extremely high resolution LiDAR and optical images. We pursue this goal following an indirect approach via unsupervised spatial-spectral feature extraction. We used a recently presented unsupervised convolutional neural network trained to enforce both population and lifetime sparsity in the feature representation. We derived independent and joint feature representations, and analyzed the sparsity scores and the discriminative power. Interestingly, the obtained results revealed that the RGB+LiDAR representation is no longer sparse, and the derived basis functions merge color and elevation yielding a set of more expressive colored edge filters. The joint feature representation is also more discriminative when used for clustering and topological data visualization.
Address	Milan; Italy; July 2015
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	IGARSS
Notes	LAMP; 600.079;MILAB			Approved	no
Call Number	Admin @ si @ CRG2015			Serial	2724



Author	Asma Bensalah; Pau Riba; Alicia Fornes; Josep Llados
Title	Shoot less and Sketch more: An Efficient Sketch Classification via Joining Graph Neural Networks and Few-shot Learning			Type	Conference Article
Year	2019	Publication	13th IAPR International Workshop on Graphics Recognition	Abbreviated Journal
Volume		Issue		Pages	80-85
Keywords	Sketch classification; Convolutional Neural Network; Graph Neural Network; Few-shot learning
Abstract	With the emergence of touchpad devices and drawing tablets, a new era of sketching has started afresh. However, the recognition of sketches is still a tough task due to the variability of drawing styles. Moreover, in some application scenarios there is little labelled data available for training, which imposes a limitation on deep learning architectures. In addition, in many cases there is a need to generate models able to adapt to new classes. In order to cope with these limitations, we propose a method based on few-shot learning and graph neural networks for classifying sketches, aiming for an efficient neural model. We test our approach with several databases of sketches, showing promising results.
Address	Sydney; Australia; September 2019
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	GREC
Notes	DAG; 600.140; 601.302; 600.121			Approved	no
Call Number	Admin @ si @ BRF2019			Serial	3354



Author	Miguel Oliveira; V.Santos; Angel Sappa
Title	Short term path planning using a multiple hypothesis evaluation approach for an autonomous driving competition			Type	Conference Article
Year	2012	Publication	IEEE 4th Workshop on Planning, Perception and Navigation for Intelligent Vehicles	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Algarve; Portugal
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	PPNIV
Notes	ADAS			Approved	no
Call Number	Admin @ si @ OSS2012c			Serial	2159



Author	J.M. Sanchez; X. Binefa; Jordi Vitria
Title	Shot Partitioning Based Recognition of TV Commercials			Type	Journal
Year	2002	Publication	Multimedia Tools and Applications, 18: 233–247, Kluwer Academic Publishers (IF: 0.421)	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	OR;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ SBV2002			Serial	274



Author	Alicia Fornes; Xavier Otazu; Josep Llados
Title	Show through cancellation and image enhancement by multiresolution contrast processing			Type	Conference Article
Year	2013	Publication	12th International Conference on Document Analysis and Recognition	Abbreviated Journal
Volume		Issue		Pages	200-204
Keywords
Abstract	Historical documents suffer from different types of degradation and noise such as background variation, uneven illumination or dark spots. In the case of double-sided documents, another common problem is that the back side of the document usually interferes with the front side because of the transparency of the document or ink bleeding. This effect is called the show-through phenomenon. Many methods have been developed to solve these problems, and in the case of show-through, by scanning and matching both the front and back sides of the document. In contrast, our approach is designed to use only one side of the scanned document. We hypothesize that show-through components are low contrast, while foreground components are high contrast. A Multiresolution Contrast (MC) decomposition is presented in order to estimate the contrast of features at different spatial scales. We cancel the show-through phenomenon by thresholding these low contrast components. This decomposition is also able to enhance the image, removing shadowed areas by weighting spatial scales. Results show that the enhanced images improve the readability of the documents, allowing scholars both to recover unreadable words and to resolve ambiguities.
Address	Washington; USA; August 2013
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1520-5363	ISBN		Medium
Area		Expedition		Conference	ICDAR
Notes	DAG; 602.006; 600.045; 600.061; 600.052;CIC			Approved	no
Call Number	Admin @ si @ FOL2013			Serial	2241



Author	Khanh Nguyen; Ali Furkan Biten; Andres Mafla; Lluis Gomez; Dimosthenis Karatzas
Title	Show, Interpret and Tell: Entity-Aware Contextualised Image Captioning in Wikipedia			Type	Conference Article
Year	2023	Publication	Proceedings of the 37th AAAI Conference on Artificial Intelligence	Abbreviated Journal
Volume	37	Issue	2	Pages	1940-1948
Keywords
Abstract	Humans exploit prior knowledge to describe images, and are able to adapt their explanation to the specific contextual information given, even to the extent of inventing plausible explanations when contextual information and images do not match. In this work, we propose the novel task of captioning Wikipedia images by integrating contextual knowledge. Specifically, we produce models that jointly reason over Wikipedia articles, Wikimedia images and their associated descriptions to produce contextualized captions. The same Wikimedia image can be used to illustrate different articles, and the produced caption needs to be adapted to the specific context, allowing us to explore the limits of the model to adjust captions to different contextual information. Dealing with out-of-dictionary words and Named Entities is a challenging task in this domain. To address this, we propose a pre-training objective, Masked Named Entity Modeling (MNEM), and show that this pretext task results in significantly improved models. Furthermore, we verify that a model pre-trained in Wikipedia generalizes well to News Captioning datasets. We further define two different test splits according to the difficulty of the captioning task. We offer insights on the role and the importance of each modality and highlight the limitations of our model.
Address	Washington; USA; February 2023
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	AAAI
Notes	DAG			Approved	no
Call Number	Admin @ si @ NBM2023			Serial	3860



Author	David Berga; Xose R. Fernandez-Vidal; Xavier Otazu; Xose M. Pardo
Title	SID4VAM: A Benchmark Dataset with Synthetic Images for Visual Attention Modeling			Type	Conference Article
Year	2019	Publication	18th IEEE International Conference on Computer Vision	Abbreviated Journal
Volume		Issue		Pages	8788-8797
Keywords
Abstract	A benchmark of saliency models performance with a synthetic image dataset is provided. Model performance is evaluated through saliency metrics as well as the influence of model inspiration and consistency with human psychophysics. SID4VAM is composed of 230 synthetic images, with known salient regions. Images were generated with 15 distinct types of low-level features (e.g. orientation, brightness, color, size...) with a target-distractor popout type of synthetic patterns. We have used Free-Viewing and Visual Search task instructions and 7 feature contrasts for each feature category. Our study reveals that state-of-the-art Deep Learning saliency models do not perform well with synthetic pattern images; instead, models with Spectral/Fourier inspiration outperform others in saliency metrics and are more consistent with human psychophysical experimentation. This study proposes a new way to evaluate saliency models in the forthcoming literature, accounting for synthetic images with uniquely low-level feature contexts, distinct from previous eye tracking image datasets.
Address	Seoul; Korea; October 2019
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICCV
Notes	NEUROBIT; 600.128			Approved	no
Call Number	Admin @ si @ BFO2019b			Serial	3372



Author	Michal Drozdzal; Laura Igual; Jordi Vitria; Petia Radeva; Carolina Malagelada; Fernando Azpiroz
Title	SIFT flow-based Sequences Alignment			Type	Conference Article
Year	2010	Publication	Medical Image Computing in Catalunya: Graduate Student Workshop	Abbreviated Journal
Volume		Issue		Pages	7–8
Keywords
Abstract
Address	Girona, Spain
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	MICCAT
Notes	OR;MILAB;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ DIV2010			Serial	1475



Author	Razieh Rastgoo; Kourosh Kiani; Sergio Escalera; Mohammad Sabokrou
Title	Sign Language Production: A Review			Type	Conference Article
Year	2021	Publication	Conference on Computer Vision and Pattern Recognition Workshops	Abbreviated Journal
Volume		Issue		Pages	3472-3481
Keywords
Abstract	Sign Language is the dominant yet non-primary form of communication language used in the deaf and hearing-impaired community. To make an easy and mutual communication between the hearing-impaired and the hearing communities, building a robust system capable of translating the spoken language into sign language and vice versa is fundamental. To this end, sign language recognition and production are two necessary parts for making such a two-way system. Sign language recognition and production need to cope with some critical challenges. In this survey, we review recent advances in Sign Language Production (SLP) and related areas using deep learning. This survey aims to briefly summarize recent achievements in SLP, discussing their advantages, limitations, and future directions of research.
Address	Virtual; June 2021
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	CVPRW
Notes	HUPBA; no proj			Approved	no
Call Number	Admin @ si @ RKE2021b			Serial	3603



Author	Razieh Rastgoo; Kourosh Kiani; Sergio Escalera
Title	Sign Language Recognition: A Deep Survey			Type	Journal Article
Year	2021	Publication	Expert Systems With Applications	Abbreviated Journal	ESWA
Volume	164	Issue		Pages	113794
Keywords
Abstract	Sign language, as a different form of the communication language, is important to large groups of people in society. There are different signs in each sign language with variability in hand shape, motion profile, and position of the hand, face, and body parts contributing to each sign. So, visual sign language recognition is a complex research area in computer vision. Many models have been proposed by different researchers with significant improvement by deep learning approaches in recent years. In this survey, we review the vision-based proposed models of sign language recognition using deep learning approaches from the last five years. While the overall trend of the proposed models indicates a significant improvement in recognition accuracy in sign language recognition, there are some challenges yet that need to be solved. We present a taxonomy to categorize the proposed models for isolated and continuous sign language recognition, discussing applications, datasets, hybrid models, complexity, and future lines of research in the field.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	HUPBA; no proj			Approved	no
Call Number	Admin @ si @ RKE2021a			Serial	3521



Author	Sounak Dey; Anjan Dutta; Juan Ignacio Toledo; Suman Ghosh; Josep Llados; Umapada Pal
Title	SigNet: Convolutional Siamese Network for Writer Independent Offline Signature Verification			Type	Miscellaneous
Year	2018	Publication	Arxiv	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract	Offline signature verification is one of the most challenging tasks in biometrics and document forensics. Unlike other verification problems, it needs to model minute but critical details between genuine and forged signatures, because a skilled falsification might often resemble the real signature with small deformation. This verification task is even harder in writer independent scenarios which is undeniably fiscal for realistic cases. In this paper, we model an offline writer independent signature verification task with a convolutional Siamese network. Siamese networks are twin networks with shared weights, which can be trained to learn a feature space where similar observations are placed in proximity. This is achieved by exposing the network to a pair of similar and dissimilar observations and minimizing the Euclidean distance between similar pairs while simultaneously maximizing it between dissimilar pairs. Experiments conducted on cross-domain datasets emphasize the capability of our network to model forgery in different languages (scripts) and handwriting styles. Moreover, our designed Siamese network, named SigNet, exceeds the state-of-the-art results on most of the benchmark signature datasets, which paves the way for further research in this direction.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	DAG; 600.097; 600.121			Approved	no
Call Number	Admin @ si @ DDT2018			Serial	3085



Author	Agnes Borras; Josep Llados
Title	Similarity-Based Object Retrieval Using Appearance and Geometric Feature Combination			Type	Book Chapter
Year	2007	Publication	3rd Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2007), J. Marti et al. (Eds.) LNCS 4477:113–120	Abbreviated Journal	LNCS
Volume	4478	Issue		Pages	33–39
Keywords
Abstract	This work presents a general-purpose content-based image retrieval system that deals with cluttered scenes containing a given query object. The system is flexible enough to work from a single image of an object despite its rotation, translation and scale variations. The image content is divided into parts that are described with a combination of features based on geometrical and color properties. The idea behind the feature combination is to benefit from a fuzzy similarity computation that provides robustness and tolerance to the retrieval process. The features can be independently computed and the image parts can be easily indexed by using a table structure on every feature value. Finally, a process inspired by alignment strategies is used to check the coherence of the object parts found in a scene. Our work presents a system of easy implementation that uses an open set of features and can suit a wide variety of applications.
Address	Girona (Spain)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-3-540-72848-1	Medium
Area		Expedition		Conference
Notes	DAG;			Approved	no
Call Number	DAG @ dag @ BoL2007a; IAM @ iam @ BoL2007a			Serial	776



Author	Shiqi Yang; Kai Wang; Luis Herranz; Joost Van de Weijer
Title	Simple and effective localized attribute representations for zero-shot learning			Type	Miscellaneous
Year	2020	Publication	Arxiv	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract	arXiv:2006.05938. Zero-shot learning (ZSL) aims to discriminate images from unseen classes by exploiting relations to seen classes via their semantic descriptions. Some recent papers have shown the importance of localized features together with fine-tuning the feature extractor to obtain discriminative and transferable features. However, these methods require complex attention or part detection modules to perform explicit localization in the visual space. In contrast, in this paper we propose localizing representations in the semantic/attribute space, with a simple but effective pipeline where localization is implicit. Focusing on attribute representations, we show that our method obtains state-of-the-art performance on CUB and SUN datasets, and also achieves competitive results on AWA2 dataset, outperforming generally more complex methods with explicit localization in the visual space. Our method can be implemented easily and can be used as a new baseline for zero-shot learning. In addition, our localized representations are highly interpretable as attribute-specific heatmaps.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	LAMP; 600.120			Approved	no
Call Number	Admin @ si @ YWH2020			Serial	3542



Author	Cristhian Aguilera; M.Ramos; Angel Sappa
Title	Simulated Annealing: A Novel Application of Image Processing in the Wood Area			Type	Book Chapter
Year	2012	Publication	Simulated Annealing – Advances, Applications and Hybridizations	Abbreviated Journal
Volume		Issue		Pages	91-104
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor	Marcos de Sales Guerra Tsuzuki
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-953-51-0710-1	Medium
Area		Expedition		Conference
Notes	ADAS			Approved	no
Call Number	Admin @ si @ ARS2012			Serial	2156