Publicacions CVC -- Query Results

Details

Records
Author	Patricia Suarez; Angel Sappa; Boris X. Vintimilla
Title	Learning to Colorize Infrared Images			Type	Conference Article
Year	2017	Publication	15th International Conference on Practical Applications of Agents and Multi-Agent System	Abbreviated Journal
Volume		Issue		Pages
Keywords	CNN in multispectral imaging; Image colorization
Abstract	This paper focuses on near infrared (NIR) image colorization by using a Generative Adversarial Network (GAN) architecture model. The proposed architecture consists of two stages. Firstly, it learns to colorize the given input, resulting in an RGB image. Then, in the second stage, a discriminative model is used to estimate the probability that the generated image came from the training dataset rather than being automatically generated. The proposed model starts the learning process from scratch, because our set of images is very different from the dataset used in existing pre-trained models, so transfer learning strategies cannot be used. Infrared image colorization is an important problem when human perception needs to be considered, e.g., in remote sensing applications. Experimental results with a large set of real images are provided showing the validity of the proposed approach.
Address	Porto; Portugal; June 2017
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	PAAMS
Notes	ADAS; MSIAU; 600.086; 600.122; 600.118			Approved	no
Call Number	Admin @ si @			Serial	2919



Author	Vacit Oguz Yazici; Joost Van de Weijer; Arnau Ramisa
Title	Color Naming for Multi-Color Fashion Items			Type	Conference Article
Year	2018	Publication	6th World Conference on Information Systems and Technologies	Abbreviated Journal
Volume	747	Issue		Pages	64-73
Keywords	Deep learning; Color; Multi-label
Abstract	There exists a significant amount of research on color naming of single colored objects. However, in reality many fashion objects consist of multiple colors. Currently, searching in fashion datasets for multi-colored objects can be a laborious task. Therefore, in this paper we focus on color naming for images with multi-color fashion items. We collect a dataset consisting of images that may have from one up to four colors. We annotate the images with the 11 basic colors of the English language. We experiment with several designs for deep neural networks with different losses. We show that explicitly estimating the number of colors in the fashion item leads to improved results.
Address	Naples; March 2018
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	WORLDCIST
Notes	LAMP; 600.109; 601.309; 600.120			Approved	no
Call Number	Admin @ si @ YWR2018			Serial	3161



Author	Patricia Suarez; Angel Sappa; Boris X. Vintimilla
Title	Vegetation Index Estimation from Monospectral Images			Type	Conference Article
Year	2018	Publication	15th International Conference on Image Analysis and Recognition	Abbreviated Journal
Volume	10882	Issue		Pages	353-362
Keywords
Abstract	This paper proposes a novel approach to estimate the Normalized Difference Vegetation Index (NDVI) from just the red channel of an RGB image. The NDVI is defined as the ratio of the difference of the red and infrared radiances over their sum. In other words, information from the red channel of an RGB image and the corresponding infrared spectral band is required for its computation. In the current work the NDVI is estimated just from the red channel by training a Conditional Generative Adversarial Network (CGAN). The architecture proposed for the generative network consists of a single level structure, which combines at the final layer results from convolutional operations together with the given red channel with Gaussian noise to enhance details, resulting in a sharp NDVI image. Then, the discriminative model estimates the probability that the generated NDVI came from the training dataset rather than being automatically generated. Experimental results with a large set of real images are provided showing that a Conditional GAN single level model represents an acceptable approach to estimate the NDVI.
Address	Povoa de Varzim; Portugal; June 2018
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICIAR
Notes	MSIAU; 600.086; 600.130; 600.122			Approved	no
Call Number	Admin @ si @ SSV2018c			Serial	3196



Author	Santiago Segui; Laura Igual; Fernando Vilariño; Petia Radeva; Carolina Malagelada; Fernando Azpiroz; Jordi Vitria
Title	Diagnostic System for Intestinal Motility Disfunctions Using Video Capsule Endoscopy			Type	Book Chapter
Year	2008	Publication	Computer Vision Systems. 6th International	Abbreviated Journal
Volume	5008	Issue		Pages	251–260
Keywords
Abstract	Wireless Video Capsule Endoscopy is a clinical technique consisting of the analysis of images from the intestine which are provided by an ingestible device with a camera attached to it. In this paper we propose an automatic system to diagnose severe intestinal motility disfunctions using the video endoscopy data. The system is based on the application of computer vision techniques within a machine learning framework in order to obtain the characterization of diverse motility events from video sequences. We present experimental results that demonstrate the effectiveness of the proposed system and compare them with the ground-truth provided by the gastroenterologists.
Address	Santorini (Greece)
Corporate Author				Thesis
Publisher	Springer-Verlag	Place of Publication	Berlin Heidelberg	Editor	A. Gasteratos, M. Vincze, and J.K. Tsotsos
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN	978-3-540-79546-9	Medium
Area	800	Expedition		Conference	ICVS
Notes	OR; MV; MILAB; SIAI			Approved	no
Call Number	BCNPCL @ bcnpcl @ SIV2008; IAM @ iam @ SIV2008			Serial	962



Author	Pierluigi Casale; Oriol Pujol; Petia Radeva
Title	Face-to-face social activity detection using data collected with a wearable device			Type	Conference Article
Year	2009	Publication	4th Iberian Conference on Pattern Recognition and Image Analysis	Abbreviated Journal
Volume	5524	Issue		Pages	56–63
Keywords
Abstract	In this work the feasibility of building a socially aware badge that learns from user activities is explored. A wearable multisensor device has been prototyped for collecting data about user movements and photos of the environment where the user acts. Using motion data, speaking and other activities have been classified. Images have been analysed in order to complement motion data and help detect social behaviours. A face detector and an activity classifier are both used for detecting whether users engaged in a social activity while they wore the device. Good results encourage the improvement of the system at both hardware and software level.
Address	Póvoa de Varzim, Portugal
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-02171-8	Medium
Area		Expedition		Conference	IbPRIA
Notes	MILAB;HuPBA			Approved	no
Call Number	BCNPCL @ bcnpcl @ CPR2009b			Serial	1206



Author	David Aldavert; Ricardo Toledo; Arnau Ramisa; Ramon Lopez de Mantaras
Title	Efficient Object Pixel-Level Categorization using Bag of Features: Advances in Visual Computing			Type	Conference Article
Year	2009	Publication	5th International Symposium on Visual Computing	Abbreviated Journal
Volume	5875	Issue		Pages	44–55
Keywords
Abstract	In this paper we present a pixel-level object categorization method suitable to be applied under real-time constraints. Since pixels are categorized using a bag of features scheme, the major bottleneck of such an approach would be the feature pooling in local histograms of visual words. Therefore, we propose to bypass this time-consuming step and directly obtain the score from a linear Support Vector Machine classifier. This is achieved by creating an integral image of the components of the SVM which can readily obtain the classification score for any image sub-window with only 10 additions and 2 products, regardless of its size. Besides, we evaluated the performance of two efficient feature quantization methods: the Hierarchical K-Means and the Extremely Randomized Forest. All experiments have been done in the Graz02 database, showing comparable, or even better results to related work with a lower computational cost.
Address	Las Vegas, USA
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-10330-8	Medium
Area		Expedition		Conference	ISVC
Notes	ADAS			Approved	no
Call Number	Admin @ si @ ATR2009a			Serial	1246



Author	Jaume Gibert; Ernest Valveny; Horst Bunke
Title	Graph of Words Embedding for Molecular Structure-Activity Relationship Analysis			Type	Conference Article
Year	2010	Publication	15th Iberoamerican Congress on Pattern Recognition	Abbreviated Journal
Volume	6419	Issue		Pages	30–37
Keywords
Abstract	Structure-Activity relationship analysis aims at discovering chemical activity of molecular compounds based on their structure. In this article we make use of a particular graph representation of molecules and propose a new graph embedding procedure to solve the problem of structure-activity relationship analysis. The embedding is essentially an arrangement of a molecule in the form of a vector by considering frequencies of appearing atoms and frequencies of covalent bonds between them. Results on two benchmark databases show the effectiveness of the proposed technique in terms of recognition accuracy while avoiding high operational costs in the transformation.
Address	Sao Paulo, Brazil
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-16686-0	Medium
Area		Expedition		Conference	CIARP
Notes	DAG			Approved	no
Call Number	DAG @ dag @ GVB2010			Serial	1462



Author	Antonio Clavelli; Dimosthenis Karatzas; Josep Llados; Mario Ferraro; Giuseppe Boccignone
Title	Towards Modelling an Attention-Based Text Localization Process			Type	Conference Article
Year	2013	Publication	6th Iberian Conference on Pattern Recognition and Image Analysis	Abbreviated Journal
Volume	7887	Issue		Pages	296-303
Keywords	text localization; visual attention; eye guidance
Abstract	This note introduces a visual attention model of text localization in real-world scenes. The core of the model built upon the proto-object concept is discussed. It is shown how such dynamic mid-level representation of the scene can be derived in the framework of an action-perception loop engaging salience, text information value computation, and eye guidance mechanisms. Preliminary results that compare model generated scanpaths with those eye-tracked from human subjects are presented.
Address	Madeira; Portugal; June 2013
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-38627-5	Medium
Area		Expedition		Conference	IbPRIA
Notes	DAG			Approved	no
Call Number	Admin @ si @ CKL2013			Serial	2291



Author	Marçal Rusiñol; Josep Llados
Title	A Performance Evaluation Protocol for Symbol Spotting Systems in Terms of Recognition and Location Indices			Type	Journal Article
Year	2009	Publication	International Journal on Document Analysis and Recognition	Abbreviated Journal	IJDAR
Volume	12	Issue	2	Pages	83-96
Keywords	Performance evaluation; Symbol Spotting; Graphics Recognition
Abstract	Symbol spotting systems are intended to retrieve regions of interest from a document image database where the queried symbol is likely to be found. They shall have the ability to recognize and locate graphical symbols in a single step. In this paper, we present a set of measures to evaluate the performance of a symbol spotting system in terms of recognition abilities, location accuracy and scalability. We show that the proposed measures make it possible to determine the weaknesses and strengths of different methods. In particular we have tested a symbol spotting method based on a set of four different off-the-shelf shape descriptors.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1433-2833	ISBN		Medium
Area		Expedition		Conference
Notes	DAG			Approved	no
Call Number	DAG @ dag @ RuL2009a			Serial	1166



Author	Carme Julia; Angel Sappa; Felipe Lumbreras; Joan Serrat; Antonio Lopez
Title	An iterative multiresolution scheme for SFM with missing data			Type	Journal Article
Year	2009	Publication	Journal of Mathematical Imaging and Vision	Abbreviated Journal	JMIV
Volume	34	Issue	3	Pages	240–258
Keywords
Abstract	Several techniques have been proposed for tackling the Structure from Motion problem through factorization in the case of missing data. However, when the percentage of unknown data is high, most of them may not perform as well as expected. Focussing on this problem, an iterative multiresolution scheme, which aims at recovering missing entries in the originally given input matrix, is proposed. Information recovered following a coarse-to-fine strategy is used for filling in the missing entries. The objective is to recover, as much as possible, missing data in the given matrix. Thus, when a factorization technique is applied to the partially or totally filled in matrix, instead of to the originally given input one, better results will be obtained. An evaluation study about the robustness to missing and noisy data is reported. Experimental results obtained with synthetic and real video sequences are presented to show the viability of the proposed approach.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ADAS			Approved	no
Call Number	ADAS @ adas @ JSL2009a			Serial	1163



Author	Anastasios Doulamis; Nikolaos Doulamis; Marco Bertini; Jordi Gonzalez; Thomas B. Moeslund
Title	Introduction to the Special Issue on the Analysis and Retrieval of Events/Actions and Workflows in Video Streams			Type	Journal Article
Year	2016	Publication	Multimedia Tools and Applications	Abbreviated Journal	MTAP
Volume	75	Issue	22	Pages	14985-14990
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ISE; HUPBA			Approved	no
Call Number	Admin @ si @ DDB2016			Serial	2934



Author	W. Win; B. Bao; Q. Xu; Luis Herranz; Shuqiang Jiang
Title	Editorial Note: Efficient Multimedia Processing Methods and Applications			Type	Miscellaneous
Year	2019	Publication	Multimedia Tools and Applications	Abbreviated Journal	MTAP
Volume	78	Issue	1	Pages
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	LAMP; 600.141; 600.120			Approved	no
Call Number	Admin @ si @ WBX2019			Serial	3257



Author	Rahma Kalboussi; Aymen Azaza; Joost Van de Weijer; Mehrez Abdellaoui; Ali Douik
Title	Object proposals for salient object segmentation in videos			Type	Journal Article
Year	2020	Publication	Multimedia Tools and Applications	Abbreviated Journal	MTAP
Volume	79	Issue	13	Pages	8677-8693
Keywords
Abstract	Salient object segmentation in videos is generally broken up into a video segmentation part and a saliency assignment part. Recently, object proposals, which are used to segment the image, have had significant impact on many computer vision applications, including image segmentation, object detection, and more recently saliency detection in still images. However, their usage has not yet been evaluated for salient object segmentation in videos. Therefore, in this paper, we investigate the application of object proposals to salient object segmentation in videos. In addition, we propose a new motion feature derived from the optical flow structure tensor for video saliency detection. Experiments on two standard benchmark datasets for video saliency show that the proposed motion feature improves saliency estimation results, and that object proposals are an efficient method for salient object segmentation. Results on the challenging SegTrack v2 and Fukuchi benchmark data sets show that we significantly outperform the state-of-the-art.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	LAMP; 600.120			Approved	no
Call Number	KAW2020			Serial	3504



Author	Henry Velesaca; Gisel Bastidas-Guacho; Mohammad Rouhani; Angel Sappa
Title	Multimodal image registration techniques: a comprehensive survey			Type	Journal Article
Year	2024	Publication	Multimedia Tools and Applications	Abbreviated Journal	MTAP
Volume		Issue		Pages
Keywords
Abstract	This manuscript presents a review of state-of-the-art techniques proposed in the literature for multimodal image registration, addressing instances where images from different modalities need to be precisely aligned in the same reference system. This scenario arises when the images to be registered come from different modalities, such as the visible and thermal spectral bands, 3D-RGB, flash-no flash, or NIR-visible. The review spans different techniques from classical approaches to more modern ones based on deep learning, aiming to highlight the particularities required at each step in the registration pipeline when dealing with multimodal images. It is noteworthy that medical images are excluded from this review due to their specific characteristics, including the use of both active and passive sensors or the non-rigid nature of the body contained in the image.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	MSIAU			Approved	no
Call Number	Admin @ si @ VBR2024			Serial	3997



Author	Razieh Rastgoo; Kourosh Kiani; Sergio Escalera
Title	A transformer model for boundary detection in continuous sign language			Type	Journal Article
Year	2024	Publication	Multimedia Tools and Applications	Abbreviated Journal	MTAP
Volume		Issue		Pages
Keywords
Abstract	Sign Language Recognition (SLR) has garnered significant attention from researchers in recent years, particularly the intricate domain of Continuous Sign Language Recognition (CSLR), which presents heightened complexity compared to Isolated Sign Language Recognition (ISLR). One of the prominent challenges in CSLR pertains to accurately detecting the boundaries of isolated signs within a continuous video stream. Additionally, the reliance on handcrafted features in existing models poses a challenge to achieving optimal accuracy. To surmount these challenges, we propose a novel approach utilizing a Transformer-based model. Unlike traditional models, our approach focuses on enhancing accuracy while eliminating the need for handcrafted features. The Transformer model is employed for both ISLR and CSLR. The training process involves using isolated sign videos, where hand keypoint features extracted from the input video are enriched using the Transformer model. Subsequently, these enriched features are forwarded to the final classification layer. The trained model, coupled with a post-processing method, is then applied to detect isolated sign boundaries within continuous sign videos. The evaluation of our model, conducted on two distinct datasets that include both continuous signs and their corresponding isolated signs, demonstrates promising results.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	HUPBA			Approved	no
Call Number	Admin @ si @ RKE2024			Serial	4016