Publicacions CVC -- Query Results

[101–110] << 111 112 113 114 115 116 117 118 119 120 >> [121–130]

Details

Records
Author	Michael Teutsch; Angel Sappa; Riad I. Hammoud
Title	Cross-Spectral Image Processing			Type	Book Chapter
Year	2022	Publication	Computer Vision in the Infrared Spectrum. Synthesis Lectures on Computer Vision	Abbreviated Journal
Volume		Issue		Pages	23-34
Keywords
Abstract	Although this book is on IR computer vision and its main focus lies on IR image and video processing and analysis, a special attention is dedicated to cross-spectral image processing due to the increasing number of publications and applications in this domain. In these cross-spectral frameworks, IR information is used together with information from other spectral bands to tackle some specific problems by developing more robust solutions. Tasks considered for cross-spectral processing are for instance dehazing, segmentation, vegetation index estimation, or face recognition. This increasing number of applications is motivated by cross- and multi-spectral camera setups available already on the market like for example smartphones, remote sensing multispectral cameras, or multi-spectral cameras for automotive systems or drones. In this chapter, different cross-spectral image processing techniques will be reviewed together with possible applications. Initially, image registration approaches for the cross-spectral case are reviewed: the registration stage is the first image processing task, which is needed to align images acquired by different sensors within the same reference coordinate system. Then, recent cross-spectral image colorization approaches, which are intended to colorize infrared images for different applications are presented. Finally, the cross-spectral image enhancement problem is tackled by including guided super resolution techniques, image dehazing approaches, cross-spectral filtering and edge detection. Figure 3.1 illustrates cross-spectral image processing stages as well as their possible connections. Table 3.1 presents some of the available public cross-spectral datasets generally used as reference data to evaluate cross-spectral image registration, colorization, enhancement, or exploitation results.
Address
Corporate Author				Thesis
Publisher	Springer	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	SLCV
Series Volume		Series Issue		Edition
ISSN		ISBN	978-3-031-00698-2	Medium
Area		Expedition		Conference
Notes	MSIAU; MACO			Approved	no
Call Number	Admin @ si @ TSH2022b			Serial	3805
Permanent link to this record



Author	Jaume Gibert; Ernest Valveny; Horst Bunke
Title	Dimensionality Reduction for Graph of Words Embedding			Type	Conference Article
Year	2011	Publication	8th IAPR-TC-15 International Workshop. Graph-Based Representations in Pattern Recognition	Abbreviated Journal
Volume	6658	Issue		Pages	22-31
Keywords
Abstract	The Graph of Words Embedding consists in mapping every graph of a given dataset to a feature vector by counting unary and binary relations between node attributes of the graph. While it shows good properties in classification problems, it suffers from high dimensionality and sparsity. These two issues are addressed in this article. Two well-known techniques for dimensionality reduction, kernel principal component analysis (kPCA) and independent component analysis (ICA), are applied to the embedded graphs. We discuss their performance compared to the classification of the original vectors on three different public databases of graphs.
Address	Münster, Germany
Corporate Author				Thesis
Publisher		Place of Publication		Editor	Xiaoyi Jiang; Miquel Ferrer; Andrea Torsello
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN	978-3-642-20843-0	Medium
Area		Expedition		Conference	GbRPR
Notes	DAG			Approved	no
Call Number	Admin @ si @ GVB2011a			Serial	1743
Permanent link to this record



Author	Kaida Xiao; Chenyang Fu; D.Mylonas; Dimosthenis Karatzas; S. Wuerger
Title	Unique Hue Data for Colour Appearance Models. Part ii: Chromatic Adaptation Transform			Type	Journal Article
Year	2013	Publication	Color Research & Application	Abbreviated Journal	CRA
Volume	38	Issue	1	Pages	22-29
Keywords
Abstract	Unique hue settings of 185 observers under three room-lighting conditions were used to evaluate the accuracy of full and mixed chromatic adaptation transform models of CIECAM02 in terms of unique hue reproduction. Perceptual hue shifts in CIECAM02 were evaluated for both models with no clear difference using the current Commission Internationale de l'Éclairage (CIE) recommendation for mixed chromatic adaptation ratio. Using our large dataset of unique hue data as a benchmark, an optimised parameter is proposed for chromatic adaptation under mixed illumination conditions that produces more accurate results in unique hue reproduction. © 2011 Wiley Periodicals, Inc. Col Res Appl, 2013
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	DAG			Approved	no
Call Number	Admin @ si @ XFM2013			Serial	1822
Permanent link to this record



Author	Mohammad Rouhani; Angel Sappa; E. Boyer
Title	Implicit B-Spline Surface Reconstruction			Type	Journal Article
Year	2015	Publication	IEEE Transactions on Image Processing	Abbreviated Journal	TIP
Volume	24	Issue	1	Pages	22 - 32
Keywords
Abstract	This paper presents a fast and flexible curve, and surface reconstruction technique based on implicit B-spline. This representation does not require any parameterization and it is locally supported. This fact has been exploited in this paper to propose a reconstruction technique through solving a sparse system of equations. This method is further accelerated to reduce the dimension to the active control lattice. Moreover, the surface smoothness and user interaction are allowed for controlling the surface. Finally, a novel weighting technique has been introduced in order to blend small patches and smooth them in the overlapping regions. The whole framework is very fast and efficient and can handle large cloud of points with very low computational cost. The experimental results show the flexibility and accuracy of the proposed algorithm to describe objects with complex topologies. Comparisons with other fitting methods highlight the superiority of the proposed approach in the presence of noise and missing data.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1057-7149	ISBN		Medium
Area		Expedition		Conference
Notes	ADAS; 600.076			Approved	no
Call Number	Admin @ si @ RSB2015			Serial	2541
Permanent link to this record



Author	Mohammad Ali Bagheri; Qigang Gao; Sergio Escalera; Albert Clapes; Kamal Nasrollahi; Michael Holte; Thomas B. Moeslund
Title	Keep it Accurate and Diverse: Enhancing Action Recognition Performance by Ensemble Learning			Type	Conference Article
Year	2015	Publication	IEEE Conference on Computer Vision and Pattern Recognition Worshops (CVPRW)	Abbreviated Journal
Volume		Issue		Pages	22-29
Keywords
Abstract	The performance of different action recognition techniques has recently been studied by several computer vision researchers. However, the potential improvement in classification through classifier fusion by ensemble-based methods has remained unattended. In this work, we evaluate the performance of an ensemble of action learning techniques, each performing the recognition task from a different perspective. The underlying idea is that instead of aiming a very sophisticated and powerful representation/learning technique, we can learn action categories using a set of relatively simple and diverse classifiers, each trained with different feature set. In addition, combining the outputs of several learners can reduce the risk of an unfortunate selection of a learner on an unseen action recognition scenario. This leads to having a more robust and general-applicable framework. In order to improve the recognition performance, a powerful combination strategy is utilized based on the Dempster-Shafer theory, which can effectively make use of diversity of base learners trained on different sources of information. The recognition results of the individual classifiers are compared with those obtained from fusing the classifiers’ output, showing enhanced performance of the proposed methodology.
Address	Boston; EEUU; June 2015
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	CVPRW
Notes	HuPBA;MILAB			Approved	no
Call Number	Admin @ si @ BGE2015			Serial	2655
Permanent link to this record



Author	Xim Cerda-Company; Xavier Otazu
Title	Color induction in equiluminant flashed stimuli			Type	Journal Article
Year	2019	Publication	Journal of the Optical Society of America A	Abbreviated Journal	JOSA A
Volume	36	Issue	1	Pages	22-31
Keywords
Abstract	Color induction is the influence of the surrounding color (inducer) on the perceived color of a central region. There are two different types of color induction: color contrast (the color of the central region shifts away from that of the inducer) and color assimilation (the color shifts towards the color of the inducer). Several studies on these effects have used uniform and striped surrounds, reporting color contrast and color assimilation, respectively. Other authors [J. Vis. 12(1), 22 (2012) [CrossRef] ] have studied color induction using flashed uniform surrounds, reporting that the contrast is higher for shorter flash duration. Extending their study, we present new psychophysical results using both flashed and static (i.e., non-flashed) equiluminant stimuli for both striped and uniform surrounds. Similarly to them, for uniform surround stimuli we observed color contrast, but we did not obtain the maximum contrast for the shortest (10 ms) flashed stimuli, but for 40 ms. We only observed this maximum contrast for red, green, and lime inducers, while for a purple inducer we obtained an asymptotic profile along the flash duration. For striped stimuli, we observed color assimilation only for the static (infinite flash duration) red–green surround inducers (red first inducer, green second inducer). For the other inducers’ configurations, we observed color contrast or no induction. Since other studies showed that non-equiluminant striped static stimuli induce color assimilation, our results also suggest that luminance differences could be a key factor to induce it.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	NEUROBIT; 600.120; 600.128			Approved	no
Call Number	Admin @ si @ CeO2019			Serial	3226
Permanent link to this record



Author	Ariel Amato; Angel Sappa; Alicia Fornes; Felipe Lumbreras; Josep Llados
Title	Divide and Conquer: Atomizing and Parallelizing A Task in A Mobile Crowdsourcing Platform			Type	Conference Article
Year	2013	Publication	2nd International ACM Workshop on Crowdsourcing for Multimedia	Abbreviated Journal
Volume		Issue		Pages	21-22
Keywords
Abstract	In this paper we present some conclusions about the advantages of having an efficient task formulation when a crowdsourcing platform is used. In particular we show how the task atomization and distribution can help to obtain results in an efficient way. Our proposal is based on a recursive splitting of the original task into a set of smaller and simpler tasks. As a result both more accurate and faster solutions are obtained. Our evaluation is performed on a set of ancient documents that need to be digitized.
Address	Barcelona; October 2013
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-4503-2396-3	Medium
Area		Expedition		Conference	CrowdMM
Notes	ADAS; ISE; DAG; 600.054; 600.055; 600.045; 600.061; 602.006			Approved	no
Call Number	Admin @ si @ SLA2013			Serial	2335
Permanent link to this record



Author	Sergio Escalera
Title	Multi-Modal Human Behaviour Analysis from Visual Data Sources			Type	Journal
Year	2013	Publication	ERCIM News journal	Abbreviated Journal	ERCIM
Volume	95	Issue		Pages	21-22
Keywords
Abstract	The Human Pose Recovery and Behaviour Analysis group (HuPBA), University of Barcelona, is developing a line of research on multi-modal analysis of humans in visual data. The novel technology is being applied in several scenarios with high social impact, including sign language recognition, assisted technology and supported diagnosis for the elderly and people with mental/physical disabilities, fitness conditioning, and Human Computer Interaction.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	0926-4981	ISBN		Medium
Area		Expedition		Conference
Notes	HuPBA;MILAB			Approved	no
Call Number	Admin @ si @ Esc2013			Serial	2361
Permanent link to this record



Author	Miguel Angel Bautista; Sergio Escalera; Xavier Baro; Oriol Pujol; Jordi Vitria; Petia Radeva
Title	On the Design of Low Redundancy Error-Correcting Output Codes			Type	Book Chapter
Year	2011	Publication	Ensembles in Machine Learning Applications	Abbreviated Journal
Volume	373	Issue	2	Pages	21-38
Keywords
Abstract	The classification of large number of object categories is a challenging trend in the Pattern Recognition field. In the literature, this is often addressed using an ensemble of classifiers . In this scope, the Error-Correcting Output Codes framework has demonstrated to be a powerful tool for combining classifiers. However, most of the state-of-the-art ECOC approaches use a linear or exponential number of classifiers, making the discrimination of a large number of classes unfeasible. In this paper, we explore and propose a compact design of ECOC in terms of the number of classifiers. Evolutionary computation is used for tuning the parameters of the classifiers and looking for the best compact ECOC code configuration. The results over several public UCI data sets and different multi-class Computer Vision problems show that the proposed methodology obtains comparable (even better) results than the state-of-the-art ECOC methodologies with far less number of dichotomizers.
Address
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1860-949X	ISBN	978-3-642-22909-1	Medium
Area		Expedition		Conference
Notes	MILAB; OR;HuPBA;MV			Approved	no
Call Number	Admin @ si @ BEB2011b			Serial	1886
Permanent link to this record



Author	Md. Mostafa Kamal Sarker; Hatem A. Rashwan; Farhan Akram; Syeda Furruka Banu; Adel Saleh; Vivek Kumar Singh; Forhad U. H. Chowdhury; Saddam Abdulwahab; Santiago Romani; Petia Radeva; Domenec Puig
Title	SLSDeep: Skin Lesion Segmentation Based on Dilated Residual and Pyramid Pooling Networks.			Type	Conference Article
Year	2018	Publication	21st International Conference on Medical Image Computing & Computer Assisted Intervention	Abbreviated Journal
Volume	2	Issue		Pages	21-29
Keywords
Abstract	Skin lesion segmentation (SLS) in dermoscopic images is a crucial task for automated diagnosis of melanoma. In this paper, we present a robust deep learning SLS model, so-called SLSDeep, which is represented as an encoder-decoder network. The encoder network is constructed by dilated residual layers, in turn, a pyramid pooling network followed by three convolution layers is used for the decoder. Unlike the traditional methods employing a cross-entropy loss, we investigated a loss function by combining both Negative Log Likelihood (NLL) and End Point Error (EPE) to accurately segment the melanoma regions with sharp boundaries. The robustness of the proposed model was evaluated on two public databases: ISBI 2016 and 2017 for skin lesion analysis towards melanoma detection challenge. The proposed model outperforms the state-of-the-art methods in terms of segmentation accuracy. Moreover, it is capable to segment more than 100 images of size 384x384 per second on a recent GPU.
Address	Granada; Espanya; September 2018
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	MICCAI
Notes	MILAB; no proj			Approved	no
Call Number	Admin @ si @ SRA2018			Serial	3112
Permanent link to this record



Author	Pau Rodriguez; Miguel Angel Bautista; Sergio Escalera; Jordi Gonzalez
Title	Beyond Oneshot Encoding: lower dimensional target embedding			Type	Journal Article
Year	2018	Publication	Image and Vision Computing	Abbreviated Journal	IMAVIS
Volume	75	Issue		Pages	21-31
Keywords	Error correcting output codes; Output embeddings; Deep learning; Computer vision
Abstract	Target encoding plays a central role when learning Convolutional Neural Networks. In this realm, one-hot encoding is the most prevalent strategy due to its simplicity. However, this so widespread encoding schema assumes a flat label space, thus ignoring rich relationships existing among labels that can be exploited during training. In large-scale datasets, data does not span the full label space, but instead lies in a low-dimensional output manifold. Following this observation, we embed the targets into a low-dimensional space, drastically improving convergence speed while preserving accuracy. Our contribution is two fold: (i) We show that random projections of the label space are a valid tool to find such lower dimensional embeddings, boosting dramatically convergence rates at zero computational cost; and (ii) we propose a normalized eigenrepresentation of the class manifold that encodes the targets with minimal information loss, improving the accuracy of random projections encoding while enjoying the same convergence rates. Experiments on CIFAR-100, CUB200-2011, Imagenet, and MIT Places demonstrate that the proposed approach drastically improves convergence speed while reaching very competitive accuracy rates.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ISE; HuPBA; 600.098; 602.133; 602.121; 600.119			Approved	no
Call Number	Admin @ si @ RBE2018			Serial	3120
Permanent link to this record



Author	F.Negin; Pau Rodriguez; M.Koperski; A.Kerboua; Jordi Gonzalez; J.Bourgeois; E.Chapoulie; P.Robert; F.Bremond
Title	PRAXIS: Towards automatic cognitive assessment using gesture recognition			Type	Journal Article
Year	2018	Publication	Expert Systems with Applications	Abbreviated Journal	ESWA
Volume	106	Issue		Pages	21-35
Keywords
Abstract	Praxis test is a gesture-based diagnostic test which has been accepted as diagnostically indicative of cortical pathologies such as Alzheimer’s disease. Despite being simple, this test is oftentimes skipped by the clinicians. In this paper, we propose a novel framework to investigate the potential of static and dynamic upper-body gestures based on the Praxis test and their potential in a medical framework to automatize the test procedures for computer-assisted cognitive assessment of older adults. In order to carry out gesture recognition as well as correctness assessment of the performances we have recollected a novel challenging RGB-D gesture video dataset recorded by Kinect v2, which contains 29 specific gestures suggested by clinicians and recorded from both experts and patients performing the gesture set. Moreover, we propose a framework to learn the dynamics of upper-body gestures, considering the videos as sequences of short-term clips of gestures. Our approach first uses body part detection to extract image patches surrounding the hands and then, by means of a fine-tuned convolutional neural network (CNN) model, it learns deep hand features which are then linked to a long short-term memory to capture the temporal dependencies between video frames. We report the results of four developed methods using different modalities. The experiments show effectiveness of our deep learning based approach in gesture recognition and performance assessment tasks. Satisfaction of clinicians from the assessment reports indicates the impact of framework corresponding to the diagnosis.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ISE			Approved	no
Call Number	Admin @ si @ NRK2018			Serial	3669
Permanent link to this record



Author	Juan Ramon Terven Salinas; Joaquin Salas; Bogdan Raducanu
Title	Estado del Arte en Sistemas de Vision Artificial para Personas Invidentes			Type	Journal
Year	2013	Publication	Komputer Sapiens	Abbreviated Journal	KS
Volume	1	Issue		Pages	20-25
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	OR;MV			Approved	no
Call Number	Admin @ si @ TSR2013			Serial	2231
Permanent link to this record



Author	Alicia Fornes; Josep Llados; Oriol Ramos Terrades; Marçal Rusiñol
Title	La Visió per Computador com a Eina per a la Interpretació Automàtica de Fonts Documentals			Type	Journal
Year	2016	Publication	Lligall, Revista Catalana d'Arxivística	Abbreviated Journal
Volume	39	Issue		Pages	20-46
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	DAG; 600.097			Approved	no
Call Number	Admin @ si @ FLR2016			Serial	2897
Permanent link to this record



Author	Antonio Clavelli; Dimosthenis Karatzas; Josep Llados
Title	A framework for the assessment of text extraction algorithms on complex colour images			Type	Conference Article
Year	2010	Publication	9th IAPR International Workshop on Document Analysis Systems	Abbreviated Journal
Volume		Issue		Pages	19–26
Keywords
Abstract	The availability of open, ground-truthed datasets and clear performance metrics is a crucial factor in the development of an application domain. The domain of colour text image analysis (real scenes, Web and spam images, scanned colour documents) has traditionally suffered from a lack of a comprehensive performance evaluation framework. Such a framework is extremely difficult to specify, and corresponding pixel-level accurate information tedious to define. In this paper we discuss the challenges and technical issues associated with developing such a framework. Then, we describe a complete framework for the evaluation of text extraction methods at multiple levels, provide a detailed ground-truth specification and present a case study on how this framework can be used in a real-life situation.
Address	Boston; USA;
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-1-60558-773-8	Medium
Area		Expedition		Conference	DAS
Notes	DAG			Approved	no
Call Number	DAG @ dag @ CKL2010			Serial	1432
Permanent link to this record