Author: Shida Beigpour; Christian Riess; Joost van de Weijer; Elli Angelopoulou
Title: Multi-Illuminant Estimation with Conditional Random Fields
Type: Journal Article
Year: 2014
Publication: IEEE Transactions on Image Processing (TIP)
Volume: 23; Issue: 1; Pages: 83-95
Keywords: color constancy; CRF; multi-illuminant
Abstract: Most existing color constancy algorithms assume uniform illumination. However, in real-world scenes this is often not the case. We therefore propose a novel framework for estimating the colors of multiple illuminants and their spatial distribution in the scene. We formulate this problem as an energy minimization task within a conditional random field over a set of local illuminant estimates. To evaluate the proposed method quantitatively, we created a novel dataset of two-dominant-illuminant images comprising laboratory, indoor, and outdoor scenes. Unlike prior work, our database includes accurate pixel-wise ground-truth illuminant information. The performance of our method is evaluated on multiple datasets. Experimental results show that our framework clearly outperforms single-illuminant estimators as well as a recently proposed multi-illuminant estimation approach.
ISSN: 1057-7149
Notes: CIC; LAMP; 600.074; 600.079
Call Number: Admin @ si @ BRW2014; Serial: 2451
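
A minimal sketch of the CRF idea summarized in this abstract: each image patch gets a local illuminant estimate (plain gray-world here), and a grid CRF then assigns each patch one of K candidate illuminants by minimizing a unary term (distance to the local estimate) plus a Potts smoothness term. This is not the paper's exact energy or optimizer; the patch size, K, lambda, and the use of ICM are assumptions for illustration.

```python
import numpy as np

def local_grayworld(img, patch=32):
    """Per-patch illuminant estimates (normalized RGB means)."""
    h, w, _ = img.shape
    gh, gw = h // patch, w // patch
    est = np.zeros((gh, gw, 3))
    for i in range(gh):
        for j in range(gw):
            p = img[i*patch:(i+1)*patch, j*patch:(j+1)*patch]
            m = p.reshape(-1, 3).mean(axis=0)
            est[i, j] = m / (np.linalg.norm(m) + 1e-8)
    return est

def crf_illuminants(est, candidates, lam=0.5, iters=10):
    """ICM over a grid CRF: unary = distance to the local estimate,
    pairwise = Potts penalty between 4-connected neighbours."""
    gh, gw, _ = est.shape
    K = len(candidates)
    unary = np.stack([1.0 - est @ c for c in candidates], axis=-1)  # (gh, gw, K)
    labels = unary.argmin(axis=-1)
    for _ in range(iters):
        for i in range(gh):
            for j in range(gw):
                cost = unary[i, j].copy()
                for di, dj in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                    ni, nj = i + di, j + dj
                    if 0 <= ni < gh and 0 <= nj < gw:
                        cost += lam * (np.arange(K) != labels[ni, nj])
                labels[i, j] = cost.argmin()
    return labels  # per-patch index into `candidates`
```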
 

 
Author: C. Alejandro Parraga
Title: Color Vision, Computational Methods for
Type: Book Chapter
Year: 2014
Publication: Encyclopedia of Computational Neuroscience
Pages: 1-11
Keywords: Color computational vision; Computational neuroscience of color
Abstract: The study of color vision has been aided by a whole battery of computational methods that attempt to describe the mechanisms that lead to our perception of colors in terms of the information-processing properties of the visual system. Their scope is highly interdisciplinary, linking apparently dissimilar disciplines such as mathematics, physics, computer science, neuroscience, cognitive science, and psychology. Since the sensation of color is a feature of our brains, computational approaches usually include biological features of neural systems in their descriptions, from retinal light-receptor interaction to subcortical color opponency, cortical signal decoding, and color categorization. They produce hypotheses that are usually tested by behavioral or psychophysical experiments.
Publisher: Springer-Verlag Berlin Heidelberg
Editor: Dieter Jaeger; Ranu Jung
ISBN: 978-1-4614-7320-6
Notes: CIC; 600.074
Call Number: Admin @ si @ Par2014; Serial: 2512
 

 
Author: Ivet Rafegas; Maria Vanrell
Title: Color encoding in biologically-inspired convolutional neural networks
Type: Journal Article
Year: 2018
Publication: Vision Research (VR)
Volume: 151; Pages: 7-17
Keywords: Color coding; Computer vision; Deep learning; Convolutional neural networks
Abstract: Convolutional neural networks have been proposed as suitable frameworks to model biological vision. Some of these artificial networks show representational properties that rival primate performance in object recognition. In this paper we explore how color is encoded in a trained artificial network. We do so by estimating a color selectivity index for each neuron, which describes the neuron's activity in response to color input stimuli. The index allows us to classify neurons as color selective or not, and as selective to a single color or to a pair of colors. We found that all five convolutional layers of the network contain a large number of color-selective neurons. Color opponency clearly emerges in the first layer, presenting four main axes (black-white, red-cyan, blue-yellow, and magenta-green), but this set is reduced and rotated as we go deeper into the network. In layer 2 we find a denser hue sampling of color neurons, and opponency is reduced almost to one new main axis, bluish-orangish, coinciding with the dataset bias. In layers 3, 4, and 5, color neurons are similar amongst themselves, presenting different types of neurons that detect specific colored objects (e.g., orangish faces), specific surrounds (e.g., blue sky), or specific colored or contrasted object-surround configurations (e.g., a blue blob in a green surround). Overall, our work concludes that color and shape representations are successively entangled through all the layers of the studied network, revealing certain parallels with the evidence reported for primate brains that can provide useful insight into intermediate hierarchical spatio-chromatic representations.
Notes: CIC; 600.051; 600.087
Call Number: Admin @ si @ RaV2018; Serial: 3114
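
A minimal sketch of a color selectivity index in the spirit of this abstract: compare a unit's activation on its top-scoring color patches with its activation on grayscale versions of the same patches. This particular formula is an assumption for illustration, not necessarily the index defined in the paper; `model_activation` is a hypothetical callable you supply.

```python
import numpy as np

def to_gray(patch):
    """Replicate the luminance of an RGB patch (H, W, 3) into 3 channels."""
    lum = patch @ np.array([0.299, 0.587, 0.114])
    return np.repeat(lum[..., None], 3, axis=-1)

def color_selectivity_index(model_activation, top_patches):
    """Index in [0, 1]: 0 = color-blind unit, 1 = fully color-selective.

    model_activation(patch) -> scalar activation of the unit under study.
    top_patches: list of RGB patches that maximally activate the unit.
    """
    a_color = np.array([model_activation(p) for p in top_patches])
    a_gray = np.array([model_activation(to_gray(p)) for p in top_patches])
    ratio = a_gray.clip(min=0) / a_color.clip(min=1e-8)
    return float(np.clip(1.0 - ratio.mean(), 0.0, 1.0))
```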
 

 
Author: Jorge Bernal
Title: Polyp Localization and Segmentation in Colonoscopy Images by Means of a Model of Appearance for Polyps
Type: Journal Article
Year: 2014
Publication: Electronic Letters on Computer Vision and Image Analysis (ELCVIA)
Volume: 13; Issue: 2; Pages: 9-10
Keywords: Colonoscopy; polyp localization; polyp segmentation; eye-tracking
Abstract: Colorectal cancer is the fourth most common cause of cancer death worldwide, and its survival rate depends on the stage at which it is detected; hence the need for early colon screening. There are several screening techniques, but colonoscopy is still the gold standard, although it has drawbacks such as the miss rate. Our contribution, in the field of intelligent systems for colonoscopy, aims at providing a polyp localization and a polyp segmentation system based on a model of appearance for polyps. To develop both methods we define a model of appearance for polyps which describes a polyp as enclosed by intensity valleys. The novelty of our contribution resides in the fact that we include aspects of image formation in our model and also consider the presence of other elements of the endoluminal scene, such as specular highlights and blood vessels, which have an impact on the performance of our methods. To develop our polyp localization method we accumulate valley information to generate energy maps, which are also used to guide the polyp segmentation. Our methods achieve promising results in polyp localization and segmentation. As we want to explore the usability of our methods, we present a comparative analysis between physicians' fixations, obtained via an eye-tracking device, and our polyp localization method. The results show that our method is indistinguishable from novice physicians, although it is still far from expert physicians.
Editor: Alicia Fornes; Volkmar Frinken
Notes: MV
Call Number: Admin @ si @ Ber2014; Serial: 2487
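
A rough sketch of the "accumulate valleys into energy maps" idea summarized above: mark pixels that behave like intensity valleys (strongly positive principal curvature of the image surface) and blur the result so that closed valley contours accumulate energy in their interior. The real pipeline is more elaborate; the sigma values here are arbitrary assumptions.

```python
import numpy as np
from scipy.ndimage import gaussian_filter
from skimage.feature import hessian_matrix, hessian_matrix_eigvals

def valley_energy_map(gray, sigma_valley=3.0, sigma_accum=15.0):
    """gray: 2-D float image. Returns an energy map peaking near regions
    enclosed by intensity valleys (candidate polyp locations)."""
    H = hessian_matrix(gray, sigma=sigma_valley, order="rc")
    eig_max = hessian_matrix_eigvals(H)[0]        # largest eigenvalue per pixel
    valleys = np.clip(eig_max, 0, None)           # keep valley-like responses
    return gaussian_filter(valleys, sigma_accum)  # crude spatial accumulation
```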
 

 
Author: Joan M. Nuñez; Jorge Bernal; Miquel Ferrer; Fernando Vilariño
Title: Impact of Keypoint Detection on Graph-based Characterization of Blood Vessels in Colonoscopy Videos
Type: Conference Article
Year: 2014
Publication: CARE workshop
Keywords: Colonoscopy; Graph matching; Biometrics; Vessel; Intersection
Abstract: We explore the potential of blood vessels as anatomical landmarks for developing image registration methods for colonoscopy images. An unequivocal representation of blood vessels could be used to guide follow-up methods that track lesions across different interventions. We propose a graph-based representation to characterize network structures, such as blood vessels, based on the use of intersections and endpoints. We present a study assessing the minimal performance a keypoint detector should achieve so that the structure can still be recognized. Experimental results show that, even with a loss of 35% of the keypoints, the descriptive power of the graphs associated with the vessel pattern remains high enough to recognize blood vessels.
Address: Boston; USA; September 2014
Conference: CARE
Notes: MV; DAG; 600.060; 600.047; 600.077; SIAI
Call Number: Admin @ si @ NBF2014; Serial: 2504
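
A sketch of the robustness experiment described above: represent a vessel pattern as a graph whose nodes are intersections and endpoints, simulate an imperfect keypoint detector by dropping a fraction of the nodes, and measure how similar the degraded graph stays to the original. The similarity measure here (networkx graph edit distance) is a stand-in, not the paper's matcher; the toy graph is invented.

```python
import random
import networkx as nx

def drop_keypoints(g, miss_rate, seed=0):
    """Remove `miss_rate` of the nodes, emulating detector misses."""
    rng = random.Random(seed)
    victims = rng.sample(list(g.nodes), int(miss_rate * g.number_of_nodes()))
    h = g.copy()
    h.remove_nodes_from(victims)
    return h

# Toy vessel graph: intersections/endpoints joined by vessel segments.
vessel = nx.Graph([(0, 1), (1, 2), (1, 3), (3, 4), (3, 5), (5, 6)])
degraded = drop_keypoints(vessel, miss_rate=0.35)
dist = nx.graph_edit_distance(vessel, degraded)
print(f"edit distance after 35% keypoint loss: {dist}")
```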
 

 
Author: Joan M. Nuñez; Jorge Bernal; F. Javier Sanchez; Fernando Vilariño
Title: Blood Vessel Characterization in Colonoscopy Images to Improve Polyp Localization
Type: Conference Article
Year: 2013
Publication: Proceedings of the International Conference on Computer Vision Theory and Applications
Volume: 1; Pages: 162-171
Keywords: Colonoscopy; Blood vessel; Linear features; Valley detection
Abstract: This paper presents an approach to mitigate the contribution of blood vessels to the energy image used in different tasks of automatic colonoscopy image analysis. This goal is achieved by introducing a characterization of endoluminal scene objects which allows us to differentiate between the traces of 2-dimensional visual objects, such as vessels, and the shades of 3-dimensional visual objects, such as folds. The proposed characterization is based on the influence that object shape has on the resulting visual feature, and it leads to the development of a blood vessel attenuation algorithm. A database of manually labelled masks was built to test the performance of our method, which shows encouraging success in blood vessel mitigation while keeping other structures intact. Moreover, by extending our method to the only available polyp localization algorithm tested on a public database, blood vessel mitigation proved to have a positive influence on overall performance.
Address: Barcelona; February 2013
Publisher: SciTePress
Area: 800; Conference: VISIGRAPP
Notes: MV; 600.054; 600.057; SIAI
Call Number: IAM @ iam @ NBS2013; Serial: 2198
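
A hedged sketch of vessel attenuation in an energy image: estimate a per-pixel vessel-likeliness score and use it to down-weight the energy image so that thin, dark curvilinear structures contribute less. The paper derives its own shape-based characterization; Frangi vesselness is used here purely as an off-the-shelf stand-in, and the attenuation rule is an assumption.

```python
import numpy as np
from skimage.filters import frangi

def attenuate_vessels(energy, gray, strength=1.0):
    """energy: 2-D energy image; gray: the 2-D intensity image it came from.
    Returns the energy image with vessel-like pixels suppressed."""
    v = frangi(gray)                      # high on thin dark curvilinear structures
    v = v / (v.max() + 1e-8)              # normalize to [0, 1]
    return energy * (1.0 - strength * v)  # suppress energy where vessels are
```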
 

 
Author: Jorge Bernal; F. Javier Sanchez; Fernando Vilariño
Title: Towards Automatic Polyp Detection with a Polyp Appearance Model
Type: Journal Article
Year: 2012
Publication: Pattern Recognition (PR)
Volume: 45; Issue: 9; Pages: 3166-3182
Keywords: Colonoscopy; Polyp detection; Region segmentation; SA-DOVA descriptor
Abstract: This work aims at automatic polyp detection using a model of polyp appearance in the context of the analysis of colonoscopy videos. Our method consists of three stages: region segmentation, region description, and region classification. The performance of our region segmentation method guarantees that, if a polyp is present in the image, it will be exclusively and totally contained in a single region. The output of the algorithm also defines which regions can be considered non-informative. As our region descriptor we define the novel Sector Accumulation-Depth of Valleys Accumulation (SA-DOVA), which provides a necessary but not sufficient condition for polyp presence. Finally, we classify our segmented regions according to the maximal values of the SA-DOVA descriptor. Our preliminary classification results are promising, especially when classifying those parts of the image that do not contain a polyp.
Publisher: Elsevier
ISSN: 0031-3203
Area: 800; Conference: IbPRIA
Notes: MV; SIAI
Call Number: Admin @ si @ BSV2012; IAM @ iam; Serial: 1997
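
A sketch of the sector-accumulation idea behind SA-DOVA as summarized above: from a candidate region's center, split the surrounding pixels into angular sectors and accumulate, per sector, the strongest depth-of-valleys response. A polyp enclosed by valleys should score high in most sectors. The sector count, radius, and the accumulation rule (max) are assumptions, not the paper's exact definition.

```python
import numpy as np

def sector_accumulation(valley_map, center, n_sectors=36, radius=60):
    """valley_map: 2-D valley-strength image. Returns one value per sector."""
    h, w = valley_map.shape
    cy, cx = center
    ys, xs = np.mgrid[0:h, 0:w]
    dy, dx = ys - cy, xs - cx
    dist = np.hypot(dy, dx)
    angle = np.arctan2(dy, dx)                       # in [-pi, pi)
    sector = ((angle + np.pi) / (2 * np.pi) * n_sectors).astype(int) % n_sectors
    acc = np.zeros(n_sectors)
    inside = dist <= radius
    for s in range(n_sectors):
        vals = valley_map[inside & (sector == s)]
        if vals.size:
            acc[s] = vals.max()                      # strongest valley per sector
    return acc  # e.g., classify a region by acc.min() or np.median(acc)
```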
 

 
Author: Jorge Bernal; F. Javier Sanchez; Fernando Vilariño
Title: A Region Segmentation Method for Colonoscopy Images Using a Model of Polyp Appearance
Type: Conference Article
Year: 2011
Publication: 5th Iberian Conference on Pattern Recognition and Image Analysis
Volume: 6669; Pages: 134-143
Keywords: Colonoscopy; Polyp detection; Region merging; Region segmentation
Abstract: This work aims at the segmentation of colonoscopy images into a minimum number of informative regions. Our method performs in such a way that, if a polyp is present in the image, it will be exclusively and totally contained in a single region. This result can be used in later stages to classify regions as polyp-containing candidates. The output of the algorithm also defines which regions can be considered non-informative. The algorithm starts with a high number of initial regions and merges them taking into account the model of polyp appearance obtained from available data. The results show that our segmentations of polyp regions are more accurate than those of state-of-the-art methods.
Address: Las Palmas de Gran Canaria; June 2011
Corporate Author: SpringerLink
Editor: Vitrià, Jordi; Sanches, João; Hernández, Mario
Series Title: Lecture Notes in Computer Science (LNCS)
ISBN: 978-3-642-21256-7
Area: 800; Conference: IbPRIA
Notes: MV; SIAI
Call Number: IAM @ iam @ BSV2011c; Serial: 1696
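
A minimal sketch of the "start with many regions, then merge" strategy described above: over-segment the frame, build a region adjacency graph, and merge adjacent regions whose mean colors are close. The paper's merge criterion is driven by a learned model of polyp appearance; plain color similarity is used here only as an illustrative stand-in, and the thresholds are assumptions.

```python
from skimage import segmentation, graph  # skimage >= 0.20 (earlier: skimage.future.graph)

def merge_regions(image, n_initial=400, color_thresh=25.0):
    """image: RGB array. Returns an integer label map with merged regions."""
    labels = segmentation.slic(image, n_segments=n_initial, start_label=1)
    rag = graph.rag_mean_color(image, labels)   # nodes = regions, edges = color distance
    return graph.cut_threshold(labels, rag, color_thresh)
```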
 

 
Author: Debora Gil; F. Javier Sanchez; Gloria Fernandez Esparrach; Jorge Bernal
Title: 3D Stable Spatio-temporal Polyp Localization in Colonoscopy Videos
Type: Book Chapter
Year: 2015
Publication: Computer-Assisted and Robotic Endoscopy. Revised selected papers of the Second International Workshop, CARE 2015, held in conjunction with MICCAI 2015
Volume: 9515; Pages: 140-152
Keywords: Colonoscopy; Polyp detection; Polyp localization; Region extraction; Watersheds
Abstract: Computational intelligent systems could reduce the polyp miss rate in colonoscopy for colon cancer diagnosis and thus increase the efficiency of the procedure. One of the main problems of existing polyp localization methods is a lack of spatio-temporal stability in their response. We propose to explore the response of a given polyp localization method across temporal windows in order to select those image regions presenting the highest stable spatio-temporal response. Spatio-temporal stability is achieved by extracting 3D watershed regions on the temporal window. Stability in the localization response is statistically determined by analyzing the variance of the output of the localization method inside each 3D region. We have explored the benefits of considering spatio-temporal stability in two different tasks: polyp localization and polyp detection. Experimental results indicate an average improvement of 21.5% in polyp localization and 43.78% in polyp detection.
Abbreviated Series Title: LNCS
Conference: CARE
Notes: IAM; MV; 600.075
Call Number: Admin @ si @ GSF2015; Serial: 2733
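
A sketch of the spatio-temporal stability test described above: stack the per-frame localization maps of a temporal window into a 3-D volume, carve it into 3-D watershed regions, and keep regions whose response is strong on average but has low variance (i.e., stable over time). The marker choice and the selection thresholds are assumptions for illustration, not the paper's statistical criterion.

```python
import numpy as np
from scipy import ndimage
from skimage.segmentation import watershed

def stable_regions(response_stack, min_mean=0.5, max_var=0.02):
    """response_stack: (T, H, W) localization responses in [0, 1].
    Returns a boolean (T, H, W) mask of spatio-temporally stable regions."""
    vol = np.asarray(response_stack, dtype=float)
    # Seeds at local maxima of the smoothed volume, then a 3-D watershed.
    smooth = ndimage.gaussian_filter(vol, sigma=2.0)
    seeds, _ = ndimage.label(smooth == ndimage.maximum_filter(smooth, size=9))
    regions = watershed(-smooth, seeds)
    keep = np.zeros_like(regions, dtype=bool)
    for r in np.unique(regions):
        vals = vol[regions == r]
        if vals.mean() >= min_mean and vals.var() <= max_var:
            keep[regions == r] = True
    return keep
```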
 

 
Author: David Vazquez; Jorge Bernal; F. Javier Sanchez; Gloria Fernandez Esparrach; Antonio Lopez; Adriana Romero; Michal Drozdzal; Aaron Courville
Title: A Benchmark for Endoluminal Scene Segmentation of Colonoscopy Images
Type: Journal Article
Year: 2017
Publication: Journal of Healthcare Engineering (JHCE)
Pages: 2040-2295
Keywords: Colonoscopy images; Deep learning; Semantic segmentation
Abstract: Colorectal cancer (CRC) is the third cause of cancer death worldwide. Currently, the standard approach to reducing CRC-related mortality is to perform regular screening in search for polyps, and colonoscopy is the screening tool of choice. The main limitations of this screening procedure are the polyp miss rate and the inability to perform visual assessment of polyp malignancy. These drawbacks can be reduced by designing decision support systems (DSS) aiming to help clinicians in the different stages of the procedure by providing endoluminal scene segmentation. Thus, in this paper, we introduce an extended benchmark of colonoscopy image segmentation, with the hope of establishing a new strong benchmark for colonoscopy image analysis research. The proposed dataset consists of 4 relevant classes to inspect the endoluminal scene, targeting different clinical needs. Together with the dataset, and taking advantage of advances in the semantic segmentation literature, we provide new baselines by training standard fully convolutional networks (FCN). We perform a comparative study to show that FCNs significantly outperform, without any further post-processing, prior results in endoluminal scene segmentation, especially with respect to polyp segmentation and localization.
Notes: ADAS; MV; 600.075; 600.085; 600.076; 601.281; 600.118
Call Number: VBS2017b; Serial: 2940
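
A minimal fully convolutional baseline in the spirit of the benchmark above: a small encoder-decoder that maps an RGB colonoscopy frame to per-pixel logits over 4 endoluminal-scene classes. This is an illustrative toy, not the FCN architecture trained in the paper; the channel widths and depth are assumptions.

```python
import torch
import torch.nn as nn

class TinyFCN(nn.Module):
    def __init__(self, n_classes=4):
        super().__init__()
        def block(cin, cout):
            return nn.Sequential(nn.Conv2d(cin, cout, 3, padding=1),
                                 nn.BatchNorm2d(cout), nn.ReLU(inplace=True))
        self.enc1 = block(3, 32)
        self.enc2 = block(32, 64)
        self.pool = nn.MaxPool2d(2)
        self.dec = block(64, 32)
        self.up = nn.Upsample(scale_factor=2, mode="bilinear", align_corners=False)
        self.head = nn.Conv2d(32, n_classes, 1)  # per-pixel class logits

    def forward(self, x):
        x = self.enc1(x)
        x = self.pool(self.enc2(x))
        x = self.up(self.dec(x))
        return self.head(x)

# Usage: logits = TinyFCN()(torch.randn(1, 3, 256, 256))  -> (1, 4, 256, 256).
# Train with nn.CrossEntropyLoss against integer label maps of shape (1, 256, 256).
```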
 

 
Author: Carles Fernandez; Pau Baiget; Xavier Roca; Jordi Gonzalez
Title: Interpretation of Complex Situations in a Semantic-based Surveillance Framework
Type: Journal
Year: 2008
Publication: Signal Processing: Image Communication, Special Issue on Semantic Analysis for Interactive Multimedia Services
Volume: 23; Issue: 7; Pages: 554-569
Keywords: Cognitive vision system; Situation analysis; Applied ontologies
Abstract: The integration of cognitive capabilities in computer vision systems requires both to enable high semantic expressiveness and to deal with high computational costs as large amounts of data are involved in the analysis. This contribution describes a cognitive vision system conceived to automatically provide high-level interpretations of complex real-time situations in outdoor and indoor scenarios, and to eventually maintain communication with casual end users in multiple languages. The main contributions are: (i) the design of an integrative multilevel architecture for cognitive surveillance purposes; (ii) the proposal of a coherent taxonomy of knowledge to guide the process of interpretation, which leads to the conception of a situation-based ontology; (iii) the use of situational analysis for content detection and a progressive interpretation of semantically rich scenes, by managing incomplete or uncertain knowledge, and (iv) the use of such an ontological background to enable multilingual capabilities and advanced end-user interfaces. Experimental results are provided to show the feasibility of the proposed approach.
Notes: ISE
Call Number: ISE @ ise @ FBR2008; Serial: 954
 

 
Author: Carles Fernandez; Pau Baiget; Xavier Roca; Jordi Gonzalez
Title: Determining the Best Suited Semantic Events for Cognitive Surveillance
Type: Journal Article
Year: 2011
Publication: Expert Systems with Applications (EXSY)
Volume: 38; Issue: 4; Pages: 4068-4079
Keywords: Cognitive surveillance; Event modeling; Content-based video retrieval; Ontologies; Advanced user interfaces
Abstract: State-of-the-art systems on cognitive surveillance identify and describe complex events in selected domains, thus providing end-users with tools to easily access the contents of massive video footage. Nevertheless, as the complexity of events increases in semantics and the types of indoor/outdoor scenarios diversify, it becomes difficult to assess which events describe better the scene, and how to model them at a pixel level to fulfill natural language requests. We present an ontology-based methodology that guides the identification, step-by-step modeling, and generalization of the most relevant events to a specific domain. Our approach considers three steps: (1) end-users provide textual evidence from surveilled video sequences; (2) transcriptions are analyzed top-down to build the knowledge bases for event description; and (3) the obtained models are used to generalize event detection to different image sequences from the surveillance domain. This framework produces user-oriented knowledge that improves on existing advanced interfaces for video indexing and retrieval, by determining the best suited events for video understanding according to end-users. We have conducted experiments with outdoor and indoor scenes showing thefts, chases, and vandalism, demonstrating the feasibility and generalization of this proposal.
Publisher: Elsevier
Notes: ISE
Call Number: Admin @ si @ FBR2011a; Serial: 1722
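
A hedged sketch of the kind of event model the methodology above builds: each semantic event is declared with the roles it involves and a predicate over tracked scene state, so new domains can register events without touching the detection core. The class names and the toy chase rule are illustrative assumptions, not the ontology defined in the paper.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

@dataclass
class EventModel:
    name: str                          # e.g., "theft", "chase", "vandalism"
    roles: List[str]                   # participants the event binds
    predicate: Callable[[Dict], bool]  # decides if a role binding fires the event

def chase_predicate(binding: Dict) -> bool:
    """Toy rule: two agents moving fast along similar headings, close together."""
    a, b = binding["pursuer"], binding["pursued"]
    return (a["speed"] > 2.0 and b["speed"] > 2.0
            and abs(a["heading"] - b["heading"]) < 0.5
            and abs(a["x"] - b["x"]) + abs(a["y"] - b["y"]) < 5.0)

CHASE = EventModel("chase", ["pursuer", "pursued"], chase_predicate)

binding = {"pursuer": {"speed": 3.1, "heading": 0.2, "x": 0, "y": 0},
           "pursued": {"speed": 2.8, "heading": 0.1, "x": 2, "y": 1}}
print(CHASE.name, "fires:", CHASE.predicate(binding))
```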
 

 
Author: Aura Hernandez-Sabate; Jose Elias Yauri; Pau Folch; Miquel Angel Piera; Debora Gil
Title: Recognition of the Mental Workloads of Pilots in the Cockpit Using EEG Signals
Type: Journal Article
Year: 2022
Publication: Applied Sciences (APPLSCI)
Volume: 12; Issue: 5; Pages: 2298
Keywords: Cognitive states; Mental workload; EEG analysis; Neural networks; Multimodal data fusion
Abstract: The commercial flight deck is a naturally multi-tasking work environment, one in which interruptions are frequent and come in various forms, contributing in many cases to aviation incident reports. Automatic characterization of pilots' workloads is essential to preventing these kinds of incidents. In addition, minimizing the physiological sensor network as much as possible remains both a challenge and a requirement. Electroencephalogram (EEG) signals have shown high correlations with specific cognitive and mental states, such as workload. However, there is not enough evidence in the literature to validate how well models generalize to new subjects performing tasks with workloads similar to the ones included during the model's training. In this paper, we propose a convolutional neural network to classify EEG features across different mental workloads in a continuous performance task test that partly measures working memory and working memory capacity. Our model is valid at the general-population level and is able to transfer task learning to pilot mental workload recognition in a simulated operational environment.
Address: February 2022
Notes: IAM; ADAS; 600.139; 600.145; 600.118
Call Number: Admin @ si @ HYF2022; Serial: 3720
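
A hedged sketch of the kind of classifier the abstract describes: a small convolutional network over per-window EEG feature maps (channels x features) emitting workload-class logits. The layer sizes, input layout, and class count are assumptions; the paper's architecture and features differ.

```python
import torch
import torch.nn as nn

class EEGWorkloadNet(nn.Module):
    def __init__(self, n_channels=32, n_features=64, n_classes=2):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv1d(n_channels, 64, kernel_size=5, padding=2), nn.ReLU(),
            nn.Conv1d(64, 64, kernel_size=5, padding=2), nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),      # pool over the feature axis
        )
        self.fc = nn.Linear(64, n_classes)

    def forward(self, x):                 # x: (batch, n_channels, n_features)
        return self.fc(self.conv(x).squeeze(-1))

# Usage: logits = EEGWorkloadNet()(torch.randn(8, 32, 64))  -> (8, 2)
```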
 

 
Author: Jose Elias Yauri; Aura Hernandez-Sabate; Pau Folch; Debora Gil
Title: Mental Workload Detection Based on EEG Analysis
Type: Conference Article
Year: 2021
Publication: Artificial Intelligence Research and Development. Proceedings of the 23rd International Conference of the Catalan Association for Artificial Intelligence
Volume: 339; Pages: 268-277
Keywords: Cognitive states; Mental workload; EEG analysis; Neural networks
Abstract: The study of mental workload is essential for human work efficiency and health, and for avoiding accidents, since workload compromises both performance and awareness. Although workload has been widely studied using several physiological measures, minimizing the sensor network as much as possible remains both a challenge and a requirement. Electroencephalogram (EEG) signals have shown a high correlation to specific cognitive and mental states, such as workload. However, there is not enough evidence in the literature to validate how well models generalize to new subjects performing tasks with a workload similar to the ones included during the model's training. In this paper we propose a binary neural network to classify EEG features across different mental workloads. Two workloads, low and medium, are induced using two variants of the N-Back Test. The proposed model was validated on a dataset collected from 16 subjects and showed a high level of generalization capability: the model reported an average recall of 81.81% in a leave-one-subject-out evaluation.
Address: Virtual; October 20-22, 2021
Conference: CCIA
Notes: IAM; 600.139; 600.118; 600.145
Call Number: Admin @ si @; Serial: 3723
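
A sketch of the leave-one-subject-out protocol mentioned above: every subject in turn is held out for testing while the model trains on the rest, and the per-subject recalls are averaged. A logistic-regression stand-in replaces the paper's neural network so the protocol itself stays in focus; the toy data is random.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import recall_score
from sklearn.model_selection import LeaveOneGroupOut

def loso_recall(X, y, subject_ids):
    """X: (n_windows, n_features); y: binary labels; subject_ids: per-window."""
    recalls = []
    for train, test in LeaveOneGroupOut().split(X, y, groups=subject_ids):
        clf = LogisticRegression(max_iter=1000).fit(X[train], y[train])
        recalls.append(recall_score(y[test], clf.predict(X[test])))
    return float(np.mean(recalls))

# Toy data: 16 subjects, 20 windows each, 64 features per window.
rng = np.random.default_rng(0)
X = rng.normal(size=(16 * 20, 64))
y = rng.integers(0, 2, size=16 * 20)
subjects = np.repeat(np.arange(16), 20)
print(f"average LOSO recall: {loso_recall(X, y, subjects):.3f}")
```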
 

 
Author: Jose Luis Gomez; Gabriel Villalonga; Antonio Lopez
Title: Co-Training for Deep Object Detection: Comparing Single-Modal and Multi-Modal Approaches
Type: Journal Article
Year: 2021
Publication: Sensors (SENS)
Volume: 21; Issue: 9; Pages: 3185
Keywords: co-training; multi-modality; vision-based object detection; ADAS; self-driving
Abstract: Top-performing computer vision models are powered by convolutional neural networks (CNNs). Training an accurate CNN highly depends on both the raw sensor data and their associated ground truth (GT). Collecting such GT is usually done through human labeling, which is time-consuming and does not scale as we wish. This data-labeling bottleneck may be intensified due to domain shifts among image sensors, which could force per-sensor data labeling. In this paper, we focus on the use of co-training, a semi-supervised learning (SSL) method, for obtaining self-labeled object bounding boxes (BBs), i.e., the GT to train deep object detectors. In particular, we assess the goodness of multi-modal co-training by relying on two different views of an image, namely, appearance (RGB) and estimated depth (D). Moreover, we compare appearance-based single-modal co-training with multi-modal. Our results suggest that in a standard SSL setting (no domain shift, a few human-labeled data) and under virtual-to-real domain shift (many virtual-world labeled data, no human-labeled data) multi-modal co-training outperforms single-modal. In the latter case, by performing GAN-based domain translation both co-training modalities are on par, at least when using an off-the-shelf depth estimation model not specifically trained on the translated images.
Notes: ADAS; 600.118
Call Number: Admin @ si @ GVL2021; Serial: 3562
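
A hedged sketch of the multi-modal co-training loop the abstract describes: two detectors, one per view (RGB appearance and estimated depth), take turns labeling the unlabeled pool, and each model's confident detections become pseudo-ground-truth for the other model. `Detector` is a hypothetical interface (train/predict with scored boxes); the confidence threshold and round count are assumptions, not the paper's schedule.

```python
def co_train(det_rgb, det_depth, labeled, unlabeled, conf_thresh=0.8, rounds=5):
    """labeled: list of (rgb, depth, boxes); unlabeled: list of (rgb, depth).
    det_rgb / det_depth: hypothetical detectors with .train(pairs) and
    .predict(image) -> list of boxes carrying a .score attribute."""
    train_rgb = [(rgb, boxes) for rgb, _, boxes in labeled]
    train_depth = [(depth, boxes) for _, depth, boxes in labeled]
    for _ in range(rounds):
        det_rgb.train(train_rgb)
        det_depth.train(train_depth)
        for rgb, depth in unlabeled:
            boxes_r = [b for b in det_rgb.predict(rgb) if b.score >= conf_thresh]
            boxes_d = [b for b in det_depth.predict(depth) if b.score >= conf_thresh]
            # Cross-feed: each view's confident boxes supervise the other view.
            if boxes_r:
                train_depth.append((depth, boxes_r))
            if boxes_d:
                train_rgb.append((rgb, boxes_d))
    return det_rgb, det_depth
```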