Publicacions CVC -- Edit Record

	Publicacions CVC Home \| Show All \| Simple Search \| Advanced Search \| Add Record \| Import	You must login to submit this form! Login Quick Search: Field: contains: ...
	Edit the following record:

Author	...				is Editor
Title	...			Type
Year	...	Publication	...	Abbreviated Journal	...
Volume	...	Issue	...	Pages	...
Keywords	...
Abstract	In autonomous driving, artificial intelligence (AI) processes the traffic environment to drive the vehicle to a desired destination. Currently, there are different paradigms that address the development of AI-enabled drivers. On the one hand, we find modular pipelines, which divide the driving task into sub-tasks such as perception, maneuver planning, and control. On the other hand, we find end-to-end driving approaches that attempt to learn the direct mapping of raw data from input sensors to vehicle control signals. The latter are relatively less studied but are gaining popularity as they are less demanding in terms of data labeling. Therefore, in this thesis, our goal is to investigate end-to-end autonomous driving. We propose to evaluate three approaches to tackle the challenge of end-to-end autonomous driving. First, we focus on the input, considering adding depth information as complementary to RGB data, in order to mimic the human being’s ability to estimate the distance to obstacles. Notice that, in the real world, these depth maps can be obtained either from a LiDAR sensor, or a trained monocular depth estimation module, where human labeling is not needed. Then, based on the intuition that the latent space of end-to-end driving models encodes relevant information for driving, we use it as prior knowledge for training an affordancebased driving model. In this case, the trained affordance-based model can achieve good performance while requiring less human-labeled data, and it can provide interpretability regarding driving actions. Finally, we present a new pure vision-based end-to-end driving model termed CIL++, which is trained by imitation learning. CIL++ leverages modern best practices, such as a large horizontal field of view and a self-attention mechanism, which are contributing to the agent’s understanding of the driving scene and bringing a better imitation of human drivers. Using training data without any human labeling, our model yields almost expert performance in the CARLA NoCrash benchmark and could rival SOTA models that require large amounts of human-labeled data.
Address	...
Corporate Author	...			Thesis
Publisher	...	Place of Publication	...	Editor	...
Language	...	Summary Language	...	Original Title	...
Series Editor	...	Series Title	...	Abbreviated Series Title	...
Series Volume	...	Series Issue	...	Edition	...
ISSN	...	ISBN	...	Medium	...
Area	...	Expedition	...	Conference	...
Notes	...			Approved	yes no
Location
Call Number	...			Serial
Marked	yes no	Copy		Selected	yes no
User Keys	...
User Notes	...			User File	...
User Groups	...			Cite Key	...
Related	...
File
URL	...			DOI	...
	Online publication. Cite with this text: ...

Location Field:	my name & email address

Home

SQL Search | Library Search | Show Record | Extract Citations

Help