Publicacions CVC -- Query Results


Records
Author	Fahad Shahbaz Khan; Muhammad Anwer Rao; Joost Van de Weijer; Michael Felsberg; J.Laaksonen
Title	Deep semantic pyramids for human attributes and action recognition			Type	Conference Article
Year	2015	Publication	Image Analysis, Proceedings of 19th Scandinavian Conference, SCIA 2015	Abbreviated Journal
Volume	9127	Issue		Pages	341-353
Keywords	Action recognition; Human attributes; Semantic pyramids
Abstract	Describing persons and their actions is a challenging problem due to variations in pose, scale and viewpoint in real-world images. Recently, the semantic pyramids approach [1] for pose normalization has been shown to provide excellent results for gender and action recognition. The performance of the semantic pyramids approach relies on robust image description and is therefore limited by the use of shallow local features. In the context of object recognition [2] and object detection [3], convolutional neural networks (CNNs), or deep features, have been shown to improve performance over conventional shallow features. We propose deep semantic pyramids for human attributes and action recognition. The method works by constructing spatial pyramids based on CNNs of different part locations. These pyramids are then combined to obtain a single semantic representation. We validate our approach on the Berkeley and 27 Human Attributes datasets for attributes classification. For action recognition, we perform experiments on two challenging datasets: Willow and PASCAL VOC 2010. The proposed deep semantic pyramids provide a significant gain of 17.2%, 13.9%, 24.3% and 22.6% compared to the standard shallow semantic pyramids on the Berkeley, 27 Human Attributes, Willow and PASCAL VOC 2010 datasets respectively. Our results also show that deep semantic pyramids outperform conventional CNNs based on the full bounding box of the person. Finally, we compare our approach with state-of-the-art methods and show a gain in performance compared to the best methods in the literature.
Address	Denmark; Copenhagen; June 2015
Corporate Author				Thesis
Publisher	Springer International Publishing	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-319-19664-0	Medium
Area		Expedition		Conference	SCIA
Notes	LAMP; 600.068; 600.079;ADAS			Approved	no
Call Number	Admin @ si @ KRW2015b			Serial	2672



Author	Yi Xiao; Felipe Codevilla; Diego Porres; Antonio Lopez
Title	Scaling Vision-Based End-to-End Autonomous Driving with Multi-View Attention Learning			Type	Conference Article
Year	2023	Publication	International Conference on Intelligent Robots and Systems	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract	In end-to-end driving, human driving demonstrations are used to train perception-based driving models by imitation learning. This process is supervised on vehicle signals (e.g., steering angle, acceleration) but does not require extra costly supervision (human labeling of sensor data). As a representative of such vision-based end-to-end driving models, CILRS is commonly used as a baseline to compare with new driving models. So far, some of the latest models achieve better performance than CILRS by using expensive sensor suites and/or large amounts of human-labeled data for training. Given the difference in performance, one may think that it is not worth pursuing vision-based pure end-to-end driving. However, we argue that this approach still has great value and potential considering cost and maintenance. In this paper, we present CIL++, which improves on CILRS both by processing higher-resolution images using a human-inspired HFOV as an inductive bias and by incorporating a proper attention mechanism. CIL++ achieves competitive performance compared to models which are more costly to develop. We propose to replace CILRS with CIL++ as a strong vision-based pure end-to-end driving baseline supervised by only vehicle signals and trained by conditional imitation learning.
Address	Detroit; USA; October 2023
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	IROS
Notes	ADAS			Approved	no
Call Number	Admin @ si @ XCP2023			Serial	3930



Author	Patricia Suarez; Dario Carpio; Angel Sappa; Henry Velesaca
Title	Transformer based Image Dehazing			Type	Conference Article
Year	2022	Publication	16th IEEE International Conference on Signal Image Technology & Internet Based System	Abbreviated Journal
Volume		Issue		Pages
Keywords	atmospheric light; brightness component; computational cost; dehazing quality; haze-free image
Abstract	This paper presents a novel approach to remove non-homogeneous haze from real images. The proposed method consists mainly of image feature extraction, haze removal, and image reconstruction. To accomplish this challenging task, we propose an architecture based on transformers, which have been recently introduced and have shown great potential in different computer vision tasks. Our model is based on SwinIR, an image restoration architecture based on a transformer, but modifies the deep feature extraction module and the depth level of the model, and applies a combined loss function that improves styling and adapts the model to remove the non-homogeneous haze present in images. The obtained results prove to be superior to those obtained by state-of-the-art models.
Address	Dijon; France; October 2022
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	SITIS
Notes	MSIAU; no proj			Approved	no
Call Number	Admin @ si @ SCS2022			Serial	3803



Author	Subhajit Maity; Sanket Biswas; Siladittya Manna; Ayan Banerjee; Josep Llados; Saumik Bhattacharya; Umapada Pal
Title	SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation			Type	Conference Article
Year	2023	Publication	17th International Conference on Document Analysis and Recognition	Abbreviated Journal
Volume	14187	Issue		Pages	342–360
Keywords
Abstract	Document layout analysis is a well-known problem in the document research community and has been extensively explored, yielding a multitude of solutions ranging from text mining and recognition to graph-based representation, visual feature extraction, etc. However, most existing works have ignored the scarcity of labeled data. With growing internet connectivity in personal life, an enormous number of documents have become available in the public domain, making data annotation a tedious task. We address this challenge using self-supervision and, unlike the few existing self-supervised document segmentation approaches, which use text mining and textual labels, we use a completely vision-based approach in pre-training without any ground-truth label or its derivative. Instead, we generate pseudo-layouts from the document images to pre-train an image encoder to learn the document object representation and localization in a self-supervised framework before fine-tuning it with an object detection model. We show that our pipeline sets a new benchmark in this context and performs on par with, if not better than, the existing methods and the supervised counterparts. The code is made publicly available at: this https URL
Address	Document Layout Analysis; Document
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICDAR
Notes	DAG			Approved	no
Call Number	Admin @ si @ MBM2023			Serial	3990



Author	Karel Paleček; David Geronimo; Frederic Lerasle
Title	Pre-attention cues for person detection			Type	Conference Article
Year	2012	Publication	Cognitive Behavioural Systems, COST 2102 International Training School	Abbreviated Journal
Volume		Issue		Pages	225-235
Keywords
Abstract	Current state-of-the-art person detectors have been proven reliable and achieve very good detection rates. However, their performance is often far from real time, which limits their use to low-resolution images only. In this paper, we deal with the candidate window generation problem for person detection, i.e. we want to reduce the computational complexity of a person detector by reducing the number of regions that have to be evaluated. We base our work on Alexe’s paper [1], which introduced several pre-attention cues for generic object detection. We evaluate these cues in the context of person detection and show that their performance degrades rapidly for scenes containing multiple objects of interest, such as pictures from urban environments. We extend this set with new cues, which better suit our class-specific task. The cues are designed to be simple and efficient, so that they can be used in the pre-attention phase of a more complex sliding-window-based person detector.
Address	Dresden, Germany
Corporate Author				Thesis
Publisher	Springer Berlin Heidelberg	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN	0302-9743	ISBN	978-3-642-34583-8	Medium
Area		Expedition		Conference	COST-TS
Notes	ADAS			Approved	no
Call Number	Admin @ si @ PGL2012			Serial	2148



Author	Marçal Rusiñol; David Aldavert; Dimosthenis Karatzas; Ricardo Toledo; Josep Llados
Title	Interactive Trademark Image Retrieval by Fusing Semantic and Visual Content. Advances in Information Retrieval			Type	Conference Article
Year	2011	Publication	33rd European Conference on Information Retrieval	Abbreviated Journal
Volume	6611	Issue		Pages	314-325
Keywords
Abstract	In this paper we propose an efficient query-by-example retrieval system which is able to retrieve trademark images by similarity from patent and trademark offices' digital libraries. Logo images are described by both their semantic content, by means of the Vienna codes, and their visual content, by using shape and color as visual cues. The trademark descriptors are then indexed by a locality-sensitive hashing data structure aiming to perform approximate k-NN search in high-dimensional spaces in sub-linear time. The resulting ranked lists are combined by using the Condorcet method, and a relevance feedback step helps to iteratively revise the query and refine the obtained results. The experiments demonstrate the effectiveness and efficiency of this system on a realistic and large dataset.
Address	Dublin, Ireland
Corporate Author				Thesis
Publisher	Springer	Place of Publication	Berlin	Editor	P. Clough; C. Foley; C. Gurrin; G.J.F. Jones; W. Kraaij; H. Lee; V. Murdoch
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title	LNCS
Series Volume		Series Issue		Edition
ISSN		ISBN	978-3-642-20160-8	Medium
Area		Expedition		Conference	ECIR
Notes	DAG; RV;ADAS			Approved	no
Call Number	Admin @ si @ RAK2011			Serial	1737



Author	Enric Marti; Debora Gil; Carme Julia
Title	A PBL experience in the teaching of Computer Graphics			Type	Conference Article
Year	2005	Publication	EUROGRAPHICS Proceedings	Abbreviated Journal
Volume	5	Issue	1	Pages	95-103
Keywords	project-based learning; computer graphics education; OpenGL; rendering techniques; computer animation techniques; Graphics packages; Hierarchy and geometric transformations; Animation; Color; shading; shadowing and texture; fractals; hidden line/surface removal; Problem Based Learning
Abstract	Project-Based Learning (PBL) is an educational strategy to improve students’ learning capability that, in recent years, has gained progressive acceptance in undergraduate studies. This methodology is based on solving a problem or project in a student working group. In this way, PBL focuses on learning the necessary tools to correctly find a solution to given problems. Since the learning initiative is transferred to the student, the PBL method promotes students’ own abilities. This allows a better assessment of the true workload that the student carries out in the subject. It follows that the methodology conforms to the guidelines of the Bologna document, which quantifies the student workload in a subject by means of the European Credit Transfer System (ECTS). PBL is currently applied in undergraduate studies needing strong practical training such as medicine, nursing or law sciences. Although this is also the case in engineering studies, surprisingly few experiences have been reported. In this paper we propose to use PBL in the educational organization of the Computer Graphics subjects in the Computer Science degree. Our PBL project focuses on the development of a C++ graphical environment based on the OpenGL libraries for visualization and handling of different graphical objects. The starting point is a basic skeleton that already includes lighting functions, perspective projection with mouse interaction to change the point of view and three predefined objects. Students have to complete this skeleton by adding their own functions to solve the project. A total of 10 projects have been proposed and successfully solved. The exercises range from human face rendering to articulated objects, such as robot arms or puppets. In the present paper we extensively report the statement and educational objectives for two of the projects: solar system visualization and a chess game.
We report our earlier educational experience based on the standard classroom theoretical, problem and practice sessions and the reasons that motivated searching for other learning methods. We have mainly chosen PBL because it improves the student learning initiative. We have applied the PBL educational model since the beginning of the second semester. Student feedback indicates an increased interest in the subject. We present a comparative study of the teachers’ and students’ workload between PBL and the classic teaching approach, which suggests that the workload increase in PBL is not as high as it seems.
Address	Dublin; Ireland; September 2005
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	EUROGRAPHICS
Notes	IAM;ADAS;			Approved	no
Call Number	IAM @ iam @ MGJ2005			Serial	1593



Author	Laura Lopez-Fuentes; Joost Van de Weijer; Marc Bolaños; Harald Skinnemoen
Title	Multi-modal Deep Learning Approach for Flood Detection			Type	Conference Article
Year	2017	Publication	MediaEval Benchmarking Initiative for Multimedia Evaluation	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract	In this paper we propose a multi-modal deep learning approach to detect floods in social media posts. Social media posts normally contain some metadata and/or visual information, therefore in order to detect the floods we use this information. The model is based on a Convolutional Neural Network which extracts the visual features and a bidirectional Long Short-Term Memory network to extract the semantic features from the textual metadata. We validate the method on images extracted from Flickr which contain both visual information and metadata and compare the results when using both, visual information only or metadata only. This work has been done in the context of the MediaEval Multimedia Satellite Task.
Address	Dublin; Ireland; September 2017
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	MediaEval
Notes	LAMP; 600.084; 600.109; 600.120			Approved	no
Call Number	Admin @ si @ LWB2017a			Serial	2974



Author	Jaime Lopez-Krahe; Josep Llados; Enric Marti
Title	Architectural Floor Plan Analysis			Type	Report
Year	2000	Publication	CVonline	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Edinburgh, UK
Corporate Author				Thesis
Publisher	University of Edinburgh	Place of Publication		Editor	Robert B. Fisher
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium	online pdf
Area		Expedition		Conference
Notes	DAG;IAM			Approved	no
Call Number	IAM @ iam @ LLM2000			Serial	1561



Author	Fadi Dornaika; Angel Sappa
Title	Real Time on Board Stereo Camera Pose through Image Registration			Type	Conference Article
Year	2008	Publication	IEEE Intelligent Vehicles Symposium	Abbreviated Journal
Volume		Issue		Pages	804–809
Keywords
Abstract
Address	Eindhoven (Netherlands)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ADAS			Approved	no
Call Number	ADAS @ adas @ DoS2008a			Serial	1015



Author	Jose Manuel Alvarez; Antonio Lopez; Ramon Baldrich
Title	Illuminant Invariant Model-Based Road Segmentation			Type	Conference Article
Year	2008	Publication	IEEE Intelligent Vehicles Symposium	Abbreviated Journal
Volume		Issue		Pages	1155–1180
Keywords	road detection
Abstract
Address	Eindhoven (The Netherlands)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ADAS;CIC			Approved	no
Call Number	ADAS @ adas @ ALB2008			Serial	1045



Author	Ignasi Rius; Dani Rowe; Jordi Gonzalez; Xavier Roca
Title	A 3D Dynamic Model of Human Actions for Probabilistic Image Tracking			Type	Book Chapter
Year	2005	Publication	Pattern Recognition and Image Analysis (IbPRIA 2005), LNCS 3522: 529–536	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Estoril (Portugal)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ISE			Approved	no
Call Number	ISE @ ise @ RRG2005b			Serial	544



Author	Dani Rowe; Ignasi Rius; Jordi Gonzalez; Xavier Roca; Juan J. Villanueva
Title	Probabilistic Image-Based Tracking: Improving Particle Filtering			Type	Book Chapter
Year	2005	Publication	Pattern Recognition and Image Analysis (IbPRIA 2005), LNCS 3522: 85–92	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Estoril (Portugal)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ISE			Approved	no
Call Number	ISE @ ise @ RRG2005a			Serial	545



Author	Agata Lapedriza; David Masip; Jordi Vitria
Title	The contribution of external features to face recognition			Type	Book Chapter
Year	2005	Publication	Pattern Recognition and Image Analysis (IbPRIA 2005), LNCS 3523: 537–544	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Estoril (Portugal)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	OR;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ LMV2005a			Serial	546



Author	Jaume Amores; N. Sebe; Petia Radeva
Title	Efficient Object-Class Recognition by Boosting Contextual Information			Type	Miscellaneous
Year	2005	Publication	Pattern Recognition and Image Analysis, IbPRIA 2005, LNCS 3522:28–35	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Estoril (Portugal)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	ADAS;MILAB			Approved	no
Call Number	ADAS @ adas @ ASR2005b			Serial	554