Publicacions CVC -- Query Results

	Publicacions CVC Home \| Show All \| Simple Search \| Advanced Search \| Add Record \| Import	Login Quick Search: Field: contains: ...
	376–390 of 3413 records found matching your query (RSS \| history):

Search & Display Options

Select All Deselect All

[11–20] << 21 22 23 24 25 26 27 28 29 30 >> [31–40]

List View

Citations

Details

	Records
	Author	Jiaolong Xu; David Vazquez; Antonio Lopez; Javier Marin; Daniel Ponsa
	Title	Learning a Part-based Pedestrian Detector in Virtual World			Type	Journal Article
	Year	2014	Publication	IEEE Transactions on Intelligent Transportation Systems	Abbreviated Journal	TITS
	Volume	15	Issue	5	Pages	2121-2131
	Keywords	Domain Adaptation; Pedestrian Detection; Virtual Worlds
	Abstract	Detecting pedestrians with on-board vision systems is of paramount interest for assisting drivers to prevent vehicle-to-pedestrian accidents. The core of a pedestrian detector is its classification module, which aims at deciding if a given image window contains a pedestrian. Given the difficulty of this task, many classifiers have been proposed during the last fifteen years. Among them, the so-called (deformable) part-based classifiers including multi-view modeling are usually top ranked in accuracy. Training such classifiers is not trivial since a proper aspect clustering and spatial part alignment of the pedestrian training samples are crucial for obtaining an accurate classifier. In this paper, first we perform automatic aspect clustering and part alignment by using virtual-world pedestrians, i.e., human annotations are not required. Second, we use a mixture-of-parts approach that allows part sharing among different aspects. Third, these proposals are integrated in a learning framework which also allows to incorporate real-world training data to perform domain adaptation between virtual- and real-world cameras. Overall, the obtained results on four popular on-board datasets show that our proposal clearly outperforms the state-of-the-art deformable part-based detector known as latent SVM.
	Address
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1931-0587	ISBN	978-1-4673-2754-1	Medium
	Area		Expedition		Conference
	Notes	ADAS; 600.076			Approved	no
	Call Number	ADAS @ adas @ XVL2014			Serial	2433
Permanent link to this record



	Author	Ekaterina Zaytseva; Jordi Vitria
	Title	A search based approach to non maximum suppression in face detection			Type	Conference Article
	Year	2012	Publication	19th IEEE International Conference on Image Processing	Abbreviated Journal
	Volume		Issue		Pages
	Keywords
	Abstract	Poster paper TA.P5.12 Face detectors typically produce a large number of false positives and this leads to the need to have a further non maximum suppression stage to eliminate multiple and spurious responses. This stage is based on considering spatial heuristics: true positive responses are selected by implicitly considering several restrictions on the spatial distribution of detector responses in natural images. In this paper we analyze the limitations of this approach and propose an efficient search method to overcome them. Results show how the application of this new non-maximum suppression approach to a simple face detector boosts its performance to state of the art results.
	Address	Orlando; USA; September 2012
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1522-4880	ISBN	978-1-4673-2534-9	Medium
	Area		Expedition		Conference	ICIP
	Notes	OR;MV			Approved	no
	Call Number	Admin @ si @ ZaV2012			Serial	2060
Permanent link to this record



	Author	Laura Igual; Joan Carles Soliva; Antonio Hernandez; Sergio Escalera; Oscar Vilarroya; Petia Radeva
	Title	Supervised Brain Segmentation and Classification in Diagnostic of Attention-Deficit/Hyperactivity Disorder			Type	Conference Article
	Year	2012	Publication	High Performance Computing and Simulation, International Conference on	Abbreviated Journal
	Volume		Issue		Pages	182-187
	Keywords
	Abstract	This paper presents an automatic method for external and internal segmentation of the caudate nucleus in Magnetic Resonance Images (MRI) based on statistical and structural machine learning approaches. This method is applied in Attention-Deficit/Hyperactivity Disorder (ADHD) diagnosis. The external segmentation method adapts the Graph Cut energy-minimization model to make it suitable for segmenting small, low-contrast structures, such as the caudate nucleus. In particular, new energy function data and boundary potentials are defined and a supervised energy term based on contextual brain structures is added. Furthermore, the internal segmentation method learns a classifier based on shape features of the Region of Interest (ROI) in MRI slices. The results show accurate external and internal caudate segmentation in a real data set and similar performance of ADHD diagnostic test to manual annotation.
	Address	Madrid
	Corporate Author				Thesis
	Publisher	IEEE Xplore	Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN	978-1-4673-2359-8	Medium
	Area		Expedition		Conference	HPCS
	Notes	MILAB;HuPBA			Approved	no
	Call Number	Admin @ si @ ISH2012a			Serial	2038
Permanent link to this record



	Author	Petia Radeva; Michal Drozdzal; Santiago Segui; Laura Igual; Carolina Malagelada; Fernando Azpiroz; Jordi Vitria
	Title	Active labeling: Application to wireless endoscopy analysis			Type	Conference Article
	Year	2012	Publication	High Performance Computing and Simulation, International Conference on	Abbreviated Journal
	Volume		Issue		Pages	174-181
	Keywords
	Abstract	Today, robust learners trained in a real supervised machine learning application should count with a rich collection of positive and negative examples. Although in many applications, it is not difficult to obtain huge amount of data, labeling those data can be a very expensive process, especially when dealing with data of high variability and complexity. A good example of such cases are data from medical imaging applications where annotating anomalies like tumors, polyps, atherosclerotic plaque or informative frames in wireless endoscopy need highly trained experts. Building a representative set of training data from medical videos (e.g. Wireless Capsule Endoscopy) means that thousands of frames to be labeled by an expert. It is quite normal that data in new videos come different and thus are not represented by the training set. In this paper, we review the main approaches on active learning and illustrate how active learning can help to reduce expert effort in constructing the training sets. We show that applying active learning criteria, the number of human interventions can be significantly reduced. The proposed system allows the annotation of informative/non-informative frames of Wireless Capsule Endoscopy video containing more than 30000 frames each one with less than 100 expert ”clicks”.
	Address
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN	978-1-4673-2359-8	Medium
	Area		Expedition		Conference	HPCS
	Notes	MILAB; OR;MV			Approved	no
	Call Number	Admin @ si @ RDS2012			Serial	2152
Permanent link to this record



	Author	Jon Almazan; David Fernandez; Alicia Fornes; Josep Llados; Ernest Valveny
	Title	A Coarse-to-Fine Approach for Handwritten Word Spotting in Large Scale Historical Documents Collection			Type	Conference Article
	Year	2012	Publication	13th International Conference on Frontiers in Handwriting Recognition	Abbreviated Journal
	Volume		Issue		Pages	453-458
	Keywords
	Abstract	In this paper we propose an approach for word spotting in handwritten document images. We state the problem from a focused retrieval perspective, i.e. locating instances of a query word in a large scale dataset of digitized manuscripts. We combine two approaches, namely one based on word segmentation and another one segmentation-free. The first approach uses a hashing strategy to coarsely prune word images that are unlikely to be instances of the query word. This process is fast but has a low precision due to the errors introduced in the segmentation step. The regions containing candidate words are sent to the second process based on a state of the art technique from the visual object detection field. This discriminative model represents the appearance of the query word and computes a similarity score. In this way we propose a coarse-to-fine approach achieving a compromise between efficiency and accuracy. The validation of the model is shown using a collection of old handwritten manuscripts. We appreciate a substantial improvement in terms of precision regarding the previous proposed method with a low computational cost increase.
	Address
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN	978-1-4673-2262-1	Medium
	Area		Expedition		Conference	ICFHR
	Notes	DAG			Approved	no
	Call Number	DAG @ dag @ AFF2012			Serial	1983
Permanent link to this record



	Author	Marçal Rusiñol; Josep Llados
	Title	The Role of the Users in Handwritten Word Spotting Applications: Query Fusion and Relevance Feedback			Type	Conference Article
	Year	2012	Publication	13th International Conference on Frontiers in Handwriting Recognition	Abbreviated Journal
	Volume		Issue		Pages	55-60
	Keywords
	Abstract	In this paper we present the importance of including the user in the loop in a handwritten word spotting framework. Several off-the-shelf query fusion and relevance feedback strategies have been tested in the handwritten word spotting context. The increase in terms of precision when the user is included in the loop is assessed using two datasets of historical handwritten documents and a baseline word spotting approach based on a bag-of-visual-words model.
	Address	Bari, Italy
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN	978-1-4673-2262-1	Medium
	Area		Expedition		Conference	ICFHR
	Notes	DAG			Approved	no
	Call Number	Admin @ si @ RuL2012			Serial	2054
Permanent link to this record



	Author	Volkmar Frinken; Markus Baumgartner; Andreas Fischer; Horst Bunke
	Title	Semi-Supervised Learning for Cursive Handwriting Recognition using Keyword Spotting			Type	Conference Article
	Year	2012	Publication	13th International Conference on Frontiers in Handwriting Recognition	Abbreviated Journal
	Volume		Issue		Pages	49-54
	Keywords
	Abstract	State-of-the-art handwriting recognition systems are learning-based systems that require large sets of training data. The creation of training data, and consequently the creation of a well-performing recognition system, requires therefore a substantial amount of human work. This can be reduced with semi-supervised learning, which uses unlabeled text lines for training as well. Current approaches estimate the correct transcription of the unlabeled data via handwriting recognition which is not only extremely demanding as far as computational costs are concerned but also requires a good model of the target language. In this paper, we propose a different approach that makes use of keyword spotting, which is significantly faster and does not need any language model. In a set of experiments we demonstrate its superiority over existing approaches.
	Address	Bari, Italy
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	10.1109/ICFHR.2012.268	ISBN	978-1-4673-2262-1	Medium
	Area		Expedition		Conference	ICFHR
	Notes	DAG			Approved	no
	Call Number	Admin @ si @ FBF2012			Serial	2055
Permanent link to this record



	Author	Emanuel Indermühle; Volkmar Frinken; Horst Bunke
	Title	Mode Detection in Online Handwritten Documents using BLSTM Neural Networks			Type	Conference Article
	Year	2012	Publication	13th International Conference on Frontiers in Handwriting Recognition	Abbreviated Journal
	Volume		Issue		Pages	302-307
	Keywords
	Abstract	Mode detection in online handwritten documents refers to the process of distinguishing different types of contents, such as text, formulas, diagrams, or tables, one from another. In this paper a new approach to mode detection is proposed that uses bidirectional long-short term memory (BLSTM) neural networks. The BLSTM neural network is a novel type of recursive neural network that has been successfully applied in speech and handwriting recognition. In this paper we show that it has the potential to significantly outperform traditional methods for mode detection, which are usually based on stroke classification. As a further advantage over previous approaches, the proposed system is trainable and does not rely on user-defined heuristics. Moreover, it can be easily adapted to new or additional types of modes by just providing the system with new training data.
	Address	Bari, italy
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN	978-1-4673-2262-1	Medium
	Area		Expedition		Conference	ICFHR
	Notes	DAG			Approved	no
	Call Number	Admin @ si @ IFB2012			Serial	2056
Permanent link to this record



	Author	David Fernandez; Josep Llados; Alicia Fornes; R.Manmatha
	Title	On Influence of Line Segmentation in Efficient Word Segmentation in Old Manuscripts			Type	Conference Article
	Year	2012	Publication	13th International Conference on Frontiers in Handwriting Recognition	Abbreviated Journal
	Volume		Issue		Pages	763-768
	Keywords	document image processing;handwritten character recognition;history;image segmentation;Spanish document;historical document;line segmentation;old handwritten document;old manuscript;word segmentation;Bifurcation;Dynamic programming;Handwriting recognition;Image segmentation;Measurement;Noise;Skeleton;Segmentation;document analysis;document and text processing;handwriting analysis;heuristics;path-finding
	Abstract	he objective of this work is to show the importance of a good line segmentation to obtain better results in the segmentation of words of historical documents. We have used the approach developed by Manmatha and Rothfeder [1] to segment words in old handwritten documents. In their work the lines of the documents are extracted using projections. In this work, we have developed an approach to segment lines more efficiently. The new line segmentation algorithm tackles with skewed, touching and noisy lines, so it is significantly improves word segmentation. Experiments using Spanish documents from the Marriages Database of the Barcelona Cathedral show that this approach reduces the error rate by more than 20%
	Address
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN		ISBN	978-1-4673-2262-1	Medium
	Area		Expedition		Conference	ICFHR
	Notes	DAG			Approved	no
	Call Number	Admin @ si @ FLF2012			Serial	2200
Permanent link to this record



	Author	David Vazquez; Antonio Lopez; Daniel Ponsa
	Title	Unsupervised Domain Adaptation of Virtual and Real Worlds for Pedestrian Detection			Type	Conference Article
	Year	2012	Publication	21st International Conference on Pattern Recognition	Abbreviated Journal
	Volume		Issue		Pages	3492 - 3495
	Keywords	Pedestrian Detection; Domain Adaptation; Virtual worlds
	Abstract	Vision-based object detectors are crucial for different applications. They rely on learnt object models. Ideally, we would like to deploy our vision system in the scenario where it must operate, and lead it to self-learn how to distinguish the objects of interest, i.e., without human intervention. However, the learning of each object model requires labelled samples collected through a tiresome manual process. For instance, we are interested in exploring the self-training of a pedestrian detector for driver assistance systems. Our first approach to avoid manual labelling consisted in the use of samples coming from realistic computer graphics, so that their labels are automatically available [12]. This would make possible the desired self-training of our pedestrian detector. However, as we showed in [14], between virtual and real worlds it may be a dataset shift. In order to overcome it, we propose the use of unsupervised domain adaptation techniques that avoid human intervention during the adaptation process. In particular, this paper explores the use of the transductive SVM (T-SVM) learning algorithm in order to adapt virtual and real worlds for pedestrian detection (Fig. 1).
	Address	Tsukuba Science City, Japan
	Corporate Author				Thesis
	Publisher	IEEE	Place of Publication	Tsukuba Science City, JAPAN	Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1051-4651	ISBN	978-1-4673-2216-4	Medium
	Area		Expedition		Conference	ICPR
	Notes	ADAS			Approved	no
	Call Number	ADAS @ adas @ VLP2012			Serial	1981
Permanent link to this record



	Author	Jose Carlos Rubio; Joan Serrat; Antonio Lopez; N. Paragios
	Title	Image Contextual Representation and Matching through Hierarchies and Higher Order Graphs			Type	Conference Article
	Year	2012	Publication	21st International Conference on Pattern Recognition	Abbreviated Journal
	Volume		Issue		Pages	2664 - 2667
	Keywords
	Abstract	We present a region matching algorithm which establishes correspondences between regions from two segmented images. An abstract graph-based representation conceals the image in a hierarchical graph, exploiting the scene properties at two levels. First, the similarity and spatial consistency of the image semantic objects is encoded in a graph of commute times. Second, the cluttered regions of the semantic objects are represented with a shape descriptor. Many-to-many matching of regions is specially challenging due to the instability of the segmentation under slight image changes, and we explicitly handle it through high order potentials. We demonstrate the matching approach applied to images of world famous buildings, captured under different conditions, showing the robustness of our method to large variations in illumination and viewpoint.
	Address	Tsukuba Science City, Japan
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1051-4651	ISBN	978-1-4673-2216-4	Medium
	Area		Expedition		Conference	ICPR
	Notes	ADAS			Approved	no
	Call Number	Admin @ si @ RSL2012a;			Serial	2032
Permanent link to this record



	Author	German Ros; Jesus Martinez del Rincon; Gines Garcia-Mateos
	Title	Articulated Particle Filter for Hand Tracking			Type	Conference Article
	Year	2012	Publication	21st International Conference on Pattern Recognition	Abbreviated Journal
	Volume		Issue		Pages	3581 - 3585
	Keywords
	Abstract	This paper proposes a new version of Particle Filter, called Articulated Particle Filter – ArPF -, which has been specifically designed for an efficient sampling of hierarchical spaces, generated by articulated objects. Our approach decomposes the articulated motion into layers for efficiency purposes, making use of a careful modeling of the diffusion noise along with its propagation through the articulations. This produces an increase of accuracy and prevent for divergences. The algorithm is tested on hand tracking due to its complex hierarchical articulated nature. With this purpose, a new dataset generation tool for quantitative evaluation is also presented in this paper.
	Address	Tsukuba Science City, Japan
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1051-4651	ISBN	978-1-4673-2216-4	Medium
	Area		Expedition		Conference	ICPR
	Notes	ADAS			Approved	no
	Call Number	Admin @ si @ RMG2012			Serial	2031
Permanent link to this record



	Author	Volkmar Frinken; Francisco Zamora; Salvador España; Maria Jose Castro; Andreas Fischer; Horst Bunke
	Title	Long-Short Term Memory Neural Networks Language Modeling for Handwriting Recognition			Type	Conference Article
	Year	2012	Publication	21st International Conference on Pattern Recognition	Abbreviated Journal
	Volume		Issue		Pages	701-704
	Keywords
	Abstract	Unconstrained handwritten text recognition systems maximize the combination of two separate probability scores. The first one is the observation probability that indicates how well the returned word sequence matches the input image. The second score is the probability that reflects how likely a word sequence is according to a language model. Current state-of-the-art recognition systems use statistical language models in form of bigram word probabilities. This paper proposes to model the target language by means of a recurrent neural network with long-short term memory cells. Because the network is recurrent, the considered context is not limited to a fixed size especially as the memory cells are designed to deal with long-term dependencies. In a set of experiments conducted on the IAM off-line database we show the superiority of the proposed language model over statistical n-gram models.
	Address	Tsukuba Science City, Japan
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1051-4651	ISBN	978-1-4673-2216-4	Medium
	Area		Expedition		Conference	ICPR
	Notes	DAG			Approved	no
	Call Number	Admin @ si @ FZE2012			Serial	2052
Permanent link to this record



	Author	Marçal Rusiñol; Dimosthenis Karatzas; Andrew Bagdanov; Josep Llados
	Title	Multipage Document Retrieval by Textual and Visual Representations			Type	Conference Article
	Year	2012	Publication	21st International Conference on Pattern Recognition	Abbreviated Journal
	Volume		Issue		Pages	521-524
	Keywords
	Abstract	In this paper we present a multipage administrative document image retrieval system based on textual and visual representations of document pages. Individual pages are represented by textual or visual information using a bag-of-words framework. Different fusion strategies are evaluated which allow the system to perform multipage document retrieval on the basis of a single page retrieval system. Results are reported on a large dataset of document images sampled from a banking workflow.
	Address	Tsukuba Science City, Japan
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1051-4651	ISBN	978-1-4673-2216-4	Medium
	Area		Expedition		Conference	ICPR
	Notes	DAG			Approved	no
	Call Number	Admin @ si @ RKB2012			Serial	2053
Permanent link to this record



	Author	Antonio Hernandez; Miguel Angel Bautista; Xavier Perez Sala; Victor Ponce; Xavier Baro; Oriol Pujol; Cecilio Angulo; Sergio Escalera
	Title	BoVDW: Bag-of-Visual-and-Depth-Words for Gesture Recognition			Type	Conference Article
	Year	2012	Publication	21st International Conference on Pattern Recognition	Abbreviated Journal
	Volume		Issue		Pages
	Keywords
	Abstract	We present a Bag-of-Visual-and-Depth-Words (BoVDW) model for gesture recognition, an extension of the Bag-of-Visual-Words (BoVW) model, that benefits from the multimodal fusion of visual and depth features. State-of-the-art RGB and depth features, including a new proposed depth descriptor, are analysed and combined in a late fusion fashion. The method is integrated in a continuous gesture recognition pipeline, where Dynamic Time Warping (DTW) algorithm is used to perform prior segmentation of gestures. Results of the method in public data sets, within our gesture recognition pipeline, show better performance in comparison to a standard BoVW model.
	Address
	Corporate Author				Thesis
	Publisher		Place of Publication		Editor
	Language		Summary Language		Original Title
	Series Editor		Series Title		Abbreviated Series Title
	Series Volume		Series Issue		Edition
	ISSN	1051-4651	ISBN	978-1-4673-2216-4	Medium
	Area		Expedition		Conference	ICPR
	Notes	HuPBA;MV			Approved	no
	Call Number	Admin @ si @ HBP2012			Serial	2122
Permanent link to this record