|
F. de la Torre, Jordi Vitria, Petia Radeva, & J. Melenchon. (2000). EigenFiltering for flexible Eigentracking.
|
|
|
X. Varona, Jordi Gonzalez, Xavier Roca, & Juan J. Villanueva. (2000). iTrack: Image-based Probabilistic Tracking of People.
|
|
|
D. Lloret, Joan Serrat, Antonio Lopez, A. Soler, & Juan J. Villanueva. (2000). Retinal image registration using creases as anatomical landmarks.
Abstract: Retinal images are routinely used in ophthalmology to study the optic nerve head and the retina. To objectively assess the evolution of an illness, images taken at different times must be registered. Most methods so far have been designed for a single image modality, such as temporal series or stereo pairs of angiographies, fluorescein angiographies, or scanning laser ophthalmoscope (SLO) images, which makes them prone to fail when conditions vary. In contrast, the method we propose has been shown to be accurate and reliable on all of the former modalities. It has been adapted to 2D from the 3D registration of CT and MR images. Relevant features (also known as landmarks) are extracted by means of a robust creaseness operator, and the resulting images are iteratively transformed until a maximum in their correlation is achieved. Our method has succeeded on more than 100 pairs tried so far, in all cases also including scaling as a parameter to be optimized.
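The correlation-maximization step described in this abstract can be sketched in simplified form. The snippet below is an illustration only, not the authors' method: it omits the creaseness operator and scale optimization, and searches integer translations exhaustively for the shift that maximizes normalized cross-correlation between two feature images.

```python
import numpy as np

def ncc(a, b):
    """Normalized cross-correlation of two equally sized feature images."""
    a = a - a.mean()
    b = b - b.mean()
    denom = np.sqrt((a * a).sum() * (b * b).sum())
    return float((a * b).sum() / denom) if denom > 0 else 0.0

def best_translation(fixed, moving, max_shift=5):
    """Exhaustive search for the integer shift maximizing NCC (scale omitted)."""
    best_score, best_shift = -2.0, (0, 0)
    for dy in range(-max_shift, max_shift + 1):
        for dx in range(-max_shift, max_shift + 1):
            # Circular shift stands in for a proper resampled transform here
            shifted = np.roll(np.roll(moving, dy, axis=0), dx, axis=1)
            score = ncc(fixed, shifted)
            if score > best_score:
                best_score, best_shift = score, (dy, dx)
    return best_shift
```

A real registration pipeline would replace the brute-force loop with an iterative optimizer and include scaling, as the abstract notes.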
|
|
|
Robert Benavente, Gemma Sanchez, Ramon Baldrich, Maria Vanrell, & Josep Llados. (2000). Normalized colour segmentation for human appearance description.
|
|
|
Xose M. Pardo, & Petia Radeva. (2000). Discriminant snakes for 3D reconstruction in medical images.
|
|
|
Ricardo Toledo, X. Orriols, Petia Radeva, X. Binefa, Jordi Vitria, Cristina Cañero, et al. (2000). Eigensnakes for vessel segmentation in angiography.
|
|
|
Cristina Cañero, Petia Radeva, Ricardo Toledo, Juan J. Villanueva, & J. Mauri. (2000). 3D Curve Reconstruction by Biplane Snakes.
|
|
|
Joan Serrat, Antonio Lopez, & D. Lloret. (2000). On ridges and valleys.
|
|
|
A. Pujol, Felipe Lumbreras, X. Varona, & Juan J. Villanueva. (2000). Locating people in indoor scenes for real applications.
|
|
|
Youssef El Rhabi, Simon Loic, & Brun Luc. (2015). Estimation de la pose d’une caméra à partir d’un flux vidéo en s’approchant du temps réel [Camera pose estimation from a video stream while approaching real time]. In 15ème édition d'ORASIS, journées francophones des jeunes chercheurs en vision par ordinateur (ORASIS 2015).
Abstract: Finding a way to estimate the pose of an image quickly and robustly is essential in augmented reality. Here we discuss the approach we chose in order to get closer to real time by using SIFT points [4]. We propose a method based on filtering both the SIFT points and the images on which to focus, so that processing concentrates on relevant data.
Keywords: Augmented Reality; SFM; SLAM; real time pose computation; 2D/3D registration
|
|
|
R. Herault, Franck Davoine, Fadi Dornaika, & Y. Grandvalet. (2006). Simultaneous and robust face and facial action tracking.
|
|
|
Sergio Escalera, Jordi Gonzalez, Xavier Baro, Miguel Reyes, Oscar Lopes, Isabelle Guyon, et al. (2013). Multi-modal Gesture Recognition Challenge 2013: Dataset and Results. In 15th ACM International Conference on Multimodal Interaction (pp. 445–452).
Abstract: The recognition of continuous natural gestures is a complex and challenging problem due to the multi-modal nature of the visual cues involved (e.g. finger and lip movements, subtle facial expressions, body pose, etc.), as well as technical limitations such as spatial and temporal resolution and unreliable depth cues. In order to promote research advances in this field, we organized a challenge on multi-modal gesture recognition. We made available a large video database of 13,858 gestures from a lexicon of 20 Italian gesture categories recorded with a Kinect™ camera, providing the audio, skeletal model, user mask, RGB, and depth images. The focus of the challenge was on user-independent multiple gesture learning. There are no resting positions, and the gestures are performed in continuous sequences lasting 1-2 minutes, containing between 8 and 20 gesture instances each. As a result, the dataset contains around 1,720,800 frames. In addition to the 20 main gesture categories, ‘distracter’ gestures are included, meaning that additional audio and gestures outside the vocabulary are present. The final evaluation of the challenge was defined in terms of the Levenshtein edit distance, where the goal was to indicate the real order of gestures within the sequence. 54 international teams participated in the challenge, and outstanding results were obtained by the first-ranked participants.
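The Levenshtein edit distance used for evaluation in this challenge can be sketched with the standard dynamic-programming recurrence. The gesture labels below are made up for illustration; the metric counts the insertions, deletions, and substitutions needed to turn a predicted gesture sequence into the ground-truth one.

```python
def levenshtein(pred, truth):
    """Edit distance between a predicted and a ground-truth label sequence."""
    m, n = len(pred), len(truth)
    d = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        d[i][0] = i  # delete all of pred
    for j in range(n + 1):
        d[0][j] = j  # insert all of truth
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = 0 if pred[i - 1] == truth[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,          # deletion
                          d[i][j - 1] + 1,          # insertion
                          d[i - 1][j - 1] + cost)   # substitution (or match)
    return d[m][n]

# One spurious gesture and one missed gesture cost two edits:
print(levenshtein([3, 7, 7, 12, 5], [3, 7, 12, 5, 9]))  # → 2
```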
|
|
|
Victor Ponce, Sergio Escalera, & Xavier Baro. (2013). Multi-modal Social Signal Analysis for Predicting Agreement in Conversation Settings. In 15th ACM International Conference on Multimodal Interaction (pp. 495–502).
Abstract: In this paper we present a non-invasive ambient intelligence framework for the analysis of non-verbal communication applied to conversational settings. In particular, we apply feature extraction techniques to multi-modal audio-RGB-depth data. We compute a set of behavioral indicators that define communicative cues coming from the fields of psychology and observational methodology. We test our methodology on data captured in victim-offender mediation scenarios. Using different state-of-the-art classification approaches, our system achieves over 75% recognition accuracy in predicting agreement among the parties involved in the conversations, using the experts' opinions as ground truth.
|
|
|
Yaxing Wang, Chenshen Wu, Luis Herranz, Joost Van de Weijer, Abel Gonzalez-Garcia, & Bogdan Raducanu. (2018). Transferring GANs: generating images from limited data. In 15th European Conference on Computer Vision (Vol. 11210, pp. 220–236). LNCS.
Abstract: Transferring knowledge of pre-trained networks to new domains by means of fine-tuning is a widely used practice for applications based on discriminative models. To the best of our knowledge this practice has not been studied within the context of generative deep networks. Therefore, we study domain adaptation applied to image generation with generative adversarial networks. We evaluate several aspects of domain adaptation, including the impact of target domain size, the relative distance between source and target domain, and the initialization of conditional GANs. Our results show that using knowledge from pre-trained networks can shorten the convergence time and significantly improve the quality of the generated images, especially when target data is limited. We show that these conclusions can also be drawn for conditional GANs, even when the pre-trained model was trained without conditioning. Our results also suggest that density is more important than diversity: a dataset with one or a few densely sampled classes is a better source than more diverse datasets such as ImageNet or Places.
Keywords: Generative adversarial networks; Transfer learning; Domain adaptation; Image generation
|
|
|
Pau Rodriguez, Josep M. Gonfaus, Guillem Cucurull, Xavier Roca, & Jordi Gonzalez. (2018). Attend and Rectify: A Gated Attention Mechanism for Fine-Grained Recovery. In 15th European Conference on Computer Vision (Vol. 11212, pp. 357–372). LNCS.
Abstract: We propose a novel attention mechanism to enhance Convolutional Neural Networks for fine-grained recognition. It learns to attend to lower-level feature activations without requiring part annotations and uses these activations to update and rectify the output likelihood distribution. In contrast to other approaches, the proposed mechanism is modular, architecture-independent, and efficient both in terms of parameters and computation required. Experiments show that networks augmented with our approach systematically improve their classification accuracy and become more robust to clutter. As a result, Wide Residual Networks augmented with our proposal surpass the state-of-the-art classification accuracies on CIFAR-10, the Adience gender recognition task, Stanford Dogs, and UEC Food-100.
Keywords: Deep Learning; Convolutional Neural Networks; Attention
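The rectification idea described in this abstract, combining the network's original prediction with predictions derived from lower-level attention heads, can be illustrated numerically. This is a simplified sketch, not the paper's exact formulation: it blends class distributions with softmax-normalized gates so the result remains a valid distribution.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def gated_output(original_logits, attention_logits, gate_logits):
    """Convex combination of the original class distribution and per-head
    attention distributions, weighted by learned gates.

    original_logits:  (classes,)       network output before rectification
    attention_logits: (heads, classes) class scores from each attention head
    gate_logits:      (heads + 1,)     one gate per head plus the original
    """
    gates = softmax(gate_logits)                # weights summing to 1
    heads = softmax(attention_logits, axis=-1)  # per-head class distributions
    original = softmax(original_logits)
    return gates[0] * original + (gates[1:, None] * heads).sum(axis=0)
```

Because the gates form a convex combination, a head that disagrees with the original prediction can shift (rectify) the final distribution without ever producing invalid probabilities.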
|
|