Home | [31–40] << 41 42 43 44 45 46 47 48 49 50 >> [51–60] |
![]() |
Records | |||||
---|---|---|---|---|---|
Author | Sergio Escalera; R. M. Martinez; Jordi Vitria; Petia Radeva; Maria Teresa Anguera | ||||
Title | Dominance Detection in Face-to-face Conversations | Type | Conference Article | ||
Year | 2009 | Publication ![]() |
2nd IEEE Workshop on CVPR for Human communicative Behavior analysis | Abbreviated Journal | |
Volume | Issue | Pages | 97–102 | ||
Keywords | |||||
Abstract | Dominance is referred to the level of influence a person has in a conversation. Dominance is an important research area in social psychology, but the problem of its automatic estimation is a very recent topic in the contexts of social and wearable computing. In this paper, we focus on dominance detection from visual cues. We estimate the correlation among observers by categorizing the dominant people in a set of face-to-face conversations. Different dominance indicators from gestural communication are defined, manually annotated, and compared to the observers opinion. Moreover, the considered indicators are automatically extracted from video sequences and learnt by using binary classifiers. Results from the three analysis shows a high correlation and allows the categorization of dominant people in public discussion video sequences. | ||||
Address | Miami, USA | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | 2160-7508 | ISBN | 978-1-4244-3994-2 | Medium | |
Area | Expedition | Conference | CVPR | ||
Notes | HuPBA; OR; MILAB;MV | Approved | no | ||
Call Number | BCNPCL @ bcnpcl @ EMV2009 | Serial | 1227 | ||
Permanent link to this record | |||||
Author | Ariel Amato; Angel Sappa; Alicia Fornes; Felipe Lumbreras; Josep Llados | ||||
Title | Divide and Conquer: Atomizing and Parallelizing A Task in A Mobile Crowdsourcing Platform | Type | Conference Article | ||
Year | 2013 | Publication ![]() |
2nd International ACM Workshop on Crowdsourcing for Multimedia | Abbreviated Journal | |
Volume | Issue | Pages | 21-22 | ||
Keywords | |||||
Abstract | In this paper we present some conclusions about the advantages of having an efficient task formulation when a crowdsourcing platform is used. In particular we show how the task atomization and distribution can help to obtain results in an efficient way. Our proposal is based on a recursive splitting of the original task into a set of smaller and simpler tasks. As a result both more accurate and faster solutions are obtained. Our evaluation is performed on a set of ancient documents that need to be digitized. | ||||
Address | Barcelona; October 2013 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | 978-1-4503-2396-3 | Medium | ||
Area | Expedition | Conference | CrowdMM | ||
Notes | ADAS; ISE; DAG; 600.054; 600.055; 600.045; 600.061; 602.006 | Approved | no | ||
Call Number | Admin @ si @ SLA2013 | Serial | 2335 | ||
Permanent link to this record | |||||
Author | Partha Pratim Roy; Josep Llados | ||||
Title | Multi-Oriented Character Recognition from Graphical Documents | Type | Conference Article | ||
Year | 2008 | Publication ![]() |
2nd International Conference on Cognition and Recognition | Abbreviated Journal | |
Volume | Issue | Pages | 30–35 | ||
Keywords | |||||
Abstract | |||||
Address | Mandya (India) | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | ICCR | ||
Notes | DAG | Approved | no | ||
Call Number | DAG @ dag @ RLP2008 | Serial | 965 | ||
Permanent link to this record | |||||
Author | Sergio Escalera; Oriol Pujol; Petia Radeva | ||||
Title | Traffic Sign Classification using Error Correcting Techniques | Type | Conference Article | ||
Year | 2007 | Publication ![]() |
2nd International Conference on Computer Vision Theory and Applications | Abbreviated Journal | |
Volume | Issue | Pages | 281–285 | ||
Keywords | |||||
Abstract | |||||
Address | Barcelona (Spain) | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | VISAPP | ||
Notes | MILAB;HuPBA | Approved | no | ||
Call Number | BCNPCL @ bcnpcl @ EPR2007a | Serial | 909 | ||
Permanent link to this record | |||||
Author | David Masip; Agata Lapedriza; Jordi Vitria | ||||
Title | Face Verification Sharing Knowledge from Different Subjects | Type | Conference Article | ||
Year | 2007 | Publication ![]() |
2nd International Conference on Computer Vision Theory and Applications | Abbreviated Journal | |
Volume | 2 | Issue | Pages | 268–289 | |
Keywords | |||||
Abstract | |||||
Address | Barcelona (Spain) | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | VISAPP´07 | ||
Notes | OR; MV | Approved | no | ||
Call Number | BCNPCL @ bcnpcl @ MLV2007a | Serial | 995 | ||
Permanent link to this record | |||||
Author | M. Cruz; Cristhian A. Aguilera-Carrasco; Boris X. Vintimilla; Ricardo Toledo; Angel Sappa | ||||
Title | Cross-spectral image registration and fusion: an evaluation study | Type | Conference Article | ||
Year | 2015 | Publication ![]() |
2nd International Conference on Machine Vision and Machine Learning | Abbreviated Journal | |
Volume | Issue | Pages | |||
Keywords | multispectral imaging; image registration; data fusion; infrared and visible spectra | ||||
Abstract | This paper presents a preliminary study on the registration and fusion of cross-spectral imaging. The objective is to evaluate the validity of widely used computer vision approaches when they are applied at different
spectral bands. In particular, we are interested in merging images from the infrared (both long wave infrared: LWIR and near infrared: NIR) and visible spectrum (VS). Experimental results with different data sets are presented. |
||||
Address | Barcelona; July 2015 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | MVML | ||
Notes | ADAS; 600.076 | Approved | no | ||
Call Number | Admin @ si @ CAV2015 | Serial | 2629 | ||
Permanent link to this record | |||||
Author | Alex Pardo; Albert Clapes; Sergio Escalera; Oriol Pujol | ||||
Title | Actions in Context: System for people with Dementia | Type | Conference Article | ||
Year | 2013 | Publication ![]() |
2nd International Workshop on Citizen Sensor Networks (Citisen2013) at the European Conference on Complex Systems | Abbreviated Journal | |
Volume | Issue | Pages | 3-14 | ||
Keywords | Multi-modal data Fusion; Computer vision; Wearable sensors; Gesture recognition; Dementia | ||||
Abstract | In the next forty years, the number of people living with dementia is expected to triple. In the last stages, people affected by this disease become dependent. This hinders the autonomy of the patient and has a huge social impact in time, money and effort. Given this scenario, we propose an ubiquitous system capable of recognizing daily specific actions. The system fuses and synchronizes data obtained from two complementary modalities – ambient and egocentric. The ambient approach consists in a fixed RGB-Depth camera for user and object recognition and user-object interaction, whereas the egocentric point of view is given by a personal area network (PAN) formed by a few wearable sensors and a smartphone, used for gesture recognition. The system processes multi-modal data in real-time, performing paralleled task recognition and modality synchronization, showing high performance recognizing subjects, objects, and interactions, showing its reliability to be applied in real case scenarios. | ||||
Address | Barcelona; September 2013 | ||||
Corporate Author | Thesis | ||||
Publisher | Springer International Publishing | Place of Publication | Editor | ||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | 0302-9743 | ISBN | 978-3-319-04177-3 | Medium | |
Area | Expedition | Conference | ECCS | ||
Notes | HUPBA;MILAB | Approved | no | ||
Call Number | Admin @ si @ PCE2013 | Serial | 2354 | ||
Permanent link to this record | |||||
Author | David Fernandez; Simone Marinai; Josep Llados; Alicia Fornes | ||||
Title | Contextual Word Spotting in Historical Manuscripts using Markov Logic Networks | Type | Conference Article | ||
Year | 2013 | Publication ![]() |
2nd International Workshop on Historical Document Imaging and Processing | Abbreviated Journal | |
Volume | Issue | Pages | 36-43 | ||
Keywords | |||||
Abstract | Natural languages can often be modelled by suitable grammars whose knowledge can improve the word spotting results. The implicit contextual information is even more useful when dealing with information that is intrinsically described as one collection of records. In this paper, we present one approach to word spotting which uses the contextual information of records to improve the results. The method relies on Markov Logic Networks to probabilistically model the relational organization of handwritten records. The performance has been evaluated on the Barcelona Marriages Dataset that contains structured handwritten records that summarize marriage information. | ||||
Address | washington; USA; August 2013 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | 978-1-4503-2115-0 | Medium | ||
Area | Expedition | Conference | HIP | ||
Notes | DAG; 600.056; 600.045; 600.061; 602.006 | Approved | no | ||
Call Number | Admin @ si @ FML2013 | Serial | 2308 | ||
Permanent link to this record | |||||
Author | Volkmar Frinken; Andreas Fischer; Carlos David Martinez Hinarejos | ||||
Title | Handwriting Recognition in Historical Documents using Very Large Vocabularies | Type | Conference Article | ||
Year | 2013 | Publication ![]() |
2nd International Workshop on Historical Document Imaging and Processing | Abbreviated Journal | |
Volume | Issue | Pages | 67-72 | ||
Keywords | |||||
Abstract | Language models are used in automatic transcription system to resolve ambiguities. This is done by limiting the vocabulary of words that can be recognized as well as estimating the n-gram probability of the words in the given text. In the context of historical documents, a non-unified spelling and the limited amount of written text pose a substantial problem for the selection of the recognizable vocabulary as well as the computation of the word probabilities. In this paper we propose for the transcription of historical Spanish text to keep the corpus for the n-gram limited to a sample of the target text, but expand the vocabulary with words gathered from external resources. We analyze the performance of such a transcription system with different sizes of external vocabularies and demonstrate the applicability and the significant increase in recognition accuracy of using up to 300 thousand external words. | ||||
Address | Washington; USA; August 2013 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | 978-1-4503-2115-0 | Medium | ||
Area | Expedition | Conference | HIP | ||
Notes | DAG; 600.056; 600.045; 600.061; 602.006; 602.101 | Approved | no | ||
Call Number | Admin @ si @ FFM2013 | Serial | 2296 | ||
Permanent link to this record | |||||
Author | Antonio Lopez; Joan Serrat; J. Saludes; Cristina Cañero; Felipe Lumbreras; T. Graf | ||||
Title | Ridgeness for Detecting Lane Markings | Type | Miscellaneous | ||
Year | 2005 | Publication ![]() |
2nd International Workshop on Intelligent Transportation Systems (WIT2005), Conference Proceedings (Sponsored by the IEEE Communication Society, Germany Chapter) | Abbreviated Journal | |
Volume | Issue | Pages | |||
Keywords | lane markings | ||||
Abstract | |||||
Address | Hamburg (Germany) | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | |||
Notes | ADAS | Approved | no | ||
Call Number | ADAS @ adas @ LSS2005 | Serial | 548 | ||
Permanent link to this record | |||||
Author | Manuel Carbonell; Joan Mas; Mauricio Villegas; Alicia Fornes; Josep Llados | ||||
Title | End-to-End Handwritten Text Detection and Transcription in Full Pages | Type | Conference Article | ||
Year | 2019 | Publication ![]() |
2nd International Workshop on Machine Learning | Abbreviated Journal | |
Volume | 5 | Issue | Pages | 29-34 | |
Keywords | Handwritten Text Recognition; Layout Analysis; Text segmentation; Deep Neural Networks; Multi-task learning | ||||
Abstract | When transcribing handwritten document images, inaccuracies in the text segmentation step often cause errors in the subsequent transcription step. For this reason, some recent methods propose to perform the recognition at paragraph level. But still, errors in the segmentation of paragraphs can affect
the transcription performance. In this work, we propose an end-to-end framework to transcribe full pages. The joint text detection and transcription allows to remove the layout analysis requirement at test time. The experimental results show that our approach can achieve comparable results to models that assume segmented paragraphs, and suggest that joining the two tasks brings an improvement over doing the two tasks separately. |
||||
Address | Sydney; Australia; September 2019 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | ICDAR WML | ||
Notes | DAG; 600.140; 601.311; 600.140 | Approved | no | ||
Call Number | Admin @ si @ CMV2019 | Serial | 3353 | ||
Permanent link to this record | |||||
Author | Jorge Bernal; F. Javier Sanchez; Fernando Vilariño | ||||
Title | Current Challenges on Polyp Detection in Colonoscopy Videos: From Region Segmentation to Region Classification. a Pattern Recognition-based Approach.ased Approach | Type | Conference Article | ||
Year | 2011 | Publication ![]() |
2nd International Workshop on Medical Image Analysis and Descriptionfor Diagnosis Systems | Abbreviated Journal | |
Volume | Issue | Pages | 62-71 | ||
Keywords | Medical Imaging, Colonoscopy, Pattern Recognition, Segmentation, Polyp Detection, Region Description, Machine Learning, Real-time. | ||||
Abstract | In this paper we present our approach on real-time polyp detection in colonoscopy videos. Our method consists of three stages: Image Segmentation, Region Description and Image Classification. Taking into account the constraints of our project, we introduce our segmentation system that is based on the model of appearance of the polyp that we have defined after observing real videos from colonoscopy processes. The output of this stage will ideally be a low number of regions of which one of them should cover the whole polyp region (if there is one in the image). This regions will be described in terms of features and, as a result of a machine learning schema, classified based on the values that they have for the several features that we will use on their description. Although we are still on the early stages of the project, we present some preliminary segmentation results that indicates that we are going in a good direction. | ||||
Address | Rome, Italy | ||||
Corporate Author | Thesis | ||||
Publisher | SciTePress | Place of Publication | Editor | Djemal, Khalifa | |
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | 800 | Expedition | Conference | MIAD | |
Notes | MV;SIAI | Approved | no | ||
Call Number | IAM @ iam @ BSV2011a | Serial | 1695 | ||
Permanent link to this record | |||||
Author | Onur Ferhat; Arcadi Llanza; Fernando Vilariño | ||||
Title | Gaze interaction for multi-display systems using natural light eye-tracker | Type | Conference Article | ||
Year | 2015 | Publication ![]() |
2nd International Workshop on Solutions for Automatic Gaze Data Analysis | Abbreviated Journal | |
Volume | Issue | Pages | |||
Keywords | |||||
Abstract | |||||
Address | Bielefeld; Germany; September 2015 | ||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | SAGA | ||
Notes | MV;SIAI | Approved | no | ||
Call Number | Admin @ si @ FLV2015b | Serial | 2676 | ||
Permanent link to this record | |||||
Author | Javier Vazquez; Robert Benavente; Maria Vanrell | ||||
Title | Naming constraints constancy | Type | Conference Article | ||
Year | 2012 | Publication ![]() |
2nd Joint AVA / BMVA Meeting on Biological and Machine Vision | Abbreviated Journal | |
Volume | Issue | Pages | |||
Keywords | |||||
Abstract | Different studies have shown that languages from industrialized cultures
share a set of 11 basic colour terms: red, green, blue, yellow, pink, purple, brown, orange, black, white, and grey (Berlin & Kay, 1969, Basic Color Terms, University of California Press)( Kay & Regier, 2003, PNAS, 100, 9085-9089). Some of these studies have also reported the best representatives or focal values of each colour (Boynton and Olson, 1990, Vision Res. 30,1311–1317), (Sturges and Whitfield, 1995, CRA, 20:6, 364–376). Some further studies have provided us with fuzzy datasets for color naming by asking human observers to rate colours in terms of membership values (Benavente -et al-, 2006, CRA. 31:1, 48–56,). Recently, a computational model based on these human ratings has been developed (Benavente -et al-, 2008, JOSA-A, 25:10, 2582-2593). This computational model follows a fuzzy approach to assign a colour name to a particular RGB value. For example, a pixel with a value (255,0,0) will be named 'red' with membership 1, while a cyan pixel with a RGB value of (0, 200, 200) will be considered to be 0.5 green and 0.5 blue. In this work, we show how this colour naming paradigm can be applied to different computer vision tasks. In particular, we report results in colour constancy (Vazquez-Corral -et al-, 2012, IEEE TIP, in press) showing that the classical constraints on either illumination or surface reflectance can be substituted by the statistical properties encoded in the colour names. [Supported by projects TIN2010-21771-C02-1, CSD2007-00018]. |
||||
Address | |||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | AV A | ||
Notes | CIC | Approved | no | ||
Call Number | Admin @ si @ VBV2012 | Serial | 2131 | ||
Permanent link to this record | |||||
Author | Xavier Otazu; Olivier Penacchio; Laura Dempere-Marco | ||||
Title | An investigation into plausible neural mechanisms related to the the CIWaM computational model for brightness induction | Type | Conference Article | ||
Year | 2012 | Publication ![]() |
2nd Joint AVA / BMVA Meeting on Biological and Machine Vision | Abbreviated Journal | |
Volume | Issue | Pages | |||
Keywords | |||||
Abstract | Brightness induction is the modulation of the perceived intensity of an area by the luminance of surrounding areas. From a purely computational perspective, we built a low-level computational model (CIWaM) of early sensory processing based on multi-resolution wavelets with the aim of replicating brightness and colour (Otazu et al., 2010, Journal of Vision, 10(12):5) induction effects. Furthermore, we successfully used the CIWaM architecture to define a computational saliency model (Murray et al, 2011, CVPR, 433-440; Vanrell et al, submitted to AVA/BMVA'12). From a biological perspective, neurophysiological evidence suggests that perceived brightness information may be explicitly represented in V1. In this work we investigate possible neural mechanisms that offer a plausible explanation for such effects. To this end, we consider the model by Z.Li (Li, 1999, Network:Comput. Neural Syst., 10, 187-212) which is based on biological data and focuses on the part of V1 responsible for contextual influences, namely, layer 2-3 pyramidal cells, interneurons, and horizontal intracortical connections. This model has proven to account for phenomena such as visual saliency, which share with brightness induction the relevant effect of contextual influences (the ones modelled by CIWaM). In the proposed model, the input to the network is derived from a complete multiscale and multiorientation wavelet decomposition taken from the computational model (CIWaM).
This model successfully accounts for well known pyschophysical effects (among them: the White's and modied White's effects, the Todorovic, Chevreul, achromatic ring patterns, and grating induction effects) for static contexts and also for brigthness induction in dynamic contexts defined by modulating the luminance of surrounding areas. From a methodological point of view, we conclude that the results obtained by the computational model (CIWaM) are compatible with the ones obtained by the neurodynamical model proposed here. |
||||
Address | |||||
Corporate Author | Thesis | ||||
Publisher | Place of Publication | Editor | |||
Language | Summary Language | Original Title | |||
Series Editor | Series Title | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | |||
ISSN | ISBN | Medium | |||
Area | Expedition | Conference | AV A | ||
Notes | CIC | Approved | no | ||
Call Number | Admin @ si @ OPD2012a | Serial | 2132 | ||
Permanent link to this record |