Publicacions CVC -- Query Results

Records
Author	Antonio Clavelli; Dimosthenis Karatzas; Josep Llados; Mario Ferraro; Giuseppe Boccignone
Title	Modelling task-dependent eye guidance to objects in pictures			Type	Journal Article
Year	2014	Publication	Cognitive Computation	Abbreviated Journal	CoCom
Volume	6	Issue	3	Pages	558-584
Keywords	Visual attention; Gaze guidance; Value; Payoff; Stochastic fixation prediction
Abstract	5Y Impact Factor: 1.14 / 3rd (Computer Science, Artificial Intelligence) We introduce a model of attentional eye guidance based on the rationale that the deployment of gaze is to be considered in the context of a general action-perception loop relying on two strictly intertwined processes: sensory processing, depending on current gaze position, identifies sources of information that are most valuable under the given task; motor processing links such information with the oculomotor act by sampling the next gaze position and thus performing the gaze shift. In such a framework, the choice of where to look next is task-dependent and oriented to classes of objects embedded within pictures of complex scenes. The dependence on task is taken into account by exploiting the value and the payoff of gazing at certain image patches or proto-objects that provide a sparse representation of the scene objects. The different levels of the action-perception loop are represented in probabilistic form and eventually give rise to a stochastic process that generates the gaze sequence. This way the model also accounts for statistical properties of gaze shifts such as individual scan path variability. Results of the simulations are compared with experimental data derived both from publicly available datasets and from our own experiments.
Address
Corporate Author				Thesis
Publisher	Springer US	Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN	1866-9956	ISBN		Medium
Area		Expedition		Conference
Notes	DAG; 600.056; 600.045; 605.203; 601.212; 600.077			Approved	no
Call Number	Admin @ si @ CKL2014			Serial	2419



Author	C. Alejandro Parraga; Xavier Otazu; Arash Akbarinia
Title	Modelling symmetry perception with banks of quadrature convolutional Gabor kernels			Type	Conference Article
Year	2019	Publication	42nd edition of the European Conference on Visual Perception	Abbreviated Journal
Volume		Issue		Pages	224-224
Keywords
Abstract	Mirror symmetry is a property more likely to be encountered in animals than in medium scale vegetation or inanimate objects in the natural world. This might be the reason why the human visual system has evolved to detect it quickly and robustly. Indeed, the perception of symmetry assists higher-level visual processes that are crucial for survival such as target recognition and identification irrespective of position and location. Although the task of detecting symmetrical objects seems effortless to us, it is very challenging for computers (to the extent that it has been proposed as a robust “captcha” by Funk & Liu in 2016). Indeed, the exact mechanism of symmetry detection in primates is not well understood: fMRI studies have shown that symmetrical shapes activate specific higher-level areas of the visual cortex (Sasaki et al., 2005) and similarly, a large body of psychophysical experiments suggests that symmetry perception is critically influenced by low-level mechanisms (Treder, 2010). In this work we attempt to find plausible low-level mechanisms that might form the basis for symmetry perception. Our simple model is made from banks of (i) odd-symmetric Gabors (resembling edge-detecting V1 neurons); and (ii) banks of larger odd- and even-symmetric Gabors (resembling higher visual cortex neurons), that pool signals from the 'edge image'. As reported previously (Akbarinia et al., ECVP 2017), the convolution of the symmetrical lines with the two Gabor kernels of alternative phase produces a minimum in one and a maximum in the other (Osorio, 1996), and the rectification and combination of these signals create lines which hint at mirror symmetry in natural images. We improved the algorithm by combining these signals across several spatial scales. Our preliminary results suggest that such multiscale combination of convolutional operations might form the basis for much of the operation of the HVS in terms of symmetry detection and representation.
Address	Leuven; Belgium; August 2019
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ECVP
Notes	NEUROBIT; 600.128			Approved	no
Call Number	Admin @ si @ POA2019			Serial	3371



Author	Misael Rosales; Petia Radeva; Oriol Rodriguez-Leon; Debora Gil
Title	Modelling of image-catheter motion for 3-D IVUS			Type	Journal Article
Year	2009	Publication	Medical image analysis	Abbreviated Journal	MIA
Volume	13	Issue	1	Pages	91-104
Keywords	Intravascular ultrasound (IVUS); Motion estimation; Motion decomposition; Fourier
Abstract	Three-dimensional intravascular ultrasound (IVUS) makes it possible to visualize and obtain volumetric measurements of coronary lesions through an exploration of the cross sections and longitudinal views of arteries. However, the visualization and subsequent morpho-geometric measurements in IVUS longitudinal cuts are subject to distortion caused by periodic image/vessel motion around the IVUS catheter. Usually, to overcome the image motion artifact, ECG-gating and image-gated approaches are proposed, which slow the pullback acquisition or disregard part of the IVUS data. In this paper, we argue that the image motion is due to 3-D vessel geometry as well as cardiac dynamics, and propose a dynamic model based on the tracking of an elliptical vessel approximation to recover the rigid transformation and align IVUS images without losing any IVUS data. We report an extensive validation with synthetic simulated data and in vivo IVUS sequences of 30 patients, achieving an average reduction of the image artifact of 97% in synthetic data and 79% in real data. Our study shows that IVUS alignment improves longitudinal analysis of the IVUS data and is a necessary step towards accurate reconstruction and volumetric measurements of 3-D IVUS.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	IAM;MILAB			Approved	no
Call Number	IAM @ iam @ RRR2009			Serial	1646



Author	C. Alejandro Parraga; Robert Benavente; Maria Vanrell; Ramon Baldrich
Title	Modelling Inter-Colour Regions of Colour Naming Space			Type	Conference Article
Year	2008	Publication	4th European Conference on Colour in Graphics, Imaging and Vision Proceedings	Abbreviated Journal
Volume		Issue		Pages	218–222
Keywords
Abstract
Address	Terrassa (Spain)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	CGIV08
Notes	CAT;CIC			Approved	no
Call Number	CAT @ cat @ PBV2008			Serial	969



Author	Sergio Escalera; Petia Radeva; Jordi Vitria; Xavier Baro; Bogdan Raducanu
Title	Modelling and Analyzing Multimodal Dyadic Interactions Using Social Networks			Type	Conference Article
Year	2010	Publication	12th International Conference on Multimodal Interfaces and 7th Workshop on Machine Learning for Multimodal Interaction.	Abbreviated Journal
Volume		Issue		Pages
Keywords	Social interaction; Multimodal fusion; Influence model; Social network analysis
Abstract	Social network analysis has become a common technique used to model and quantify the properties of social interactions. In this paper, we propose an integrated framework to explore the characteristics of a social network extracted from multimodal dyadic interactions. First, speech detection is performed through an audio/visual fusion scheme based on stacked sequential learning. In the audio domain, speech is detected through clustering of audio features. Clusters are modelled by means of a one-state Hidden Markov Model containing a diagonal covariance Gaussian Mixture Model. In the visual domain, speech detection is performed through differential-based feature extraction from the segmented mouth region, and a dynamic programming matching procedure. Second, in order to model the dyadic interactions, we employed the Influence Model whose states encode the previously integrated audio/visual data. Third, the social network is extracted based on the estimated influences. For our study, we used a set of videos belonging to New York Times’ Blogging Heads opinion blog. The results are reported both in terms of accuracy of the audio/visual data fusion and centrality measures used to characterize the social network.
Address	Beijing (China)
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICMI-MLI
Notes	OR;MILAB;HUPBA;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ ERV2010			Serial	1427



Author	Jaume Garcia; Debora Gil; Francesc Carreras; Sandra Pujades; R. Leta
Title	Modelització 4-Dimensional de la Funció Sistòlica del Ventricle Esquerre			Type	Conference Article
Year	2007	Publication	XIX Congrés de la Societat Catalana de Cardiologia de Barcelona	Abbreviated Journal
Volume		Issue		Pages	133-134
Keywords
Abstract	Technological advances in medical image processing make it possible, with appropriate software, to reconstruct three-dimensional images of cardiovascular structures and to animate them. The resulting 4D images facilitate the study of the pathophysiology of heart failure on the basis of disorders of ventricular electromechanical activation, which may be of interest when selecting patients who are candidates for resynchronization therapies. We present preliminary results of the 4D reconstruction of the left ventricle (LV) from myocardial tagging sequences of the LV.
Address
Corporate Author				Thesis
Publisher		Place of Publication	Barcelona (Spain)	Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	IAM			Approved	no
Call Number	IAM @ iam @ GGC2007			Serial	1505



Author	Hugo Jair Escalante; Heysem Kaya; Albert Ali Salah; Sergio Escalera; Yagmur Gucluturk; Umut Guçlu; Xavier Baro; Isabelle Guyon; Julio C. S. Jacques Junior; Meysam Madadi; Stephane Ayache; Evelyne Viegas; Furkan Gurpinar; Achmadnoer Sukma Wicaksana; Cynthia Liem; Marcel A. J. Van Gerven; Rob Van Lier
Title	Modeling, Recognizing, and Explaining Apparent Personality from Videos			Type	Journal Article
Year	2022	Publication	IEEE Transactions on Affective Computing	Abbreviated Journal	TAC
Volume	13	Issue	2	Pages	894-911
Keywords
Abstract	Explainability and interpretability are two critical aspects of decision support systems. Despite their importance, it is only recently that researchers are starting to explore these aspects. This paper provides an introduction to explainability and interpretability in the context of apparent personality recognition. To the best of our knowledge, this is the first effort in this direction. We describe a challenge we organized on explainability in first impressions analysis from video. We analyze in detail the newly introduced data set, evaluation protocol, proposed solutions and summarize the results of the challenge. We investigate the issue of bias in detail. Finally, derived from our study, we outline research opportunities that we foresee will be relevant in this area in the near future.
Address	1 April-June 2022
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	HuPBA; no menciona			Approved	no
Call Number	Admin @ si @ EKS2022			Serial	3406



Author	Marc Serra
Title	Modeling, estimation and evaluation of intrinsic images considering color information			Type	Book Whole
Year	2015	Publication	PhD Thesis, Universitat Autonoma de Barcelona-CVC	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract	Image values are the result of a combination of visual information coming from multiple sources. Recovering information from the multiple factors that produced an image seems a hard and ill-posed problem. However, it is important to observe that humans develop the ability to interpret images and recognize and isolate specific physical properties of the scene. Images describing a single physical characteristic of a scene are called intrinsic images. These images would benefit most computer vision tasks, which are often affected by the multiple complex effects that are usually found in natural images (e.g. cast shadows, specularities, interreflections...). In this thesis we analyze the problem of intrinsic image estimation from different perspectives, including the theoretical formulation of the problem, the visual cues that can be used to estimate the intrinsic components and the evaluation mechanisms of the problem.
Address	September 2015
Corporate Author				Thesis	Ph.D. thesis
Publisher	Ediciones Graficas Rey	Place of Publication		Editor	Robert Benavente; Olivier Penacchio
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN	978-84-943427-4-5	Medium
Area		Expedition		Conference
Notes	CIC; 600.074			Approved	no
Call Number	Admin @ si @ Ser2015			Serial	2688



Author	Wenjuan Gong; Jürgen Brauer; Michael Arens; Jordi Gonzalez
Title	Modeling vs. Learning Approaches for Monocular 3D Human Pose Estimation			Type	Conference Article
Year	2011	Publication	1st IEEE International Workshop on Performance Evaluation on Recognition of Human Actions and Pose Estimation Methods	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	London, United Kingdom
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	PERHAPS
Notes	ISE			Approved	no
Call Number	Admin @ si @ GBA2011			Serial	1812



Author	David Berga; C. Wloka; J.K. Tsotsos
Title	Modeling task influences for saccade sequence and visual relevance prediction			Type	Journal Article
Year	2019	Publication	Journal of Vision	Abbreviated Journal	JV
Volume	19	Issue	10	Pages	106c-106c
Keywords
Abstract	Previous work from Wloka et al. (2017) presented the Selective Tuning Attentive Reference model Fixation Controller (STAR-FC), an active vision model for saccade prediction. Although the model is able to efficiently predict saccades during free-viewing, it is well known that stimulus and task instructions can strongly affect eye movement patterns (Yarbus, 1967). These factors are considered in previous Selective Tuning architectures (Tsotsos and Kruijne, 2014; Tsotsos, Kotseruba and Wloka, 2016; Rosenfeld, Biparva and Tsotsos, 2017), proposing a way to combine bottom-up and top-down contributions to fixation and saccade programming. In particular, task priming has been shown to be crucial to the deployment of eye movements, involving interactions between brain areas related to goal-directed behavior, working and long-term memory in combination with stimulus-driven eye movement neuronal correlates. Initial theories and models of these influences include Rao, Zelinsky, Hayhoe and Ballard (2002), Navalpakkam and Itti (2005) and Huang and Pashler (2007), and show distinct ways to process the task requirements in combination with bottom-up attention. In this study we extend the STAR-FC with novel computational definitions of Long-Term Memory, Visual Task Executive and a Task Relevance Map. With these modules we are able to use textual instructions in order to guide the model to attend to specific categories of objects and/or places in the scene. We have designed our memory model by processing a hierarchy of visual features learned from salient object detection datasets. The relationship between the executive task instructions and the memory representations has been specified using a tree of semantic similarities between the learned features and the object category labels. Results reveal that by using this model, the resulting relevance maps and predicted saccades have a higher probability of falling inside the salient regions depending on the distinct task instructions.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	NEUROBIT; 600.128; 600.120			Approved	no
Call Number	Admin @ si @ BWT2019			Serial	3308



Author	Luis Herranz; Shuqiang Jiang; Ruihan Xu
Title	Modeling Restaurant Context for Food Recognition			Type	Journal Article
Year	2017	Publication	IEEE Transactions on Multimedia	Abbreviated Journal	TMM
Volume	19	Issue	2	Pages	430-440
Keywords
Abstract	Food photos are widely used in food logs for diet monitoring and in social networks to share social and gastronomic experiences. A large number of these images are taken in restaurants. Dish recognition in general is very challenging, due to different cuisines, cooking styles, and the intrinsic difficulty of modeling food from its visual appearance. However, contextual knowledge can be crucial to improve recognition in such scenarios. In particular, geocontext has been widely exploited for outdoor landmark recognition. Similarly, we exploit knowledge about menus and location of restaurants and test images. We first adapt a framework based on discarding unlikely categories located far from the test image. Then, we reformulate the problem using a probabilistic model connecting dishes, restaurants, and locations. We apply that model in three different tasks: dish recognition, restaurant recognition, and location refinement. Experiments on six datasets show that by integrating multiple sources of evidence (visual, location, and external knowledge) our system can boost the performance in all tasks.
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	LAMP; 600.120			Approved	no
Call Number	Admin @ si @ HJX2017			Serial	2965



Author	Alejandro Cartas; Petia Radeva; Mariella Dimiccoli
Title	Modeling long-term interactions to enhance action recognition			Type	Conference Article
Year	2021	Publication	25th International Conference on Pattern Recognition	Abbreviated Journal
Volume		Issue		Pages	10351-10358
Keywords
Abstract	In this paper, we propose a new approach to understand actions in egocentric videos that exploits the semantics of object interactions at both frame and temporal levels. At the frame level, we use a region-based approach that takes as input a primary region roughly corresponding to the user's hands and a set of secondary regions potentially corresponding to the interacting objects and calculates the action score through a CNN formulation. This information is then fed to a Hierarchical Long Short-Term Memory Network (HLSTM) that captures temporal dependencies between actions within and across shots. Ablation studies thoroughly validate the proposed approach, showing in particular that both levels of the HLSTM architecture contribute to performance improvement. Furthermore, quantitative comparisons show that the proposed approach outperforms the state-of-the-art in terms of action recognition on standard benchmarks, without relying on motion information.
Address	January 2021
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference	ICPR
Notes	MILAB;			Approved	no
Call Number	Admin @ si @ CRD2021			Serial	3626



Author	Pau Baiget
Title	Modeling Human Behavior for Image Sequence Understanding and Generation			Type	Book Whole
Year	2009	Publication	PhD Thesis, Universitat Autonoma de Barcelona-CVC	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract	The comprehension of animal behavior, especially human behavior, is one of the most ancient and studied problems since the beginning of civilization. The long list of factors that interact to determine a person's actions requires the collaboration of different disciplines, such as psychology, biology, or sociology. In recent years the analysis of human behavior has also received great attention from the computer vision community, given the latest advances in the acquisition of human motion data from image sequences. Despite the increasing availability of that data, there still exists a gap towards obtaining a conceptual representation of the obtained observations. Human behavior analysis is based on a qualitative interpretation of the results, and therefore the assignment of concepts to quantitative data is linked to a certain ambiguity. This Thesis tackles the problem of obtaining a proper representation of human behavior in the contexts of computer vision and animation. On the one hand, a good behavior model should permit the recognition and explanation of the observed activity in image sequences. On the other hand, such a model must allow the generation of new synthetic instances, which model the behavior of virtual agents. First, we propose methods to automatically learn the models from observations. Given a set of quantitative results output by a vision system, a normal behavior model is learnt. This result provides a tool to determine the normality or abnormality of future observations. However, machine learning methods are unable to provide a richer description of the observations. We confront this problem by means of a new method that incorporates prior knowledge about the environment and about the expected behaviors. This framework, formed by the reasoning engine FMTL and the modeling tool SGT, allows the generation of conceptual descriptions of activity in new image sequences. Finally, we demonstrate the suitability of the proposed framework to simulate behavior of virtual agents, which are introduced into real image sequences and interact with observed real agents, thereby easing the generation of augmented reality sequences. The set of approaches presented in this Thesis has a growing set of potential applications. The analysis and description of behavior in image sequences has its principal application in the domain of smart video surveillance, in order to detect suspicious or dangerous behaviors. Other applications include automatic sport commentaries, elderly monitoring, road traffic analysis, and the development of semantic video search engines. Alternatively, behavioral virtual agents allow real situations, such as fires or crowds, to be simulated accurately. Moreover, the inclusion of virtual agents into real image sequences has been widely deployed in the games and cinema industries.
Address	Bellaterra (Spain)
Corporate Author				Thesis	Ph.D. thesis
Publisher	Ediciones Graficas Rey	Place of Publication		Editor	Jordi Gonzalez; Xavier Roca
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes				Approved	no
Call Number	Admin @ si @ Bai2009			Serial	1210



Author	David Guillamet; B. Moghaddam; Jordi Vitria
Title	Modeling High-Order Dependencies in Local Appearance Models			Type	Miscellaneous
Year	2003	Publication	In Pattern Recognition and Image Analysis, Lecture Notes in Computer Science. 2652: 308–316	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address	Springer-Verlag
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	OR;MV			Approved	no
Call Number	BCNPCL @ bcnpcl @ GMV2003a			Serial	376



Author	C. Alejandro Parraga; Robert Benavente; Maria Vanrell
Title	Modeling Colour-Naming Space with Fuzzy Sets			Type	Journal
Year	2007	Publication	Perception 36:198–198, supp	Abbreviated Journal
Volume		Issue		Pages
Keywords
Abstract
Address
Corporate Author				Thesis
Publisher		Place of Publication		Editor
Language		Summary Language		Original Title
Series Editor		Series Title		Abbreviated Series Title
Series Volume		Series Issue		Edition
ISSN		ISBN		Medium
Area		Expedition		Conference
Notes	CIC			Approved	no
Call Number	CAT @ cat @ PBV2007			Serial	843