Home | << 1 2 3 4 5 6 7 8 9 10 >> [11–13] |
![]() |
Records | Links | |||||
---|---|---|---|---|---|---|
Author ![]() |
M. Danelljan; Fahad Shahbaz Khan; Michael Felsberg; Joost Van de Weijer |
![]() ![]() ![]() ![]() |
||||
Title | Adaptive color attributes for real-time visual tracking | Type | Conference Article | |||
Year | 2014 | Publication | 27th IEEE Conference on Computer Vision and Pattern Recognition | Abbreviated Journal | ||
Volume | Issue | Pages | 1090 - 1097 | |||
Keywords | ||||||
Abstract | Visual tracking is a challenging problem in computer vision. Most state-of-the-art visual trackers either rely on luminance information or use simple color representations for image description. Contrary to visual tracking, for object
recognition and detection, sophisticated color features when combined with luminance have shown to provide excellent performance. Due to the complexity of the tracking problem, the desired color feature should be computationally efficient, and possess a certain amount of photometric invariance while maintaining high discriminative power. This paper investigates the contribution of color in a tracking-by-detection framework. Our results suggest that color attributes provides superior performance for visual tracking. We further propose an adaptive low-dimensional variant of color attributes. Both quantitative and attributebased evaluations are performed on 41 challenging benchmark color sequences. The proposed approach improves the baseline intensity-based tracker by 24% in median distance precision. Furthermore, we show that our approach outperforms state-of-the-art tracking methods while running at more than 100 frames per second. |
|||||
Address | Nottingham; UK; September 2014 | |||||
Corporate Author | Thesis | |||||
Publisher | Place of Publication | Editor | ||||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | ISBN | Medium | ||||
Area | Expedition | Conference | CVPR | |||
Notes | CIC; LAMP; 600.074; 600.079 | Approved | no | |||
Call Number | Admin @ si @ DKF2014 | Serial | 2509 | |||
Permanent link to this record | ||||||
Author ![]() |
Justine Giroux; Mohammad Reza Karimi Dastjerdi; Yannick Hold-Geoffroy; Javier Vazquez; Jean François Lalonde |
![]() ![]() ![]() ![]() |
||||
Title | Towards a Perceptual Evaluation Framework for Lighting Estimation | Type | Conference Article | |||
Year | 2024 | Publication | Arxiv | Abbreviated Journal | ||
Volume | Issue | Pages | ||||
Keywords | ||||||
Abstract | rogress in lighting estimation is tracked by computing existing image quality assessment (IQA) metrics on images from standard datasets. While this may appear to be a reasonable approach, we demonstrate that doing so does not correlate to human preference when the estimated lighting is used to relight a virtual scene into a real photograph. To study this, we design a controlled psychophysical experiment where human observers must choose their preference amongst rendered scenes lit using a set of lighting estimation algorithms selected from the recent literature, and use it to analyse how these algorithms perform according to human perception. Then, we demonstrate that none of the most popular IQA metrics from the literature, taken individually, correctly represent human perception. Finally, we show that by learning a combination of existing IQA metrics, we can more accurately represent human preference. This provides a new perceptual framework to help evaluate future lighting estimation algorithms. | |||||
Address | Seattle; USA; June 2024 | |||||
Corporate Author | Thesis | |||||
Publisher | Place of Publication | Editor | ||||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | ISBN | Medium | ||||
Area | Expedition | Conference | CVPR | |||
Notes | MACO; CIC | Approved | no | |||
Call Number | Admin @ si @ GDH2024 | Serial | 3999 | |||
Permanent link to this record | ||||||
Author ![]() |
Josep M. Gonfaus; Xavier Boix; Joost Van de Weijer; Andrew Bagdanov; Joan Serrat; Jordi Gonzalez |
![]() ![]() ![]() ![]() ![]() |
||||
Title | Harmony Potentials for Joint Classification and Segmentation | Type | Conference Article | |||
Year | 2010 | Publication | 23rd IEEE Conference on Computer Vision and Pattern Recognition | Abbreviated Journal | ||
Volume | Issue | Pages | 3280–3287 | |||
Keywords | ||||||
Abstract | Hierarchical conditional random fields have been successfully applied to object segmentation. One reason is their ability to incorporate contextual information at different scales. However, these models do not allow multiple labels to be assigned to a single node. At higher scales in the image, this yields an oversimplified model, since multiple classes can be reasonable expected to appear within one region. This simplified model especially limits the impact that observations at larger scales may have on the CRF model. Neglecting the information at larger scales is undesirable since class-label estimates based on these scales are more reliable than at smaller, noisier scales. To address this problem, we propose a new potential, called harmony potential, which can encode any possible combination of class labels. We propose an effective sampling strategy that renders tractable the underlying optimization problem. Results show that our approach obtains state-of-the-art results on two challenging datasets: Pascal VOC 2009 and MSRC-21. | |||||
Address | San Francisco CA, USA | |||||
Corporate Author | Thesis | |||||
Publisher | Place of Publication | Editor | ||||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | 1063-6919 | ISBN | 978-1-4244-6984-0 | Medium | ||
Area | Expedition | Conference | CVPR | |||
Notes | ADAS;CIC;ISE | Approved | no | |||
Call Number | ADAS @ adas @ GBW2010 | Serial | 1296 | |||
Permanent link to this record | ||||||
Author ![]() |
Jose Manuel Alvarez; Antonio Lopez; Ramon Baldrich |
![]() ![]() ![]() |
||||
Title | Shadow Resistant Road Segmentation from a Mobile Monocular System | Type | Conference Article | |||
Year | 2007 | Publication | 3rd Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2007), J. Marti et al. (Eds.) LNCS 4477:9–16 | Abbreviated Journal | ||
Volume | Issue | Pages | ||||
Keywords | road detection | |||||
Abstract | ||||||
Address | Gerona (Spain) | |||||
Corporate Author | Thesis | |||||
Publisher | Place of Publication | Editor | ||||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | ISBN | Medium | ||||
Area | Expedition | Conference | ||||
Notes | ADAS;CIC | Approved | no | |||
Call Number | ADAS @ adas @ ALB2007 | Serial | 943 | |||
Permanent link to this record | ||||||
Author ![]() |
Jose Manuel Alvarez; Antonio Lopez; Ramon Baldrich |
![]() ![]() ![]() |
||||
Title | Illuminant Invariant Model-Based Road Segmentation | Type | Conference Article | |||
Year | 2008 | Publication | IEEE Intelligent Vehicles Symposium, | Abbreviated Journal | ||
Volume | Issue | Pages | 1155–1180 | |||
Keywords | road detection | |||||
Abstract | ||||||
Address | Eindhoven (The Netherlands) | |||||
Corporate Author | Thesis | |||||
Publisher | Place of Publication | Editor | ||||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | ISBN | Medium | ||||
Area | Expedition | Conference | ||||
Notes | ADAS;CIC | Approved | no | |||
Call Number | ADAS @ adas @ ALB2008 | Serial | 1045 | |||
Permanent link to this record | ||||||
Author ![]() |
Jose Carlos Rubio |
![]() ![]() |
||||
Title | Graph matching based on graphical models with application to vehicle tracking and classification at night | Type | Report | |||
Year | 2009 | Publication | CVC Technical Report | Abbreviated Journal | ||
Volume | 144 | Issue | Pages | |||
Keywords | ||||||
Abstract | ||||||
Address | ||||||
Corporate Author | Computer Vision Center | Thesis | Master's thesis | |||
Publisher | Place of Publication | Bellaterra, Barcelona | Editor | |||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | ISBN | Medium | ||||
Area | Expedition | Conference | ||||
Notes | CIC | Approved | no | |||
Call Number | Admin @ si @ Rub2009 | Serial | 2398 | |||
Permanent link to this record | ||||||
Author ![]() |
Jordi Vitria; Petia Radeva; X. Binefa; A. Pujol; Ernest Valveny; Robert Benavente; Craig Von Land |
![]() ![]() |
||||
Title | Real time recognition of pharmaceutical products by subspace methods | Type | Report | |||
Year | 1999 | Publication | CVC Technical Report #35 | Abbreviated Journal | ||
Volume | Issue | Pages | ||||
Keywords | ||||||
Abstract | ||||||
Address | CVC (UAB) | |||||
Corporate Author | Thesis | |||||
Publisher | Place of Publication | Editor | ||||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | ISBN | Medium | ||||
Area | Expedition | Conference | ||||
Notes | OR;MILAB;DAG;CIC;MV | Approved | no | |||
Call Number | BCNPCL @ bcnpcl @ VRB1999b | Serial | 54 | |||
Permanent link to this record | ||||||
Author ![]() |
Jordi Roca; Maria Vanrell; C. Alejandro Parraga |
![]() ![]() ![]() ![]() |
||||
Title | What is constant in colour constancy? | Type | Conference Article | |||
Year | 2012 | Publication | 6th European Conference on Colour in Graphics, Imaging and Vision | Abbreviated Journal | ||
Volume | Issue | Pages | 337-343 | |||
Keywords | ||||||
Abstract | Color constancy refers to the ability of the human visual system to stabilize
the color appearance of surfaces under an illuminant change. In this work we studied how the interrelations among nine colors are perceived under illuminant changes, particularly whether they remain stable across 10 different conditions (5 illuminants and 2 backgrounds). To do so we have used a paradigm that measures several colors under an immersive state of adaptation. From our measures we defined a perceptual structure descriptor that is up to 87% stable over all conditions, suggesting that color category features could be used to predict color constancy. This is in agreement with previous results on the stability of border categories [1,2] and with computational color constancy algorithms [3] for estimating the scene illuminant. |
|||||
Address | ||||||
Corporate Author | Thesis | |||||
Publisher | Place of Publication | Editor | ||||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | ISBN | 9781622767014 | Medium | |||
Area | Expedition | Conference | CGIV | |||
Notes | CIC | Approved | no | |||
Call Number | RVP2012 | Serial | 2189 | |||
Permanent link to this record | ||||||
Author ![]() |
Jordi Roca; C. Alejandro Parraga; Maria Vanrell |
![]() ![]() ![]() |
||||
Title | Categorical Focal Colours are Structurally Invariant Under Illuminant Changes | Type | Conference Article | |||
Year | 2011 | Publication | European Conference on Visual Perception | Abbreviated Journal | ||
Volume | Issue | Pages | 196 | |||
Keywords | ||||||
Abstract | The visual system perceives the colour of surfaces approximately constant under changes of illumination. In this work, we investigate how stable is the perception of categorical \“focal\” colours and their interrelations with varying illuminants and simple chromatic backgrounds. It has been proposed that best examples of colour categories across languages cluster in small regions of the colour space and are restricted to a set of 11 basic terms (Kay and Regier, 2003 Proceedings of the National Academy of Sciences of the USA 100 9085\–9089). Following this, we developed a psychophysical paradigm that exploits the ability of subjects to reliably reproduce the most representative examples of each category, adjusting multiple test patches embedded in a coloured Mondrian. The experiment was run on a CRT monitor (inside a dark room) under various simulated illuminants. We modelled the recorded data for each subject and adapted state as a 3D interconnected structure (graph) in Lab space. The graph nodes were the subject\’s focal colours at each adaptation state. The model allowed us to get a better distance measure between focal structures under different illuminants. We found that perceptual focal structures tend to be preserved better than the structures of the physical \“ideal\” colours under illuminant changes. | |||||
Address | ||||||
Corporate Author | Thesis | |||||
Publisher | Place of Publication | Editor | ||||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Perception 40 | Abbreviated Series Title | |||
Series Volume | Series Issue | Edition | ||||
ISSN | ISBN | Medium | ||||
Area | Expedition | Conference | ECVP | |||
Notes | CIC | Approved | no | |||
Call Number | Admin @ si @ RPV2011 | Serial | 1867 | |||
Permanent link to this record | ||||||
Author ![]() |
Jordi Roca; C. Alejandro Parraga; Maria Vanrell |
![]() ![]() ![]() ![]() |
||||
Title | Chromatic settings and the structural color constancy index | Type | Journal Article | |||
Year | 2013 | Publication | Journal of Vision | Abbreviated Journal | JV | |
Volume | 13 | Issue | 4-3 | Pages | 1-26 | |
Keywords | ||||||
Abstract | Color constancy is usually measured by achromatic setting, asymmetric matching, or color naming paradigms, whose results are interpreted in terms of indexes and models that arguably do not capture the full complexity of the phenomenon. Here we propose a new paradigm, chromatic setting, which allows a more comprehensive characterization of color constancy through the measurement of multiple points in color space under immersive adaptation. We demonstrated its feasibility by assessing the consistency of subjects' responses over time. The paradigm was applied to two-dimensional (2-D) Mondrian stimuli under three different illuminants, and the results were used to fit a set of linear color constancy models. The use of multiple colors improved the precision of more complex linear models compared to the popular diagonal model computed from gray. Our results show that a diagonal plus translation matrix that models mechanisms other than cone gain might be best suited to explain the phenomenon. Additionally, we calculated a number of color constancy indices for several points in color space, and our results suggest that interrelations among colors are not as uniform as previously believed. To account for this variability, we developed a new structural color constancy index that takes into account the magnitude and orientation of the chromatic shift in addition to the interrelations among colors and memory effects. | |||||
Address | ||||||
Corporate Author | Thesis | |||||
Publisher | Place of Publication | Editor | ||||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | ISBN | Medium | ||||
Area | Expedition | Conference | ||||
Notes | CIC; 600.052; 600.051; 605.203 | Approved | no | |||
Call Number | Admin @ si @ RPV2013 | Serial | 2288 | |||
Permanent link to this record | ||||||
Author ![]() |
Jordi Roca; C. Alejandro Parraga; Maria Vanrell |
![]() ![]() ![]() ![]() |
||||
Title | Predicting categorical colour perception in successive colour constancy | Type | Abstract | |||
Year | 2012 | Publication | Perception | Abbreviated Journal | PER | |
Volume | 41 | Issue | Pages | 138 | ||
Keywords | ||||||
Abstract | Colour constancy is a perceptual mechanism that seeks to keep the colour of objects relatively stable under an illumination shift. Experiments haveshown that its effects depend on the number of colours present in the scene. We
studied categorical colour changes under different adaptation states, in particular, whether the colour categories seen under a chromatically neutral illuminant are the same after a shift in the chromaticity of the illumination. To do this, we developed the chromatic setting paradigm (2011 Journal of Vision11 349), which is as an extension of achromatic setting to colour categories. The paradigm exploits the ability of subjects to reliably reproduce the most representative examples of each category, adjusting multiple test patches embedded in a coloured Mondrian. Our experiments were run on a CRT monitor (inside a dark room) under various simulated illuminants and restricting the number of colours of the Mondrian background to three, thus weakening the adaptation effect. Our results show a change in the colour categories present before (under neutral illumination) and after adaptation (under coloured illuminants) with a tendency for adapted colours to be less saturated than before adaptation. This behaviour was predicted by a simple affine matrix model, adjusted to the chromatic setting results. |
|||||
Address | ||||||
Corporate Author | Thesis | |||||
Publisher | Place of Publication | Editor | ||||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | 0301-0066 | ISBN | Medium | |||
Area | Expedition | Conference | ||||
Notes | CIC | Approved | no | |||
Call Number | Admin @ si @ RPV2012 | Serial | 2188 | |||
Permanent link to this record | ||||||
Author ![]() |
Jordi Roca; A.Owen; G.Jordan; Y.Ling; C. Alejandro Parraga; A.Hurlbert |
![]() ![]() ![]() ![]() |
||||
Title | Inter-individual Variations in Color Naming and the Structure of 3D Color Space | Type | Abstract | |||
Year | 2011 | Publication | Journal of Vision | Abbreviated Journal | VSS | |
Volume | 12 | Issue | 2 | Pages | 166 | |
Keywords | ||||||
Abstract | 36.307
Many everyday behavioural uses of color vision depend on color naming ability, which is neither measured nor predicted by most standardized tests of color vision, for either normal or anomalous color vision. Here we demonstrate a new method to quantify color naming ability by deriving a compact computational description of individual 3D color spaces. Methods: Individual observers underwent standardized color vision diagnostic tests (including anomaloscope testing) and a series of custom-made color naming tasks using 500 distinct color samples, either CRT stimuli (“light”-based) or Munsell chips (“surface”-based), with both forced- and free-choice color naming paradigms. For each subject, we defined his/her color solid as the set of 3D convex hulls computed for each basic color category from the relevant collection of categorised points in perceptually uniform CIELAB space. From the parameters of the convex hulls, we derived several indices to characterise the 3D structure of the color solid and its inter-individual variations. Using a reference group of 25 normal trichromats (NT), we defined the degree of normality for the shape, location and overlap of each color region, and the extent of “light”-“surface” agreement. Results: Certain features of color perception emerge from analysis of the average NT color solid, e.g.: (1) the white category is slightly shifted towards blue; and (2) the variability in category border location across NT subjects is asymmetric across color space, with least variability in the blue/green region. Comparisons between individual and average NT indices reveal specific naming “deficits”, e.g.: (1) Category volumes for white, green, brown and grey are expanded for anomalous trichromats and dichromats; and (2) the focal structure of color space is disrupted more in protanopia than other forms of anomalous color vision. The indices both capture the structure of subjective color spaces and allow us to quantify inter-individual differences in color naming ability. |
|||||
Address | ||||||
Corporate Author | Thesis | |||||
Publisher | Place of Publication | Editor | ||||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | 1534-7362 | ISBN | Medium | |||
Area | Expedition | Conference | ||||
Notes | CIC | Approved | no | |||
Call Number | Admin @ si @ ROJ2011 | Serial | 1758 | |||
Permanent link to this record | ||||||
Author ![]() |
Jordi Roca |
![]() ![]() |
||||
Title | Constancy and inconstancy in categorical colour perception | Type | Book Whole | |||
Year | 2012 | Publication | PhD Thesis, Universitat Autonoma de Barcelona-CVC | Abbreviated Journal | ||
Volume | Issue | Pages | ||||
Keywords | ||||||
Abstract | To recognise objects is perhaps the most important task an autonomous system, either biological or artificial needs to perform. In the context of human vision, this is partly achieved by recognizing the colour of surfaces despite changes in the wavelength distribution of the illumination, a property called colour constancy. Correct surface colour recognition may be adequately accomplished by colour category matching without the need to match colours precisely, therefore categorical colour constancy is likely to play an important role for object identification to be successful. The main aim of this work is to study the relationship between colour constancy and categorical colour perception. Previous studies of colour constancy have shown the influence of factors such the spatio-chromatic properties of the background, individual observer's performance, semantics, etc. However there is very little systematic study of these influences. To this end, we developed a new approach to colour constancy which includes both individual observers' categorical perception, the categorical structure of the background, and their interrelations resulting in a more comprehensive characterization of the phenomenon. In our study, we first developed a new method to analyse the categorical structure of 3D colour space, which allowed us to characterize individual categorical colour perception as well as quantify inter-individual variations in terms of shape and centroid location of 3D categorical regions. Second, we developed a new colour constancy paradigm, termed chromatic setting, which allows measuring the precise location of nine categorically-relevant points in colour space under immersive illumination. Additionally, we derived from these measurements a new colour constancy index which takes into account the magnitude and orientation of the chromatic shift, memory effects and the interrelations among colours and a model of colour naming tuned to each observer/adaptation state. Our results lead to the following conclusions: (1) There exists large inter-individual variations in the categorical structure of colour space, and thus colour naming ability varies significantly but this is not well predicted by low-level chromatic discrimination ability; (2) Analysis of the average colour naming space suggested the need for an additional three basic colour terms (turquoise, lilac and lime) for optimal colour communication; (3) Chromatic setting improved the precision of more complex linear colour constancy models and suggested that mechanisms other than cone gain might be best suited to explain colour constancy; (4) The categorical structure of colour space is broadly stable under illuminant changes for categorically balanced backgrounds; (5) Categorical inconstancy exists for categorically unbalanced backgrounds thus indicating that categorical information perceived in the initial stages of adaptation may constrain further categorical perception. | |||||
Address | ||||||
Corporate Author | Thesis | Ph.D. thesis | ||||
Publisher | Place of Publication | Editor | Maria Vanrell;C. Alejandro Parraga | |||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | ISBN | Medium | ||||
Area | Expedition | Conference | ||||
Notes | CIC | Approved | no | |||
Call Number | Admin @ si @ Roc2012 | Serial | 2893 | |||
Permanent link to this record | ||||||
Author ![]() |
Joost Van de Weijer; Shida Beigpour |
![]() ![]() ![]() ![]() ![]() |
||||
Title | The Dichromatic Reflection Model: Future Research Directions and Applications | Type | Conference Article | |||
Year | 2011 | Publication | International Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications | Abbreviated Journal | ||
Volume | Issue | Pages | ||||
Keywords | dblp | |||||
Abstract | The dichromatic reflection model (DRM) predicts that color distributions form a parallelogram in color space, whose shape is defined by the body reflectance and the illuminant color. In this paper we resume the assumptions which led to the DRM and shortly recall two of its main applications domains: color image segmentation and photometric invariant feature computation. After having introduced the model we discuss several limitations of the theory, especially those which are raised once working on real-world uncalibrated images. In addition, we summerize recent extensions of the model which allow to handle more complicated light interactions. Finally, we suggest some future research directions which would further extend its applicability. | |||||
Address | Algarve, Portugal | |||||
Corporate Author | Thesis | |||||
Publisher | SciTePress | Place of Publication | Editor | Mestetskiy, Leonid and Braz, José | ||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | ISBN | 978-989-8425-47-8 | Medium | |||
Area | Expedition | Conference | VISIGRAPP | |||
Notes | CIC | Approved | no | |||
Call Number | Admin @ si @ WeB2011 | Serial | 1778 | |||
Permanent link to this record | ||||||
Author ![]() |
Joost Van de Weijer; Robert Benavente; Maria Vanrell; Cordelia Schmid; Ramon Baldrich; Jacob Verbeek; Diane Larlus |
![]() ![]() ![]() |
||||
Title | Color Naming | Type | Book Chapter | |||
Year | 2012 | Publication | Color in Computer Vision: Fundamentals and Applications | Abbreviated Journal | ||
Volume | Issue | 17 | Pages | 287-317 | ||
Keywords | ||||||
Abstract | ||||||
Address | ||||||
Corporate Author | Thesis | |||||
Publisher | John Wiley & Sons, Ltd. | Place of Publication | Editor | Theo Gevers;Arjan Gijsenij;Joost Van de Weijer;Jan-Mark Geusebroek | ||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | ISBN | Medium | ||||
Area | Expedition | Conference | ||||
Notes | CIC | Approved | no | |||
Call Number | Admin @ si @ WBV2012 | Serial | 2063 | |||
Permanent link to this record | ||||||
Author ![]() |
Joost Van de Weijer; Fahad Shahbaz Khan; Marc Masana |
![]() ![]() ![]() ![]() ![]() |
||||
Title | Interactive Visual and Semantic Image Retrieval | Type | Book Chapter | |||
Year | 2013 | Publication | Multimodal Interaction in Image and Video Applications | Abbreviated Journal | ||
Volume | 48 | Issue | Pages | 31-35 | ||
Keywords | ||||||
Abstract | One direct consequence of recent advances in digital visual data generation and the direct availability of this information through the World-Wide Web, is a urgent demand for efficient image retrieval systems. The objective of image retrieval is to allow users to efficiently browse through this abundance of images. Due to the non-expert nature of the majority of the internet users, such systems should be user friendly, and therefore avoid complex user interfaces. In this chapter we investigate how high-level information provided by recently developed object recognition techniques can improve interactive image retrieval. Wel apply a bagof- word based image representation method to automatically classify images in a number of categories. These additional labels are then applied to improve the image retrieval system. Next to these high-level semantic labels, we also apply a low-level image description to describe the composition and color scheme of the scene. Both descriptions are incorporated in a user feedback image retrieval setting. The main objective is to show that automatic labeling of images with semantic labels can improve image retrieval results. | |||||
Address | ||||||
Corporate Author | Thesis | |||||
Publisher | Springer Berlin Heidelberg | Place of Publication | Editor | Angel Sappa; Jordi Vitria | ||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | 1868-4394 | ISBN | 978-3-642-35931-6 | Medium | ||
Area | Expedition | Conference | ||||
Notes | CIC; 605.203; 600.048 | Approved | no | |||
Call Number | Admin @ si @ WKC2013 | Serial | 2284 | |||
Permanent link to this record | ||||||
Author ![]() |
Joost Van de Weijer; Fahad Shahbaz Khan |
![]() ![]() ![]() ![]() ![]() |
||||
Title | Fusing Color and Shape for Bag-of-Words Based Object Recognition | Type | Conference Article | |||
Year | 2013 | Publication | 4th Computational Color Imaging Workshop | Abbreviated Journal | ||
Volume | 7786 | Issue | Pages | 25-34 | ||
Keywords | Object Recognition; color features; bag-of-words; image classification | |||||
Abstract | In this article we provide an analysis of existing methods for the incorporation of color in bag-of-words based image representations. We propose a list of desired properties on which bases fusing methods can be compared. We discuss existing methods and indicate shortcomings of the two well-known fusing methods, namely early and late fusion. Several recent works have addressed these shortcomings by exploiting top-down information in the bag-of-words pipeline: color attention which is motivated from human vision, and Portmanteau vocabularies which are based on information theoretic compression of product vocabularies. We point out several remaining challenges in cue fusion and provide directions for future research. | |||||
Address | Chiba; Japan; March 2013 | |||||
Corporate Author | Thesis | |||||
Publisher | Springer Berlin Heidelberg | Place of Publication | Editor | |||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | 0302-9743 | ISBN | 978-3-642-36699-4 | Medium | ||
Area | Expedition | Conference | CCIW | |||
Notes | CIC; 600.048 | Approved | no | |||
Call Number | Admin @ si @ WeK2013 | Serial | 2283 | |||
Permanent link to this record | ||||||
Author ![]() |
Jesus Jaime Moreno Escobar |
![]() ![]() ![]() ![]() |
||||
Title | Perceptual Criteria on Image Compresions | Type | Book Whole | |||
Year | 2011 | Publication | PhD Thesis, Universitat Autonoma de Barcelona-CVC | Abbreviated Journal | ||
Volume | Issue | Pages | ||||
Keywords | ||||||
Abstract | Nowadays, digital images are used in many areas in everyday life, but they tend to be big. This increases amount of information leads us to the problem of image data storage. For example, it is common to have a representation a color pixel as a 24-bit number, where the channels red, green, and blue employ 8 bits each. In consequence, this kind of color pixel can specify one of 224 ¼ 16:78 million colors. Therefore, an image at a resolution of 512 £ 512 that allocates 24 bits per pixel, occupies 786,432 bytes. That is why image compression is important. An important feature of image compression is that it can be lossy or lossless. A compressed image is acceptable provided these losses of image information are not perceived by the eye. It is possible to assume that a portion of this information is redundant. Lossless Image Compression is defined as to mathematically decode the same image which was encoded. In Lossy Image Compression needs to identify two features inside the image: the redundancy and the irrelevancy of information. Thus, lossy compression modifies the image data in such a way when they are encoded and decoded, the recovered image is similar enough to the original one. How similar is the recovered image in comparison to the original image is defined prior to the compression process, and it depends on the implementation to be performed. In lossy compression, current image compression schemes remove information considered irrelevant by using mathematical criteria. One of the problems of these schemes is that although the numerical quality of the compressed image is low, it shows a high visual image quality, e.g. it does not show a lot of visible artifacts. It is because these mathematical criteria, used to remove information, do not take into account if the viewed information is perceived by the Human Visual System. Therefore, the aim of an image compression scheme designed to obtain images that do not show artifacts although their numerical quality can be low, is to eliminate the information that is not visible by the Human Visual System. Hence, this Ph.D. thesis proposes to exploit the visual redundancy existing in an image by reducing those features that can be unperceivable for the Human Visual System. First, we define an image quality assessment, which is highly correlated with the psychophysical experiments performed by human observers. The proposed CwPSNR metrics weights the well-known PSNR by using a particular perceptual low level model of the Human Visual System, e.g. the Chromatic Induction Wavelet Model (CIWaM). Second, we propose an image compression algorithm (called Hi-SET), which exploits the high correlation and self-similarity of pixels in a given area or neighborhood by means of a fractal function. Hi-SET possesses the main features that modern image compressors have, that is, it is an embedded coder, which allows a progressive transmission. Third, we propose a perceptual quantizer (½SQ), which is a modification of the uniform scalar quantizer. The ½SQ is applied to a pixel set in a certain Wavelet sub-band, that is, a global quantization. Unlike this, the proposed modification allows to perform a local pixel-by-pixel forward and inverse quantization, introducing into this process a perceptual distortion which depends on the surround spatial information of the pixel. Combining ½SQ method with the Hi-SET image compressor, we define a perceptual image compressor, called ©SET. Finally, a coding method for Region of Interest areas is presented, ½GBbBShift, which perceptually weights pixels into these areas and maintains only the more important perceivable features in the rest of the image. Results presented in this report show that CwPSNR is the best-ranked image quality method when it is applied to the most common image compression distortions such as JPEG and JPEG2000. CwPSNR shows the best correlation with the judgement of human observers, which is based on the results of psychophysical experiments obtained for relevant image quality databases such as TID2008, LIVE, CSIQ and IVC. Furthermore, Hi-SET coder obtains better results both for compression ratios and perceptual image quality than the JPEG2000 coder and other coders that use a Hilbert Fractal for image compression. Hence, when the proposed perceptual quantization is introduced to Hi-SET coder, our compressor improves its numerical and perceptual e±ciency. When ½GBbBShift method applied to Hi-SET is compared against MaxShift method applied to the JPEG2000 standard and Hi-SET, the images coded by our ROI method get the best results when the overall image quality is estimated. Both the proposed perceptual quantization and the ½GBbBShift method are generalized algorithms that can be applied to other Wavelet based image compression algorithms such as JPEG2000, SPIHT or SPECK. | |||||
Address | ||||||
Corporate Author | Thesis | Ph.D. thesis | ||||
Publisher | Ediciones Graficas Rey | Place of Publication | Editor | Xavier Otazu | ||
Language | Summary Language | Original Title | ||||
Series Editor | Series Title | Abbreviated Series Title | ||||
Series Volume | Series Issue | Edition | ||||
ISSN | ISBN | 978-84-938351-3-2 | Medium | |||
Area | Expedition | Conference | ||||
Notes | CIC | Approved | no | |||
Call Number | Admin @ si @ Mor2011 | Serial | 1786 | |||
Permanent link to this record |