|
Oriol Ramos Terrades, Albert Berenguel, & Debora Gil. (2020). A flexible outlier detector based on a topology given by graph communities.
Abstract: Outlier, or anomaly, detection is essential for optimal performance of machine learning methods and statistical predictive models. It is not just a technical step in a data cleaning process but a key topic in many fields such as fraudulent document detection, in medical applications and assisted diagnosis systems or detecting security threats. In contrast to population-based methods, neighborhood based local approaches are simple flexible methods that have the potential to perform well in small sample size unbalanced problems. However, a main concern of local approaches is the impact that the computation of each sample neighborhood has on the method performance. Most approaches use a distance in the feature space to define a single neighborhood that requires careful selection of several parameters. This work presents a local approach based on a local measure of the heterogeneity of sample labels in the feature space considered as a topological manifold. Topology is computed using the communities of a weighted graph codifying mutual nearest neighbors in the feature space. This way, we provide with a set of multiple neighborhoods able to describe the structure of complex spaces without parameter fine tuning. The extensive experiments on real-world data sets show that our approach overall outperforms, both, local and global strategies in multi and single view settings.
|
|
|
Oriol Ramos Terrades, Albert Berenguel, & Debora Gil. (2022). A Flexible Outlier Detector Based on a Topology Given by Graph Communities. BDR - Big Data Research, 29, 100332.
Abstract: Outlier detection is essential for optimal performance of machine learning methods and statistical predictive models. Their detection is especially determinant in small sample size unbalanced problems, since in such settings outliers become highly influential and significantly bias models. This particular experimental settings are usual in medical applications, like diagnosis of rare pathologies, outcome of experimental personalized treatments or pandemic emergencies. In contrast to population-based methods, neighborhood based local approaches compute an outlier score from the neighbors of each sample, are simple flexible methods that have the potential to perform well in small sample size unbalanced problems. A main concern of local approaches is the impact that the computation of each sample neighborhood has on the method performance. Most approaches use a distance in the feature space to define a single neighborhood that requires careful selection of several parameters, like the number of neighbors.
This work presents a local approach based on a local measure of the heterogeneity of sample labels in the feature space considered as a topological manifold. Topology is computed using the communities of a weighted graph codifying mutual nearest neighbors in the feature space. This way, we provide with a set of multiple neighborhoods able to describe the structure of complex spaces without parameter fine tuning. The extensive experiments on real-world and synthetic data sets show that our approach outperforms, both, local and global strategies in multi and single view settings.
Keywords: Classification algorithms; Detection algorithms; Description of feature space local structure; Graph communities; Machine learning algorithms; Outlier detectors
|
|
|
Oriol Ramos Terrades. (2003). Descripcio i classificacio de simbols tecnics usant la transformada de crestetes.
|
|
|
Oriol Ramos Terrades. (2006). Linear Combination of Multiresolution Descriptors: Application to Graphics Recognition (Salvatore Antoine Tabbone, & Ernest Valveny, Eds.). Ph.D. thesis, , .
|
|
|
Oriol Pujol, Sergio Escalera, & Petia Radeva. (2008). An Incremental Node Embedding Technique for Error Correcting Output Codes. PR - Pattern Recognition, 713–725.
|
|
|
Oriol Pujol, Petia Radeva, Jordi Vitria, & J. Mauri. (2004). Adaboost to Classify Plaque Appearance in IVUS Images.
|
|
|
Oriol Pujol, Petia Radeva, & Jordi Vitria. (2005). Traffic sign recognition using an adaptive boosting multiclass framework.
|
|
|
Oriol Pujol, Petia Radeva, & Jordi Vitria. (2006). Discriminant ECOC: A Heuristic Method for Application Dependent Design of Error Correcting Output Codes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(6): 1007–1012.
|
|
|
Oriol Pujol, Petia Radeva, J. Mauri, & E Fernandez-Nofrerias. (2002). Automatic segmentation of lumen in Intravascular Ultrasound Images: An evaluation of texture feature extractors..
|
|
|
Oriol Pujol, & Petia Radeva. (2002). Lumen Detection in Ivus Image Using Snakes in a Statical Framework..
|
|
|
Oriol Pujol, & Petia Radeva. (2003). Texture Segmentation by Statistic Deformable Models. International Journal of Image and Graphics (IJIG).
|
|
|
Oriol Pujol, & Petia Radeva. (2004). Texture Segmentation by Statistical Deformable Models. IJIG - International Journal of Image and Graphics, 433–452.
Abstract: Deformable models have received much popularity due to their ability to include high-level knowledge on the application domain into low-level image processing. Still, most proposed active contour models do not sufficiently profit from the application information and they are too generalized, leading to non-optimal final results of segmentation, tracking or 3D reconstruction processes. In this paper we propose a new deformable model defined in a statistical framework to segment objects of natural scenes. We perform a supervised learning of local appearance of the textured objects and construct a feature space using a set of co-occurrence matrix measures. Linear Discriminant Analysis allows us to obtain an optimal reduced feature space where a mixture model is applied to construct a likelihood map. Instead of using a heuristic potential field, our active model is deformed on a regularized version of the likelihood map in order to segment objects characterized by the same texture pattern. Different tests on synthetic images, natural scene and medical images show the advantages of our statistic deformable model.
Keywords: Texture segmentation, parametric active contours, statistic snakes
|
|
|
Oriol Pujol, & Petia Radeva. (2005). Solving Particularization with Supervised Clustering Competition Scheme. In Pattern Recognition and Image Analysis (IbPRIA 2005), LNCS 3523: 11–18.
|
|
|
Oriol Pujol, & Petia Radeva. (2005). On the assessment of texture descriptors in intravascular ultrasound images: A boosting approach to a feasible plaque classification. In Plaque Imaging: Pixel to Molecular Level, IOS Press, J. Suri et al. (Eds.), 113: 276–299, ISBN: 1–58603–516–9.
|
|
|
Oriol Pujol, & Petia Radeva. (2006). Optimal extension of Error Correcting Output Codes.
|
|