|
Oriol Pujol, & Petia Radeva. (2002). Lumen Detection in Ivus Image Using Snakes in a Statical Framework..
|
|
|
Bonifaz Stuhr, Jurgen Brauer, Bernhard Schick, & Jordi Gonzalez. (2023). Masked Discriminators for Content-Consistent Unpaired Image-to-Image Translation.
Abstract: A common goal of unpaired image-to-image translation is to preserve content consistency between source images and translated images while mimicking the style of the target domain. Due to biases between the datasets of both domains, many methods suffer from inconsistencies caused by the translation process. Most approaches introduced to mitigate these inconsistencies do not constrain the discriminator, leading to an even more ill-posed training setup. Moreover, none of these approaches is designed for larger crop sizes. In this work, we show that masking the inputs of a global discriminator for both domains with a content-based mask is sufficient to reduce content inconsistencies significantly. However, this strategy leads to artifacts that can be traced back to the masking process. To reduce these artifacts, we introduce a local discriminator that operates on pairs of small crops selected with a similarity sampling strategy. Furthermore, we apply this sampling strategy to sample global input crops from the source and target dataset. In addition, we propose feature-attentive denormalization to selectively incorporate content-based statistics into the generator stream. In our experiments, we show that our method achieves state-of-the-art performance in photorealistic sim-to-real translation and weather translation and also performs well in day-to-night translation. Additionally, we propose the cKVD metric, which builds on the sKVD metric and enables the examination of translation quality at the class or category level.
|
|
|
Maria Vanrell, & Jordi Vitria. (1993). Mathematical Morphology, Granulometries and Texture Perception..
|
|
|
Carme Julia. (2008). Missig Data Matrix Factorization Addressing the Structure from Motion Problem.
|
|
|
Hao Wu, Alejandro Ariza-Casabona, Bartłomiej Twardowski, & Tri Kurniawan Wijaya. (2023). MM-GEF: Multi-modal representation meet collaborative filtering.
Abstract: In modern e-commerce, item content features in various modalities offer accurate yet comprehensive information to recommender systems. The majority of previous work either focuses on learning effective item representation during modelling user-item interactions, or exploring item-item relationships by analysing multi-modal features. Those methods, however, fail to incorporate the collaborative item-user-item relationships into the multi-modal feature-based item structure. In this work, we propose a graph-based item structure enhancement method MM-GEF: Multi-Modal recommendation with Graph Early-Fusion, which effectively combines the latent item structure underlying multi-modal contents with the collaborative signals. Instead of processing the content feature in different modalities separately, we show that the early-fusion of multi-modal features provides significant improvement. MM-GEF learns refined item representations by injecting structural information obtained from both multi-modal and collaborative signals. Through extensive experiments on four publicly available datasets, we demonstrate systematical improvements of our method over state-of-the-art multi-modal recommendation methods.
|
|
|
Daniel Ponsa, & Jordi Vitria. (1999). Mobile monitoring system using an agent-oriented approach.
|
|
|
Md. Mostafa Kamal Sarker, Hatem A. Rashwan, Mohamed Abdel-Nasser, Vivek Kumar Singh, Syeda Furruka Banu, Farhan Akram, et al. (2019). MobileGAN: Skin Lesion Segmentation Using a Lightweight Generative Adversarial Network.
Abstract: CoRR abs/1907.00856
Skin lesion segmentation in dermoscopic images is a challenge due to their blurry and irregular boundaries. Most of the segmentation approaches based on deep learning are time and memory consuming due to the hundreds of millions of parameters. Consequently, it is difficult to apply them to real dermatoscope devices with limited GPU and memory resources. In this paper, we propose a lightweight and efficient Generative Adversarial Networks (GAN) model, called MobileGAN for skin lesion segmentation. More precisely, the MobileGAN combines 1D non-bottleneck factorization networks with position and channel attention modules in a GAN model. The proposed model is evaluated on the test dataset of the ISBI 2017 challenges and the validation dataset of ISIC 2018 challenges. Although the proposed network has only 2.35 millions of parameters, it is still comparable with the state-of-the-art. The experimental results show that our MobileGAN obtains comparable performance with an accuracy of 97.61%.
|
|
|
Jose Manuel Alvarez, & Antonio Lopez. (2009). Model-based road detection using shadowless features and on-line learning.
|
|
|
David Guillamet, B. Moghaddam, & Jordi Vitria. (2003). Modeling High-Order Dependencies in Local Appearance Models.
|
|
|
Cristina Cañero, E Fernandez-Nofrerias, J. Mauri, & Petia Radeva. (2002). Modelling the Acquisition Geometry of a C-arm Angiography System for 3D Reconstruction..
|
|
|
Hannes Mueller, Andre Groger, Jonathan Hersh, Andrea Matranga, & Joan Serrat. (2020). Monitoring War Destruction from Space: A Machine Learning Approach.
Abstract: Existing data on building destruction in conflict zones rely on eyewitness reports or manual detection, which makes it generally scarce, incomplete and potentially biased. This lack of reliable data imposes severe limitations for media reporting, humanitarian relief efforts, human rights monitoring, reconstruction initiatives, and academic studies of violent conflict. This article introduces an automated method of measuring destruction in high-resolution satellite images using deep learning techniques combined with data augmentation to expand training samples. We apply this method to the Syrian civil war and reconstruct the evolution of damage in major cities across the country. The approach allows generating destruction data with unprecedented scope, resolution, and frequency – only limited by the available satellite imagery – which can alleviate data limitations decisively.
|
|
|
Jordi Vitria, X. Binefa, & Juan J. Villanueva. (1992). Morphological Algorithms for Visual Analysis of Integrated Circuits..
|
|
|
D. Seron, F. Moreso, C. Gratin, & Jordi Vitria. (1995). Morphological Granulometries and Quantification of Interstitial Chronic Renal Damage.
|
|
|
Carme Julia, Joan Serrat, Antonio Lopez, Felipe Lumbreras, & Daniel Ponsa. (2006). Motion segmentation through factorization. Application to night driving assistance.
|
|
|
David Lloret, Joan Serrat, Antonio Lopez, & Juan J. Villanueva. (2002). Motion-induced error correction in ultrasound imaging..
|
|