TY - JOUR AU - Victor M. Campello AU - Carlos Martin-Isla AU - Cristian Izquierdo AU - Andrea Guala AU - Jose F. Rodriguez Palomares AU - David Vilades AU - Martin L. Descalzo AU - Mahir Karakas AU - Ersin Cavus AU - Zahra Zahra Raisi-Estabragh AU - Steffen E. Petersen AU - Sergio Escalera AU - Santiago Segui AU - Karim Lekadir PY - 2022// TI - Minimising multi-centre radiomics variability through image normalisation: a pilot study T2 - ScR JO - Scientific Reports SP - 12532 VL - 12 IS - 1 PB - Springer Nature N2 - Radiomics is an emerging technique for the quantification of imaging data that has recently shown great promise for deeper phenotyping of cardiovascular disease. Thus far, the technique has been mostly applied in single-centre studies. However, one of the main difficulties in multi-centre imaging studies is the inherent variability of image characteristics due to centre differences. In this paper, a comprehensive analysis of radiomics variability under several image- and feature-based normalisation techniques was conducted using a multi-centre cardiovascular magnetic resonance dataset. 218 subjects divided into healthy (n = 112) and hypertrophic cardiomyopathy (n = 106, HCM) groups from five different centres were considered. First and second order texture radiomic features were extracted from three regions of interest, namely the left and right ventricular cavities and the left ventricular myocardium. Two methods were used to assess features’ variability. First, feature distributions were compared across centres to obtain a distribution similarity index. Second, two classification tasks were proposed to assess: (1) the amount of centre-related information encoded in normalised features (centre identification) and (2) the generalisation ability for a classification model when trained on these features (healthy versus HCM classification). The results showed that the feature-based harmonisation technique ComBat is able to remove the variability introduced by centre information from radiomic features, at the expense of slightly degrading classification performance. Piecewise linear histogram matching normalisation gave features with greater generalisation ability for classification ( balanced accuracy in between 0.78 ± 0.08 and 0.79 ± 0.09). Models trained with features from images without normalisation showed the worst performance overall ( balanced accuracy in between 0.45 ± 0.28 and 0.60 ± 0.22). In conclusion, centre-related information removal did not imply good generalisation ability for classification. UR - http://dx.doi.org/10.1038/s41598-022-16375-0 N1 - HuPBA ID - Victor M. Campello2022 ER -