Unsupervised phenotypic clustering of cardiac MRI data reveals distinct subgroups associated with outcomes in ischemic cardiomyopathy.
Authors
Affiliations (9)
Affiliations (9)
- Cardiac Imaging Unit, Wythenshawe Hospital, Manchester University NHS Foundation Trust, Trust Southmoor Rd, Manchester, M23 9LT, UK. [email protected].
- Institute of Cardiovascular Sciences, University of Manchester, Manchester, UK. [email protected].
- Cardiac Electrophysiology Unit, Department of Biomedical Sciences, Humanitas University, Milan, 20090, Italy.
- Cardiac Electrophysiology, Cardiovascular Medicine Division, Hospital of the University of Pennsylvania, Philadelphia, PA, USA.
- Institute of Cardiovascular Sciences, University of Manchester, Manchester, UK.
- Royal Brompton and Harefield Hospitals, Guy's and St Thomas' NHS Foundation Trust, London, UK.
- Cardiothoracic Department, Santa Maria della Misericordia University Hospital, Udine, Italy.
- Institute for Maternal and Child Health-IRCCS "Burlo Garofolo", Trieste, Italy.
- Cardiovascular Department, Azienda Sanitaria Universitaria Integrata, Trieste, Italy.
Abstract
Ischemic cardiomyopathy (ICM) shows significant heterogeneity in clinical outcomes, challenging traditional risk stratification methods. Cardiac magnetic resonance (CMR) imaging offers detailed insights into myocardial structure and function, yet integrating this multidimensional data remains complex. Aim of the current study was to assess whether unsupervised machine learning could help identify distinct phenotypic subgroups and enhance prognostic accuracy. This study included 319 clinically stable ICM patients. CMR-derived variables, including left ventricular ejection fraction (LVEF), ventricular volumes, and myocardial scar burden, were analysed using KAMILA clustering algorithm. The optimal number of clusters was determined through silhouette analysis, within-cluster sum of squares, and gap statistics. Principal Component Analysis (PCA) visualized the clustering results, and prognostic value was assessed using Cox regression and Kaplan-Meier survival analysis. SHAP (SHapley Additive exPlanations) values were used to evaluate feature importance. Two distinct phenotypic clusters were identified. Cluster 1 (n = 219) demonstrated better cardiac function, with higher LVEF, smaller ventricular volumes, and lower scar burden. Cluster 2 (n = 100) indicated advanced disease, with lower LVEF, larger volumes, higher scar burden, and greater midwall fibrosis. PCA confirmed clear separation between clusters, explaining 62.6% of the variance. After a median follow-up of 13 months, the composite endpoint was observed in 37 (12%) patients. Patients in Cluster 2 had a significantly higher risk of experiencing the composite outcome (HR = 3.96, p < 0.001). SHAP analysis identified ischaemic scar burden, sphericity index, and midwall fibrosis as key predictors of outcomes. Unsupervised clustering of CMR-derived variables identified distinct ICM phenotypes with important prognostic implications. This method improves risk stratification and could help tailor personalised treatment plans, highlighting the potential of machine learning in understanding ICM heterogeneity.