Machine Learning-Based Diagnostic Prediction Model Using T1-Weighted Striatal Magnetic Resonance Imaging for Early-Stage Parkinson's Disease Detection.
Authors
Affiliations (6)
Affiliations (6)
- Medical Science Center, Federal University of Pernambuco, Recife, Brazil (A.R.M.A., L.H.C., D.P.C.F.B.). Electronic address: [email protected].
- Nuclear Medicine, Clinical Hospital of Federal University of Pernambuco, Recife, Brazil (V.O.M., F.A.M.).
- Medical Science Center, Federal University of Pernambuco, Recife, Brazil (A.R.M.A., L.H.C., D.P.C.F.B.).
- Center for Exact and Natural Sciences, Federal University of Pernambuco, Recife, Brazil (M.L.).
- Radiology Department, Hospital Universitário Prof. Edgard Santos, Bahia, Brazil (M.A.D.M.).
- Telehealth Unit, Medical Science Center, Federal University of Pernambuco, Recife, Brazil (P.R.B.D.).
Abstract
Diagnosing Parkinson's disease (PD) typically relies on clinical evaluations, often detecting it in advanced stages. Recently, artificial intelligence has increasingly been applied to imaging for neurodegenerative disorders. This study aims to develop a diagnostic prediction model using T1-weighted magnetic resonance imaging (T1-MRI) data from the caudate and putamen in individuals with early-stage PD. This retrospective case-control study included 69 early-stage PD patients and 22 controls, recruited through the Parkinson's Progression Markers Initiative. T1-MRI scans were acquired using a 3-tesla system. 432 radiomic features were extracted from images of the segmented caudate and putâmen in an automated way. Feature selection was performed using Pearson's correlation and recursive feature elimination to identify the most relevant variables. Three machine learning algorithms-random forest (RF), support vector machine and logistic regression-were evaluated for diagnostic prediction effectiveness using a cross-validation method. The Shapley Additive Explanations technique identified the most significant features distinguishing between the groups. The metrics used to evaluate the performance were discrimination, expressed in area under the ROC curve (AUC), sensitivity and specificity; and calibration, expressed as accuracy. The RF algorithm showed superior performance with an average accuracy of 92.85%, precision of 100.00%, sensitivity of 86.66%, specificity of 96.65% and AUC of 0.93. The three most influential features were contrast, elongation, and gray-level non-uniformity, all from the putamen. Machine learning-based models can differentiate early-stage PD from controls using T1-weighted MRI radiomic features.