Automated machine learning for prostate cancer detection and Gleason score prediction using T2WI: a diagnostic multi-center study.
Authors
Affiliations (6)
Affiliations (6)
- Radiology Department, Huadong Hospital, Fudan University, No.221# West Yan'an Road, Shanghai, 200040, China.
- Radiology Department, Huashan Hospital, Affiliated with Fudan University, No.12# Middle Urmuqi Road, Shanghai, 200040, China.
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH, USA.
- Radiology Department, Huadong Hospital, Fudan University, No.221# West Yan'an Road, Shanghai, 200040, China. [email protected].
- Radiology Department, Huashan Hospital, Affiliated with Fudan University, No.12# Middle Urmuqi Road, Shanghai, 200040, China. [email protected].
- Radiology Department, Huashan Hospital, Affiliated with Fudan University, No.12# Middle Urmuqi Road, Shanghai, 200040, China. [email protected].
Abstract
Prostate cancer (PCa) is one of the most common malignancies in men, and accurate assessment of tumor aggressiveness is crucial for treatment planning. The Gleason score (GS) remains the gold standard for risk stratification, yet it relies on invasive biopsy, which has inherent risks and sampling errors. The aim of this study was to detect PCa and non-invasively predict the GS for the early detection and stratification of clinically significant cases. We used single-modality T2-weighted imaging (T2WI) with an automatic machine-learning (ML) approach, MLJAR. The internal dataset comprised PCa patients who underwent magnetic resonance imaging (MRI) examinations at our hospital from September 2015 to June 2022 prior to prostate biopsy, surgery, radiotherapy, and endocrine therapy and whose examinations resulted in pathological findings. An external dataset from another medical center and a public challenge dataset were used for external validation. The Kolmogorov-Smirnov curve was used to evaluate the risk-differentiation ability of the PCa detection model. The area under the receiver operating characteristic curve (AUC) was calculated with confidence intervals to compare the model performance. The internal MRI dataset included 198 non-PCa and 291 PCa patients with histopathological results obtained through biopsy or surgery. External and public challenge datasets included 45 and 68 PCa patients, respectively. AUC for PCa detection in the internal-testing cohort (n = 147, PCa = 78) was 0.99. For GS prediction, AUCs were GS = 3 + 3 (0.97), GS = 3 + 4 (0.97), GS = 3 + 5 (1.0), GS = 4 + 3 (0.87), GS = 4 + 4 (0.91), GS = 4 + 5 (0.95), GS = 5 + 4 (1.0), and GS = 5 + 5 (0.99) in the internal-testing cohort (PCa = 88); GS = 3 + 3 (0.95), GS = 3 + 4 (0.76); GS = 3 + 5 (0.77), GS = 4 + 3 (0.88), GS = 4 + 4 (0.82), GS = 4 + 5 (0.87), GS = 5 + 4 (0.95), and GS = 5 + 5 (0.85) in the external-testing cohort (PCa = 45); and GS = 3 + 4 (0.89), GS = 4 + 3 (0.75), GS = 4 + 4 (0.65), and GS = 4 + 5 (0.91) in the public challenge cohort (PCa = 68). This multi-center study shows that an auto-ML model using only T2WI can accurately detect PCa and predict Gleason scores non-invasively, offering potential to reduce biopsy reliance and improve early risk stratification. These results warrant further validation and exploration for integration into clinical workflows.