Back to all papers

Optimizing recurrence prediction and risk stratification in prostate cancer using a 2.5D deep learning model: a multicenter MRI-based study.

December 19, 2025pubmed logopapers

Authors

Li F,Liu R,Wang P,Yue L,Hu P,Liu X,Yang L,Ruan Q,Wu S,Feng R,Chen Y,Zhou M,Yang J,Wang F,Qu H,Ning G,Zhuo L

Affiliations (6)

  • Department of Radiology, The Third Hospital of Mianyang, Sichuan Mental Health Center, Mianyang, China.
  • Department of Radiology, General Hospital of Ningxia Medical University, Yinchuan, China.
  • Department of Radiology, Panzhihua Central Hospital, Panzhihua, China.
  • Department of Radiology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.
  • Department of Radiology, Luzhou Hospital of Traditional Chinese Medicine, Luzhou, China.
  • Department of Radiology, West China Second Hospital, Sichuan University, Chengdu, China.

Abstract

High tumor recurrence after surgery remains a significant challenge in managing prostate cancer (PCa). We aimed to develop and validate a 2.5D deep learning model based on a transformer architecture utilizing T2WI, ADC, DWI, and CE-T1WI images for the preoperative prediction of biochemical recurrence (BCR) in PCa, and to further investigate its capability for risk stratification. A total of 923 PCa patients (10 153 images) who underwent radical prostatectomy (RP) at five tertiary medical centers were retrospectively enrolled, with follow-up completed by September 2024. Among the five evaluated classifiers, ResNet18 was selected as the best-performing backbone for feature extraction. A Transformer-based deep learning (DL) model was developed using preoperative mpMRI data, and a deep learning fusion (DLF) model was constructed by integrating DL scores with weighted clinical variables, and its performance was compared with the traditional clinical risk score (CAPRA), a clinical model (Clinical), an ensemble learning model (Ensemble), and a multiple instance learning model (MIL). Model performance was evaluated using receiver operating characteristic (ROC) curves. Model comparisons were conducted using the DeLong test, decision curve analysis (DCA) and calibration curves were used to assess the clinical utility and calibration of the models. Furthermore, Grad-CAM was used to visualize model attention and improve interpretability. The DLF model exhibited excellent performance in both the validation and test sets. It achieved an AUC of 0.938 (95% CI: 0.884-0.992) in the internal validation cohort and an AUC of 0.935 (95% CI: 0.900-0.969) in the external test cohort. The DLF model significantly outperformed all unimodal models (P < 0.05, DeLong test), improving AUC by 0.058-0.253 over the clinical model and 0.189-0.299 over CAPRA. Furthermore, the DLF model enabled effective risk stratification, with high-risk patients showing significantly poorer prognostic outcomes than low-risk patients (P < 0.05). It also demonstrated significant predictive power for recurrence events at multiple time points. Specifically, as measured by time-dependent AUC values, the model's predictive performance at 1, 2, and 3 years was 0.866 (95% CI: 0.810-0.921), 0.867 (95% CI: 0.816-0.918), and 0.879 (95% CI: 0.832-0.926), respectively. The stacked DLF model demonstrated the capability to predict early postoperative recurrence in patients with PCa and effectively identify high-risk cases, highlighting its potential as a valuable tool for optimizing treatment strategies and postoperative surveillance.

Topics

Journal Article

Ready to Sharpen Your Edge?

Subscribe to join 7,400+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.