Fusion of Deep Transfer Learning and Radiomics in MRI-Based Prediction of Post-Surgical Recurrence in Soft Tissue Sarcoma.
Authors
Affiliations (3)
Affiliations (3)
- Department of Radiology, Peking University People's Hospital, No.11, Xizhimen South Street, Xicheng District, Beijing, 100044, China.
- Department of Radiology, The Affiliated Hospital of Qingdao University, 266003, Qingdao, China.
- Department of Radiology, Peking University People's Hospital, No.11, Xizhimen South Street, Xicheng District, Beijing, 100044, China. [email protected].
Abstract
Soft tissue sarcomas (STS) are heterogeneous malignancies with high recurrence rates (33-39%) post-surgery, necessitating improved prognostic tools. This study proposes a fusion model integrating deep transfer learning and radiomics from MRI to predict postoperative STS recurrence. Axial T2-weighted fat-suppressed imaging (T<sub>2</sub>WI) of 803 STS patients from two institutions was retrospectively collected and divided into training (n = 527), internal validation (n = 132), and external validation (n = 144) cohorts. Tumor segmentation was performed using the SegResNet model within the Auto3DSeg framework. Radiomic features and deep learning features were extracted. Feature selection employed LASSO regression, and the deep learning radiomic (DLR) model combined radiomic and deep learning signatures. Using the features, nine models were constructed based on three classifiers. The area under the receiver operating characteristic curve (AUC), sensitivity, specificity, accuracy, negative predictive value, and positive predictive value were calculated for performance evaluation. The SegResNet model achieved Dice coefficients of 0.728 after refinement. Recurrence rates were 22.8% (120/527) in the training, 25.0% (33/132) in the internal validation, and 32.6% (47/144) in the external validation cohorts. The DLR model (ExtraTrees) demonstrated superior performance, achieving an AUC of 0.818 in internal validation and 0.809 in external validation, better than the radiomic model (0.710, 0.612) and the deep learning model (0.751, 0.667). Sensitivity and specificity ranged from 0.702 to 0.976 and 0.732 to 0.830, respectively. Decision curve analysis confirmed superior clinical utility. The DLR model provides a robust, non-invasive tool for preoperative STS recurrence prediction, enabling personalized treatment decisions and postoperative management.