Ultrasound and Clinicopathological Features-Based Machine Learning Model for Predicting Neoadjuvant Therapy Efficacy in Breast Cancer.
Authors
Affiliations (1)
Affiliations (1)
- Department of Ultrasound Medicine, The Fourth Hospital of Hebei Medical University, Shijiazhuang City, Hebei Province, China.
Abstract
Accurate prediction of pathological complete response (pCR) after neoadjuvant therapy (NAT) remains challenging in breast cancer. Conventional imaging modalities, such as ultrasound and magnetic resonance imaging (MRI), have limited accuracy when used alone. To develop and validate a machine learning model integrating ultrasound imaging features and clinicopathological information for non-invasive and individualized prediction of pCR following NAT in breast cancer patients. This retrospective study included 609 breast cancer patients who underwent NAT. Ultrasound imaging features and clinicopathological variables were collected and analyzed. Data preprocessing was performed using Python and R. The diagnostic performance of ultrasound and MRI for predicting pCR was evaluated as a baseline. Significant predictors were identified through univariate and multivariate analyses. Three machine learning models-Random Forest, Logistic Regression, and Support Vector Machine-were developed and validated. Model performance was assessed using receiver operating characteristic (ROC) curves and decision curve analysis, while SHAP analysis and feature importance rankings were used to evaluate variable contributions. The Random Forest model achieved the best performance, with an AUC of 0.85 and an accuracy of 84.7%, outperforming conventional imaging assessments. Key predictors included early NAT tumor volume reduction ≥ 80%, increased echogenicity, HER2 positivity, and higher tumor-infiltrating lymphocyte levels. The Random Forest model substantially improved prediction of pCR after NAT in breast cancer and may provide a practical, non-invasive tool to support individualized treatment planning, and clinical decision-making.