CT radiomics-based explainable machine learning model for accurate differentiation of malignant and benign endometrial tumors: a two-center study.
Authors
Affiliations (8)
Affiliations (8)
- Gynecology Department, Qingdao Traditional Chinese Medicine Hospital, Qingdao Hiser Hospital Affiliated of Qingdao University, Qingdao, 266000, China.
- Sichuan University-Pittsburgh Institute, Sichuan University, Chengdu, 610000, China.
- Department of Obstetrics and Gynecology, Qingbaijiang Women's and Children's Hospital (Maternal and Child Health Hospital), West China Second University Hospital, Sichuan University, Chengdu, 610300, China.
- College of Computer Science, Sichuan University, Chengdu, 610000, Sichuan, China.
- Radiology Department, Qingdao Hiser Hospital Affiliated of Qingdao University (Qingdao Traditional Chinese Medicine Hospital), Qingdao, 266000, China.
- Department of Traditional Chinese Medicine, Jiaozhou Traditional Chinese Medicine Hospital, Qingdao, 266000, China.
- Adult Traditional Chinese Medicine Department, Qingdao Women and Children's Hospital, Qingdao, 266000, China.
- Gynecology Department, Qingdao Traditional Chinese Medicine Hospital, Qingdao Hiser Hospital Affiliated of Qingdao University, Qingdao, 266000, China. [email protected].
Abstract
This study aimed to develop and validate a CT radiomics-based explainable machine learning model for precise diagnosing of malignancy and benignity specifically in endometrial cancer (EC) patients. A total of 83 EC patients from two centers, including 46 with malignant and 37 with benign conditions, were included, with data split into a training set (n = 59) and a testing set (n = 24). The regions of interest (ROIs) were manually segmented from pre-surgical CT scans, and 1132 radiomic features were extracted from the pre-surgical CT scans using Pyradiomics. Six explainable machine learning (ML) modeling algorithms were implemented, respectively, for determining the optimal radiomics pipeline. The diagnostic performance of the radiomic model was evaluated by using sensitivity, specificity, accuracy, precision, F1 score, area under the receiver operating characteristic curve (AUROC), and area under the precision-recall curve (AUPRC). To enhance clinical understanding and usability, we separately implemented SHAP analysis and feature mapping visualization and evaluated the calibration curve and decision curve. By comparing six modeling strategies, the Random Forest model emerged as the optimal choice for diagnosing EC, with a training AUROC of 1.00 and a testing AUROC of 0.96. SHAP identified the most important radiomic features, revealing that all selected features were significantly associated with EC (p < 0.05). Radiomics feature maps also provide a feasible assessment tool for clinical applications. Decision curve analysis (DCA) indicated a higher net benefit for our model compared to the "All" and "None" strategies, suggesting its clinical utility in identifying high-risk cases and reducing unnecessary interventions. CT radiomics-based explainable ML model achieved high diagnostic performance, which could be used as an intelligent auxiliary tool for the diagnosis of endometrial cancer.