<sup>18</sup>F-FDG PET/CT-based Radiomics Analysis of Different Machine Learning Models for Predicting Pathological Highly Invasive Non-small Cell Lung Cancer.
Authors
Affiliations (6)
Affiliations (6)
- Department of Nuclear Medicine, Shanghai Pulmonary Hospital, Tongji University School of Medicine, 507 Zheng Min Road, Shanghai 200433, China (Y.L., Q-Q.Z., Q-P.Z., L-Y.H., L.Z.).
- Department of Ultrasound, Shanghai Pulmonary Hospital, Tongji University School of Medicine, 507 Zheng Min Road, Shanghai 200433, China (M-J.S., Y.W.).
- Shanghai Pulmonary Hospital, Tongji University School of Medicine, 507 Zheng Min Road, Shanghai 200433, China (J-W.Y.).
- Department of PET/CT Center, Shandong Cancer Hospital and Institute, Shandong First Medical University and Shandong Academy of Medical Sciences, Jinan 250117, Shandong, China (J-J.Q., W-H.L.).
- Department of Radiology, Shandong Cancer Hospital and Institute, Shandong First Medical University and Shandong Academy of Medical Sciences, Jinan 250117, Shandong, China (X-D.W.).
- Department of Ultrasound, Shanghai Pulmonary Hospital, Tongji University School of Medicine, 507 Zheng Min Road, Shanghai 200433, China (M-J.S., Y.W.). Electronic address: [email protected].
Abstract
This study aimed to develop and validate machine learning models integrating clinicoradiological and radiomic features from 2-[18 F]-fluoro-2-deoxy-D-glucose (<sup>18</sup>F-FDG) positron emission tomography/computed tomography (PET/CT) to predict pathological high invasiveness in cT1-sized (tumor size ≤ 3 cm) non-small cell lung cancer (NSCLC). We retrospectively reviewed 1459 patients with NSCLC (633 with pathological high invasiveness and 826 with pathological non-high invasiveness) from two medical centers. Patients with cT1-sized NSCLC were included. 1145 radiomic features were extracted per modality (PET and CT) from each patient. Optimal predictors were selected to construct a radiomics score (Rad-score) for the PET/CT radiomics model. A combined model incorporating significant clinicoradiological features and the Rad-score was developed. Logistic regression (LR), random forest (RF), support vector machine (SVM), and extreme gradient boosting (XGBoost) algorithms were used to train the combined model. Model performance was assessed the area under the receiver operating characteristic (ROC) curve (AUC), calibration curve, and decision curve analysis (DCA). Shapley Additive Explanations (SHAP) was applied to visualize the prediction process. The radiomics model was built using 11 radiomic features, achieving AUCs of 0.851 (training), 0.859 (internal validation), and 0.829 (external validation). Among all models, the XGBoost combined model demonstrated the best predictive performance, with AUCs of 0.958, 0.919, and 0.903, respectively, along with good calibration and high net benefit. The XGBoost combined model showed strong performance in predicting pathological high invasiveness in cT1-sized NSCLC.