Development and validation of a transformer-based deep learning model for predicting distant metastasis in non-small cell lung cancer using <sup>18</sup>FDG PET/CT images.
Authors
Affiliations (5)
Affiliations (5)
- Department of Radiology, The Affiliated Hospital of Guizhou Medical University, No. 28, Guiyi Street, Yunyan District, Guiyang, 550004, China.
- Department of Anesthesiology, Guizhou Provincial People's Hospital, Guiyang, Guizhou Province, China.
- Department of Nuclear Medicine, The Affiliated Hospital of Guizhou Medical University, Guiyang, 550004, China.
- Department of Ultrasound Center, Affliated Hospital of Guizhou Medical University, Guiyang, 550004, China.
- Department of Radiology, The Affiliated Hospital of Guizhou Medical University, No. 28, Guiyi Street, Yunyan District, Guiyang, 550004, China. [email protected].
Abstract
This study aimed to develop and validate a hybrid deep learning (DL) model that integrates convolutional neural network (CNN) and vision transformer (ViT) architectures to predict distant metastasis (DM) in patients with non-small cell lung cancer (NSCLC) using <sup>18</sup>F-FDG PET/CT images. A retrospective analysis was conducted on a cohort of consecutively registered patients who were newly diagnosed and untreated for NSCLC. A total of 167 patients with available PET/CT images were included in the analysis. DL features were extracted using a combination of CNN and ViT architectures, followed by feature selection, model construction, and evaluation of model performance using the receiver operating characteristic (ROC) and the area under the curve (AUC). The ViT-based DL model exhibited strong predictive capabilities in both the training and validation cohorts, achieving AUCs of 0.824 and 0.830 for CT features, and 0.602 and 0.694 for PET features, respectively. Notably, the model that integrated both PET and CT features demonstrated a notable AUC of 0.882 in the validation cohort, outperforming models that utilized either PET or CT features alone. Furthermore, this model outperformed the CNN model (ResNet 50), which achieved an AUC of 0.752 [95% CI 0.613, 0.890], p < 0.05. Decision curve analysis further supported the efficacy of the ViT-based DL model. The ViT-based DL developed in this study demonstrates considerable potential in predicting DM in patients with NSCLC, potentially informing the creation of personalized treatment strategies. Future validation through prospective studies with larger cohorts is necessary.