Deep integration of clinical metadata with [<sup>18</sup>F]FDG PET/CT imaging for histological subtyping in non-small cell lung cancer: a multi-center study.
Authors
Affiliations (21)
Affiliations (21)
- Department of Clinical Engineering, The Second Affiliated Hospital of Zhejiang University School of Medicine, Hangzhou, Zhejiang, 310009, China.
- Institute of Nuclear Medicine and Molecular Imaging of Zhejiang University, Hangzhou, Zhejiang, 310009, China.
- Key Laboratory of Medical Molecular Imaging of Zhejiang Province, Hangzhou, Zhejiang, 310009, China.
- College of Biomedical Engineering & Instrument Science, Zhejiang University, Hangzhou, Zhejiang, 310027, China.
- Department of Nuclear Medicine, Zhejiang Cancer Hospital, Hangzhou Institute of Medicine, Chinese Academy of Sciences, Hangzhou, Zhejiang, 310022, China.
- Key Laboratory for Biomedical Engineering of Ministry of Education, Zhejiang University, Hangzhou, Zhejiang, 310027, China.
- The Polytechnic Institute, Zhejiang University, Hangzhou, Zhejiang, 310015, China.
- Department of Nuclear Medicine, The First Hospital of China Medical University, 155 Nanjing St, Shenyang, Liaoning, 110001, China.
- Department of Nuclear Medicine, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, China.
- Department of Nuclear Medicine, The Affiliated Guangdong Second Provincial General Hospital of Jinan University, Guangzhou, Guangdong, 510317, China.
- Department of Nuclear Medicine, The Second School of Clinical Medicine, Southern Medical University, Guangzhou, Guangdong, 510317, China.
- Institute of Nuclear Medicine and Molecular Imaging of Zhejiang University, Hangzhou, Zhejiang, 310009, China. [email protected].
- Key Laboratory of Medical Molecular Imaging of Zhejiang Province, Hangzhou, Zhejiang, 310009, China. [email protected].
- Department of Nuclear Medicine, Zhejiang Cancer Hospital, Hangzhou Institute of Medicine, Chinese Academy of Sciences, Hangzhou, Zhejiang, 310022, China. [email protected].
- Key Laboratory for Biomedical Engineering of Ministry of Education, Zhejiang University, Hangzhou, Zhejiang, 310027, China. [email protected].
- Human Phenome Institute, Fudan University, Shanghai, 201203, China. [email protected].
- Institute of Nuclear Medicine and Molecular Imaging of Zhejiang University, Hangzhou, Zhejiang, 310009, China. [email protected].
- Key Laboratory of Medical Molecular Imaging of Zhejiang Province, Hangzhou, Zhejiang, 310009, China. [email protected].
- College of Biomedical Engineering & Instrument Science, Zhejiang University, Hangzhou, Zhejiang, 310027, China. [email protected].
- Department of Nuclear Medicine, Zhejiang Cancer Hospital, Hangzhou Institute of Medicine, Chinese Academy of Sciences, Hangzhou, Zhejiang, 310022, China. [email protected].
- Key Laboratory for Biomedical Engineering of Ministry of Education, Zhejiang University, Hangzhou, Zhejiang, 310027, China. [email protected].
Abstract
To develop and validate a multimodal deep learning framework that integrates clinical metadata with [<sup>18</sup>F]FDG PET/CT imaging to resolve overlapping metabolic phenotypes. The primary objective is the histological subtyping of non-small cell lung cancer (NSCLC), utilizing binary clinical staging (early vs. advanced) strategically as an auxiliary regularization task. A multi-center surgical NSCLC cohort (n = 780) was partitioned into a development set (n = 675) and an independent external test set (n = 105). The framework first utilized a 3D Transformer for bounding-box-based tumor localization. Subsequently, a multi-task network employed Feature-wise Linear Modulation (FiLM) to dynamically inject clinical metadata into the visual backbone. For histological subtyping of adenocarcinoma versus squamous cell carcinoma, in the validation cohort, the proposed multimodal framework achieved the highest area under the receiver operating characteristic curve (AUC) of 0.894 (95% CI: 0.813-0.959), significantly outperforming the conventional radiomics baseline (AUC = 0.796, DeLong test P = 0.017) and the clinical-only baseline (AUC = 0.759, P = 0.004). On the internal test set, the multimodal model maintained an AUC of 0.832 (95% CI: 0.744-0.906), outperforming competing models numerically, though differences did not reach statistical significance (all P > 0.11). On the independent external test cohort, the multimodal framework demonstrated superior cross-center stability, maintaining an AUC of 0.787 (95% CI: 0.687-0.876). On the external cohort, the between-model AUC differences did not reach statistical significance against the clinical-only model (AUC of 0.740, P = 0.480) or the image-only model (AUC of 0.685, P = 0.082). Nevertheless, the multimodal framework achieved the highest F1-score and yielded the most optimal net clinical benefit across a wide range of threshold probabilities in decision curve analysis. For the intrinsically challenging auxiliary staging task, the unguided image-only network exhibited severe vulnerability, however, the FiLM-based multimodal mechanism effectively enhanced diagnostic capacity by employing systemic clinical priors, improving the AUC to 0.656. Combining 3D detection with an early clinico-biological fusion strategy effectively enhances NSCLC characterization on [<sup>18</sup>F]FDG PET/CT, which has the potential to mitigate the limitations of single-modality imaging in resolving diagnostically ambiguous cases characterized by overlapping [<sup>18</sup>F]FDG uptake phenotypes, thereby providing a non-invasive decision-support tool in the precision management of NSCLC.