Multisequence MRI-driven assessment of PD-L1 expression in non-small cell lung cancer: a pilot study.
Authors
Affiliations (11)
Affiliations (11)
- Physics Department, University of Pavia, Pavia, Italy.
- (INFN) National Institute for Nuclear Physics, Unit of Pavia, Pavia, Italy.
- Diagnostic Imaging Unit, Department of Clinical, Surgical, Diagnostic, and Pediatric Sciences, University of Pavia, Pavia, Italy.
- Radiology Institute, IRCCS Polyclinic San Matteo, Pavia, Italy.
- Clinical Department, (CNAO) National Center for Oncological Hadrontherapy, Pavia, Italy.
- Siemens Healthcare srl, Milan, Italy.
- Siemens Healthineers AG, Erlangen, Germany.
- (INFN) National Institute for Nuclear Physics, Unit of Milan, Milan, Italy.
- (USI) University of Lugano, Lugano, Switzerland.
- Department of Internal Medicine and Medical Therapeutics, University of Pavia, Pavia, Italy.
- Unit of Respiratory Disease, Cardiothoracic and Vascular Department, IRCCS Polyclinic San Matteo, Pavia, Italy.
Abstract
<i>Objective.</i>Lung cancer remains the leading cause of cancer-related mortality worldwide, with Non-Small Cell Lung Cancer (NSCLC) accounting for approximately 85% of all cases. Programmed cell Death Ligand-1 (PD-L1) is a well-established biomarker that guides immunotherapy in advanced-stage NSCLC, currently evaluated via invasive biopsy procedures. This study aims to develop and validate a non-invasive pipeline for stratifying PD-L1 expression using quantitative analysis of IVIM parameter maps-diffusion (D), pseudo-diffusion (D*), perfusion fraction (pf)-and T1-VIBE MRI acquisitions.<i>Approach.</i>MRI data from 43 NSCLC patients were analysed and labelled as PD-L1 positive (≥1%) or negative (<1%) based on immunohistochemistry exam. After pre-processing, 1,171 radiomic features and 512 deep learning features were obtained. Three feature sets (radiomic, deep learning, and fusion) were tested with Logistic Regression, Random Forest, and XGBoost. Four discriminative features were selected using the Mann-Whitney U-test, and model performance was primarily assessed using the area under the receiver operating characteristic curve (AUC). Robustness was ensured through repeated stratified 5-fold cross-validation, bootstrap-derived confidence intervals, and permutation test.<i>Main Results.</i>Logistic Regression generally demonstrated the highest classification performance, with AUC values ranging from 0.78 to 0.92 across all feature sets. Fusion models outperformed or matched the performance of the best standalone radiomics or deep learning model. Among multisequence MRI, the IVIM-D fusion features yielded the best performance with an AUC of 0.92, followed by IVIM-D* radiomic features that showed a similar AUC of 0.91. For IVIM-pf and T1-VIBE derived features, the fusion model yielded the best AUC values of 0.87 and 0.90, respectively.<i>Significance.</i>The obtained results highlight the potential of a combined radiomic-deep learning approach to effectively detect PD-L1 expression from MRI acquisitions, paving the way for a non-invasive PD-L1 evaluation procedure.