Back to all papers

Interpretable machine learning for preoperative prediction of ground-glass nodules invasiveness in lung adenocarcinoma: a multicenter study.

June 24, 2026pubmed logopapers

Authors

Wu Z,Li L,Shi D,Xiao Y

Affiliations (4)

  • Department of Radiology, Shanghang County Hospital, 364200, Shanghang, China.
  • Department of Radiology, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, 310009, China.
  • Department of Radiology, The People's Hospital of PingYang, Pingyang, 325401, China.
  • Department of Radiology, The People's Hospital of PingYang, Pingyang, 325401, China. [email protected].

Abstract

Accurate, noninvasive prediction of invasiveness in ground-glass nodules (GGNs) is important for surgical planning in lung adenocarcinoma. This multicenter study developed and validated an interpretable machine learning (ML) model based on quantitative and qualitative computed tomography (CT) features. We retrospectively enrolled 860 patients with 1009 GGNs from three centers. Center 1 cases were split into training (n = 609) and internal validation (n = 261) cohorts, while Centers 2 and 3 formed an external validation cohort (n = 139). Feature selection was performed using least absolute shrinkage and selection operator (LASSO) regression, and eight ML models were trained with cross-validation. Additional analyses compared the Random Forest (RF) model with consolidation-to-tumor ratio (CTR), assessed pure GGNs (pGGNs), removed visually evident invasive signs, and evaluated overfitting using out-of-bag and bootstrap optimism-corrected estimates. The RF model achieved apparent training, internal validation, and external validation AUCs of 0.955, 0.874, and 0.873, respectively; the RF out-of-bag AUC was 0.835. CTR alone showed lower discrimination than RF in internal and external validation cohorts. Adding CTR to RF did not significantly improve AUC. In pGGN-only analysis, RF achieved AUCs of 0.792 and 0.872 in internal and external validation cohorts, respectively. Removing spiculation and internal vascular signs reduced performance but retained acceptable external discrimination. The RF model showed stable validation performance and interpretable feature contributions for preoperative assessment of GGN invasiveness. Its use should be considered adjunctive, and prospective validation remains necessary before clinical implementation.

Topics

Journal Article

Ready to Sharpen Your Edge?

Subscribe to join 11k+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.