Back to all papers

Large language model and Gd-EOB-DTPA-enhanced MRI-based risk stratification system for postoperative hepatocellular carcinoma: a multicenter study.

February 23, 2026pubmed logopapers

Authors

Yu C,Zhang Q,Ding JX,Li W,Han S,Cong S,Wang X,Zhou Y

Affiliations (7)

  • Department of Radiology, Harbin Medical University Cancer Hospital, Harbin, China.
  • Department of Electronic Information, Harbin Engineering University Qingdao Innovation and Development Base, Qingdao, China.
  • Department of Radiology, Shandong Cancer Hospital and Institute, Jinan, China.
  • Department of Radiology, The Affiliated Cancer Hospital of Zhengzhou University & Henan Cancer Hospital, Zhengzhou, China.
  • Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore, Singapore. [email protected].
  • Department of Radiology, Harbin Medical University Cancer Hospital, Harbin, China. [email protected].
  • Department of Radiology, Harbin Medical University Cancer Hospital, Harbin, China. [email protected].

Abstract

To develop and validate a Fully Automated Stratification System (FASS) integrating serum biomarkers, automated radiomic features, and large language model (LLM)-derived semantic features for prognostic prediction in patients with solitary hepatocellular carcinoma (HCC) after hepatic resection. A total of 448 patients with solitary HCC from three centers were retrospectively enrolled. Automated tumor segmentation was performed using a modified MedNeXt-loss framework, and radiomic features were extracted from Gadolinium ethoxybenzyl-diethylenetriamine pentaacetic acid (Gd-EOB-DTPA)-enhanced MRI. Five LLMs were compared for feature-level accuracy and completeness, and the best-performing model was incorporated into the FASS. Prognostic models based on serum, radiomic, and LLM-semantic features were integrated and evaluated using concordance index, time-dependent ROC, and decision curve analyses. Biological relevance was explored through RNA sequencing and pathway enrichment analyses. The MedNeXt-loss framework achieved robust segmentation (Dice = 0.77). ChatGPT-4o demonstrated the best balance between predictive accuracy and completeness and was used for subsequent modeling. In multivariate analysis, AFP, AST, and the ChatGPT-4o-derived irregular margin were independent predictors of overall survival. The integrated FASS achieved high prognostic performance (C-index 0.78 and 0.76 in test and external validation cohorts) and effectively stratified patients into distinct risk groups (log-rank p < 0.05). Transcriptomic analyses revealed inflammatory and cytokine signaling activation in the high-risk group. FASS enables fully automated, interpretable, and biologically informed prognostic assessment in solitary HCC, supporting precision decision-making in hepatobiliary oncology. QuestionCan large language models improve preoperative hepatocellular carcinoma risk stratification by integrating advanced image interpretation and semantic analysis? FindingsThe system enabled fully automated analysis, identified AFP, AST and LLM-derived irregular margin as independent predictors, and effectively stratified postoperative risk across cohorts. Clinical relevanceThis fully automated, interpretable platform enables reliable postoperative risk stratification, helping identify high-risk patients early and potentially improving outcomes after resection of solitary hepatocellular carcinoma.

Topics

Journal Article

Ready to Sharpen Your Edge?

Subscribe to join 11k+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.