Back to all papers

CT-Based Radiomics and Deep Learning for Preoperative Thyroid Nodule Classification: A Systematic Review, Meta-analysis, and Radiologist Comparison.

Authors

Broomand Lomer N,Ahmadzadeh AM,Ashoobi MA,Abdi S,Ghasemi A,Gholamrezanezhad A

Affiliations (6)

  • Department of Radiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104 (N.B.L.).
  • Department of Radiology, School of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran (A.M.A.).
  • Guilan Road Trauma Research Center, Trauma Institute, Guilan University of Medical Sciences, Rasht, Iran (M.A.A.).
  • Department of Radiology, Beth Israel Deaconess Medical Center, 330 Brookline Ave, Boston, MA 02215 (S.A.).
  • Faculty of Medicine, Guilan University of Medical Sciences, Rasht, Iran (A.G.).
  • Department of Radiology, Los Angeles General Hospital, Los Angeles, California (A.G.). Electronic address: [email protected].

Abstract

Computed tomography (CT) can evaluate thyroid cancer invasion into adjacent structures and is useful in identifying incidental thyroid nodules. Computer-aided diagnostic approaches may provide valuable clinical advantages in this domain. Here, we aim to evaluate the diagnostic performance of radiomics and deep-learning methods using CT imaging for preoperative nodule classification. A comprehensive search of PubMed, Embase, Scopus, and Web of Science was conducted from inception to June 2, 2025. Study quality was assessed using QUADAS-2 and METRICS. A bivariate meta-analysis estimated pooled sensitivity, specificity, positive and negative likelihood ratios (PLR and NLR), diagnostic odds ratio (DOR), and area under the curve (AUC). Two supplementary analyses compared AI model performance with radiologists and assessed diagnostic utility across CT imaging phases (plain, venous, arterial). Subgroup and sensitivity analyses explored sources of heterogeneity. Publication bias was evaluated using Deek's funnel plot. The meta-analysis included 12 radiomics studies (sensitivity: 0.85, specificity: 0.83, PLR: 4.60, NLR: 0.19, DOR: 30.29, AUC: 0.894) and five deep-learning studies (sensitivity: 0.87, specificity: 0.93, PLR: 14.04, NLR: 0.15, DOR: 95.76, AUC: 0.911). Radiomics models showed low heterogeneity, while deep-learning models showed substantial heterogeneity, potentially due to differences in validation, segmentation, METRICS quality, and reference standards. Overall, these models outperformed radiologists, and models using plain CT images outperformed those based on arterial or venous phases. Radiomics and deep-learning models have demonstrated promising performance in classifying thyroid nodules and may improve radiologists' accuracy in indeterminate cases, while reducing unnecessary biopsies.

Topics

Journal ArticleReview

Ready to Sharpen Your Edge?

Join hundreds of your peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.