Back to all papers

Diagnostic performance of artificial intelligence for identification of cervical spine fractures: a systematic review and meta-analysis.

May 2, 2026pubmed logopapers

Authors

Gholamrezanezhad A,Farrokhi M,Almasri S,Askari H,Askari A,Mirjafarifiroozabadi SA,Nochian MA,Kosaraju E

Affiliations (8)

  • Department of Radiology, Keck School of Medicine, University of Southern California (USC), Los Angeles, CA, 90089, USA.
  • Department of Epidemiology, School of Public Health and Safety, Shahid Beheshti University of Medical Sciences, Tehran, Iran. [email protected].
  • School of Medicine, Royal College of Surgeons in Ireland, University of Medicine and Health Science, Dublin, Ireland.
  • Paul L. Foster School of Medicine, Texas Tech University Health Sciences Center El Paso, El Paso, TX, 79905, USA.
  • McGovern Medical School, The University of Texas Health Science Center at Houston (UTHealth Houston), Houston, TX, 77030, USA.
  • College of Osteopathic Medicine of the Pacific, Western University of Health Sciences, Pomona, CA, USA.
  • Student Research Committee, Faculty of Nursing and Midwifery, Semnan University of Medical Sciences, Semnan, Iran.
  • Solon High School, Solon, OH, USA.

Abstract

This meta-analysis aimed to evaluate how effectively artificial intelligence (AI) models can diagnose cervical spine fractures. A systematic search from inception to July 2025 in databases of Medline, Scopus, and Web of Science was performed to retrieve studies applying AI models for diagnosis of cervical spine fractures. Deduplication, study screening, and full-text evaluations were performed using EndNote by two independent authors. Statistical analyses were conducted using MIDAS and Metaprop packages in Stata and Meta-Disc software. Sixteen studies reporting 69 AI models were analyzed. The estimated sensitivity was 0.93 (95% CI, 0.92-0.94) and specificity was found to be 0.97 (95% CI, 0.97-0.97). The combined positive and negative likelihood ratios were 26.34 (95% CI, 15.93-43.56) and 0.07 (95% CI, 0.03-0.16), respectively. The overall accuracy was 0.96 (95% CI, 0.95-0.98), with summary receiver operating characteristic curve (AUROC) suggesting of 0.99 area under curve. Our meta-analysis showed that AI models demonstrate high diagnostic accuracy (0.96) for the early identification of cervical spine fractures. While our analysis suggests that AI models may achieve similar diagnostic performance using either CT or radiography, this finding should be interpreted with caution due to the limited number of X-ray studies and the high degree of heterogeneity. Given the scarcity of independent peer-reviewed validation studies and the variability in performance across AI models and study settings, the generalizability of these findings across different patient populations and clinical environments remains uncertain. Further research is required to confirm these results and account for study design variations.

Topics

Journal ArticleReview

Ready to Sharpen Your Edge?

Subscribe to join 11k+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.