Back to all papers

Large Language Models for Cholesteatoma Diagnosis: A Pathology-Validated Study.

June 2, 2026pubmed logopapers

Authors

Yenigun A,Kucuk RB,Yildiz C,Yurtsever I,Senturk E,Dogan R,Eren SB,Aksoy F,Ozturan O

Affiliations (2)

  • Department of Otorhinolaryngology, Faculty of Medicine, Bezmialem Vakif University.
  • Department of Radiology, Faculty of Medicine, Bezmialem Vakif University, Istanbul, Turkey.

Abstract

To evaluate the diagnostic performance of the large language model (LLM) Gemini 2.5 for cholesteatoma detection using histopathology as the reference standard, and to compare its performance with that of routine radiologic assessment. A secondary objective was to assess the model's consistency across clinically relevant subgroups and its potential utility as a complementary, image-centered decision-support tool for otologic practice. In this retrospective diagnostic accuracy study, 244 patients who underwent temporal bone MRI, including non-echo-planar diffusion-weighted imaging, between 2017 and 2025 were analyzed. MRI data sets were converted into multislice video files and evaluated by an LLM without fine-tuning or task-specific training. Diagnostic performance metrics were calculated using histopathology as the gold standard and compared using the McNemar test. Subgroup analyses were performed for primary versus recurrent cholesteatoma and according to mastoidectomy history. Histopathology confirmed cholesteatoma in 121 patients. Overall diagnostic accuracy was 79.1% for the LLM and 84.0% for radiology. Sensitivity did not differ significantly between the 2 approaches, whereas specificity was higher for radiologic assessment. Diagnostic performance remained stable across primary and recurrent disease and in postoperative anatomy. The model also identified an incidental intracranial cystic lesion outside the temporal bone. The large language model (LLM) demonstrated diagnostic performance comparable to routine radiologic interpretation and provided rapid, image-centered analysis without the need for training or specialized infrastructure. These findings suggest that LLM-based systems may serve as practical complementary tools in cholesteatoma evaluation, particularly as an adjunct to radiologic assessment and preoperative decision-making in otologic practice. Level III.

Topics

Journal Article

Ready to Sharpen Your Edge?

Subscribe to join 11k+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.