Back to all papers

Evaluating AI diagnostic accuracy in approximal dental caries detection on bitewing radiographs.

April 29, 2026pubmed logopapers

Authors

Giannakopoulos K,Kavadella A,Paraskevis D,Arhakis A,Makrygiannakis MA,Kaklamanos EG

Affiliations (6)

  • School of Dentistry, European University Cyprus, 6 Diogenous str, Nicosia, 2404, Cyprus. [email protected].
  • School of Dentistry, European University Cyprus, 6 Diogenous str, Nicosia, 2404, Cyprus.
  • Medical School, National and Kapodistrian University of Athens, 75 Mikras Asias str, Athens, 11527, Greece.
  • School of Dentistry, Aristotle University of Thessaloniki, Aristotle University of Thessaloniki Campus, Thessaloniki, 54124, Greece.
  • School of Dentistry, National and Kapodistrian University of Athens, 2 Thivon str, Athens, 11527, Greece.
  • Hamdan Bin Mohammed College of Dental Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, P.O. Box: 505055, Dubai Healthcare City, Dubai, United Arab Emirates.

Abstract

To evaluate the diagnostic accuracy of the Diagnocat™ artificial intelligence (AI) system for caries detection on bitewing radiographs compared with expert human examiners, with emphasis on differences between enamel and dentin lesions. A sample of 100 digital bitewing radiographs (1540 surfaces) was retrospectively selected from the European University Cyprus dental clinic database using a systematic backward screening method. Radiographs were obtained with a standardized phosphor plate system and anonymized before analysis. Two independent experts (operative dentistry and oral radiology) established the reference standard. AI and human assessments were binarized (caries/no caries; enamel/dentin) and compared. Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and accuracy were calculated, and statistical significance was tested across detection categories. Diagnocat™ showed high specificity (94.3%, 95% CI: 92.4%–96.0%) and NPV (96.1%, 95% CI: 94.7%–97.3%), with an overall accuracy of 91.6% (95% CI: 89.7%–93.4%). Sensitivity was moderate (73.1%, 95% CI: 65.9%–79.9%), and PPV was 64.7% (95% CI: 57.7%–71.5%). Agreement with the expert consensus was substantial (Cohen’s κ = 0.638). For enamel lesions, sensitivity and specificity were 73.3% (95% CI: 62.8%–82.7%) and 92.9% (95% CI: 91.0%–94.7%) with moderate agreement with the consensus (Cohen’s κ = 0.492) and for dentin lesions they were 72.8% (95% CI: 61.8%–83.8%) and 92.8% (95% CI 90.9%–94.6%) with moderate agreement with the consensus (Cohen’s κ = 0.468). NPV remained high (≥ 98.0%), while PPV was low (42.0% and 39.2%), across lesion types. Detection patterns differed significantly between AI and the reference standard (<i>p</i> < 0.001). Diagnocat™ demonstrated good diagnostic performance in ruling out caries. However, its overall lower sensitivity emphasizes the need for clinician oversight, especially in detecting early-stage disease. This study offers an independent validation of Diagnocat™ using bitewing radiographs. It demonstrates lesion-depth–specific insights, showing that while AI is highly reliable for excluding disease, its predictive value remains limited. The online version contains supplementary material available at 10.1007/s00784-026-06882-z.

Topics

Journal Article

Ready to Sharpen Your Edge?

Subscribe to join 11k+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.