Diagnostic accuracy of artificial intelligence-based deep learning models in detecting furcation involvement: A systematic review and meta-analysis.
Authors
Affiliations (5)
Affiliations (5)
- Mohammed Bin Rashid University of Medicine and Health Sciences, Hamdan Bin Mohammed College of Dental Medicine, Dubai Healthcare City, Dubai, UAE.
- Sir John Walsh Research Institute, Faculty of Dentistry, University of Otago, Dunedin, New Zealand.
- School of Dentistry, University of Jordan, Amman, Jordan.
- College of Dentistry, The Ohio State University, Columbus, Ohio, USA.
- Ajman University, College of Dentistry, Department of Restorative Dentistry, Ajman, UAE.
Abstract
Furcation involvement complicates the management of periodontitis and increases the risk of tooth loss. Conventional methods of detection, such as probing and two-dimensional radiographs, are limited by operator variability and anatomical complexity. Deep learning has shown a potential to detect furcation involvement on radiographic images. The aim of this review was to systematically evaluate the diagnostic potentials of deep learning models in detecting furcation involvement on radiographic images. Systematic search was conducted in PubMed, EMBASE, CENTRAL, ClinicalTrials.gov and ProQuest for studies published from 2010 to September 2025. Two reviewers independently screened studies, extracted data, and assessed quality using QUADAS-2. Diagnostic metrics (sensitivity, specificity, F1-score, area under the curve (AUC)) were pooled using random-effects meta-analysis. Heterogeneity and publication bias were assessed via I<sup>2</sup> statistics, meta-regression, and funnel plots. Eight studies, including 7814 radiographs of 12,373 molars (periapical, panoramic, cone-beam computed tomography), were analyzed. Deep learning models demonstrated high accuracy: sensitivity 0.93, specificity 0.94, diagnostic odds ratio (DOR) 187, AUC 0.97 with mandibular molars reflecting higher accuracy (sensitivity 0.96, specificity 0.97, DOR 631, AUC 0.99). Fagan plot analysis indicated strong clinical utility. Meta-regression showed no significant effect of dataset type, augmentation, or number of annotators. No publication bias was detected. Deep learning models show promising accuracy in detecting furcation involvement, particularly in mandibular molars, comparable to expert clinicians. Further refinement with larger, diverse datasets is needed to reduce false positives and enable safe clinical integration. Furcation involvement, a condition where the bone between the roots of a tooth is lost, makes managing gum disease more difficult and increases the risk of tooth loss. Clinical probing remains a reliable and essential method for detecting this condition, while dental X-rays can provide complementary information, particularly in complex cases. This study reviewed the use of deep learning, a type of artificial intelligence (AI), to detect furcation involvement on dental X-rays. Data from eight studies, including more than 7,800 dental radiographs of >12,000 molars, were analyzed and the results showed that deep learning models were highly accurate, performing similarly to expert dentists. Accuracy was slightly higher for lower jaw (mandibular) molars. The study suggests that these AI tools could help dentists detect furcation involvement more reliably. However, more research with larger and more diverse datasets is needed before these tools can be safely used in everyday dental practice.