Artificial intelligence-guided distal radius fracture detection on plain radiographs in comparison with human raters.

Authors

Ramadanov N,John P,Hable R,Schreyer AG,Shabo S,Prill R,Salzmann M

Affiliations (6)

  • Center of Orthopaedics and Traumatology, Brandenburg Medical School, University Hospital Brandenburg/Havel, 14770, Brandenburg/Havel, Germany. [email protected].
  • Faculty of Health Science Brandenburg, Brandenburg Medical School Theodor Fontane, Brandenburg/Havel, Germany. [email protected].
  • Center of Orthopaedics and Traumatology, Brandenburg Medical School, University Hospital Brandenburg/Havel, 14770, Brandenburg/Havel, Germany.
  • Faculty of Health Science Brandenburg, Brandenburg Medical School Theodor Fontane, Brandenburg/Havel, Germany.
  • Faculty of Applied Computer Science, Deggendorf Institute of Technology, Deggendorf, Germany.
  • Department of Diagnostic and Interventional Radiology, University Hospital Brandenburg, Brandenburg Medical School Theodor Fontane, 14770, Brandenburg, Germany.

Abstract

The aim of this study was to compare the performance of artificial intelligence (AI) in detecting distal radius fractures (DRFs) on plain radiographs with the performance of human raters. We retrospectively analysed all wrist radiographs taken in our hospital since the introduction of AI-guided fracture detection from 11 September 2023 to 10 September 2024. The ground truth was defined by the radiological report of a board-certified radiologist based solely on conventional radiographs. The following parameters were calculated: True Positives (TP), True Negatives (TN), False Positives (FP), and False Negatives (FN), accuracy (%), Cohen's Kappa coefficient, F1 score, sensitivity (%), specificity (%), Youden Index (J Statistic). In total 1145 plain radiographs of the wrist were taken between 11 September 2023 and 10 September 2024. The mean age of the included patients was 46.6 years (± 27.3), ranging from 2 to 99 years and 59.0% were female. According to the ground truth, of the 556 anteroposterior (AP) radiographs, 225 cases (40.5%) had a DRF, and of the 589 lateral view radiographs, 240 cases (40.7%) had a DRF. The AI system showed the following results on AP radiographs: accuracy (%): 95.90; Cohen's Kappa: 0.913; F1 score: 0.947; sensitivity (%): 92.02; specificity (%): 98.45; Youden Index: 90.47. The orthopedic surgeon achieved a sensitivity of 91.5%, specificity of 97.8%, an overall accuracy of 95.1%, F1 score of 0.943, and Cohen's kappa of 0.901. These results were comparable to those of the AI model. AI-guided detection of DRF demonstrated diagnostic performance nearly identical to that of an experienced orthopedic surgeon across all key metrics. The marginal differences observed in sensitivity and specificity suggest that AI can reliably support clinical fracture assessment based solely on conventional radiographs.

Topics

Radius FracturesArtificial IntelligenceRadiographyJournal ArticleComparative Study
Get Started

Upload your X-ray image and get interpretation.

Upload now →

Disclaimer: X-ray Interpreter's AI-generated results are for informational purposes only and not a substitute for professional medical advice. Always consult a healthcare professional for medical diagnosis and treatment.