FAD-MIL: a weakly supervised fracture detection model based on X-ray images.
Authors
Affiliations (4)
Affiliations (4)
- The First Affiliated Hospital of Xinjiang Medical University, Urumqi, China.
- Clinical Skills Training Center, Xinjiang Medical University, Urumqi, China.
- The First Affiliated Hospital of Xinjiang Medical University, Urumqi, China. [email protected].
- Department of Minimally Invasive Spine and Precision Orthopedics, The First Affiliated Hospital of Xinjiang Medical University, Urumqi, 830054, Xinjiang Uygur Autonomous Region, China. [email protected].
Abstract
Fractures are among the most common presentations to emergency and orthopedic services, yet radiograph interpretation remains inconsistent, particularly in resource-limited and off-hours settings. Weakly supervised learning offers a scalable alternative to costly pixel-level annotation but often fails to capture subtle, focal fracture cues. We propose FAD-MIL (Fracture-Aware Dual-stream Multiple-Instance Learning), which addresses these limitations through three design choices: (1) a global-local dual-stream architecture that captures both whole-image context and fine-grained tile features, (2) a fracture-aware gating mechanism that re-weights instance representations toward fracture-discriminative patterns, and (3) Top-K instance selection that focuses learning on the most informative regions. On FracAtlas (4,068 radiographs), FAD-MIL achieves an AUC of 0.833 (95% CI 0.797-0.871), average precision of 0.619, and F1 of 0.541, outperforming Mean-Pool MIL and Tile-Vote MIL and performing comparably to ABMIL while offering more interpretable instance-level attribution. Transferability was assessed on a retrospective single-center positive-only cohort (distal radius, nā=ā975; ankle, nā=ā350); because contemporaneous non-fracture controls were unavailable, recall across decision thresholds is reported as a preliminary sensitivity analysis. Gradient-based feature-attribution heatmaps provide a qualitative visualization of regions associated with fracture predictions. Future validation with matched non-fracture controls is required to further evaluate specificity and false-positive rates.