Back to all papers

Improving confidence in MRI-based auto-segmentation via uncertainty assessment.

May 11, 2026pubmed logopapers

Authors

Kallehauge JF,Ren J,Lassen-Ramshad Y

Affiliations (3)

  • Danish Centre for Particle Therapy, Aarhus University Hospital, Aarhus, Denmark; Department of Clinical Medicine, Aarhus University, Aarhus, Denmark. [email protected].
  • Danish Centre for Particle Therapy, Aarhus University Hospital, Aarhus, Denmark; Department of Clinical Medicine, Aarhus University, Aarhus, Denmark.
  • Danish Centre for Particle Therapy, Aarhus University Hospital, Aarhus, Denmark.

Abstract

Accurate delineation of organs of interest (OOIs, also commonly referred to as organs at risk, OARs) is crucial for safe radiotherapy. While deep learning-based segmentation using convolutional neural networks has achieved high geometric accuracy, clinical translation is hindered by overconfident, uncalibrated predictions in anatomically ambiguous regions. Uncertainty quantification and model calibration are prerequisites for safe clinical workflows. This study compared the standard nnU-Netv2 against its residual-encoding variant (ResEncM), hypothesizing that ResEncM would demonstrate superior reliability and calibration while maintaining comparable geometric accuracy. Patient/material and methods: T1-weighted contrast-enhanced MRI scans from 70 brain cancer patients were used (55 training/validation, 15 testing). Ground-truth contours for brainstem, hippocampi, chiasm, optic nerves, optic tracts, and pituitary were delineated per Danish Neuro Oncology Group guidelines. Both architectures were trained using five-fold cross-validation with identical preprocessing. Epistemic uncertainty was quantified using mutual information, and Expected Calibration Error (ECE) was computed within a 10-mm isotropic margin around reference contours. Both models achieved high geometric accuracy (brainstem dice similarity coefficient [DSC] > 0.93, hippocampi DSC > 0.81). No significant geometric differences were found for large structures. ResEncM showed significantly lower DSC for the pituitary (p = 0.003) and chiasm (p = 0.018). However, ResEncM demonstrated significantly lower epistemic uncertainty and ensemble variance across all structures (p < 0.05), and significantly reduced ECE for the optic chiasm, optic tracts, and pituitary. Integrating a deep residual encoder into the standard U-Net framework significantly improves reliability and calibration of automated brain OOI contours while maintaining strong geometric performance. The ResEncM architecture provides a more trustworthy tool for clinical radiotherapy by reliably flagging high-uncertainty voxels, supporting confidence-aware clinical workflows.

Topics

Magnetic Resonance ImagingBrain NeoplasmsOrgans at RiskRadiotherapy Planning, Computer-AssistedImage Processing, Computer-AssistedJournal Article

Ready to Sharpen Your Edge?

Subscribe to join 11k+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.