Back to all papers

Optimizing and Evaluating Robustness of AI for Brain Metastasis Detection and Segmentation via Loss Functions and Multi-dataset Training.

December 23, 2025pubmed logopapers

Authors

Han Y,Pathak P,Awad O,Mohamed ASR,Ugarte V,Zhou B,Hamstra DA,Echeverria AE,Mekdash HA,Siddiqui ZA,Sun B

Affiliations (1)

  • Baylor College of Medicine, 1 Baylor Plaza, Houston, 77030-3411, UNITED STATES.

Abstract

Accurate detection and segmentation of brain metastases (BM) from MRI are critical for the appropriate management of cancer patients. This study investigates strategies to enhance the robustness of artificial intelligence (AI)-based BM detection and segmentation models.&#xD;Method: A DeepMedic-based network with a loss function, tunable with a sensitivity/specificity tradeoff weighting factor \alpha- was trained on T1 post-contrast MRI datasets from two institutions (514 patients, 4520 lesions). Robustness was evaluated on an external dataset from a third institution dataset (91 patients, 397 lesions), featuring ground truth annotations from two physicians. We investigated the impact of loss function weighting factor, \alpha and training dataset combinations. Detection performance (sensitivity, precision, F1 score) and segmentation accuracy (Dice similarity, and 95% Hausdorff distance (HD95)) were evaluated using one physician's contours as the reference standard. The optimal AI model was then directly compared to the performance of the second physician.&#xD;Results: Varying α demonstrated a trade-off between sensitivity (higher α) and precision (lower α), with α=0.5 yielding the best F1 score (0.80 vs. 0.78 for α=0.95 and 0.72 for α=0.99) on the external dataset. The optimally trained model achieved detection performance comparable to the physician (F1: AI=0.83, Physician=0.83), but slightly underperformed in segmentation (Dice: 0.81 vs. AI=0.69; HD95: 3.0 mm vs. AI=4.94 mm, p<0.05).&#xD;Conclusion: The derived optimal model achieves detection and segmentation performance comparable to an expert physician in a parallel comparison. &#xD.

Topics

Journal Article

Ready to Sharpen Your Edge?

Subscribe to join 7,500+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.