Back to all papers

Assessing the performance of deep learning and hand-crafted radiomics models using MRI and ultrasound in predicting axillary lymph node status in breast cancer: a systematic review and meta-analysis.

June 19, 2026pubmed logopapers

Authors

Kiani I,Pourakbar N,Azizpour AM,Mousavi T,Mohammadzadeh S,Pourghazi F,Khosravi A

Affiliations (4)

  • Students' Scientific Research Center, Tehran University of Medical Sciences, Tehran, Iran.
  • Student Research Committee, Tabriz University of Medical Sciences, Tabriz, Iran.
  • Center for Orthopedic Trans-Disciplinary Applied Research, Tehran University of Medical Sciences, Tehran, Iran.
  • Department of Radiology, Mayo Clinic, Rochester, MN, USA. [email protected].

Abstract

Axillary lymph node metastasis (ALNM) is a critical prognostic factor in breast cancer. While sentinel lymph node biopsy remains the gold standard, conventional imaging relies on operator expertise with variable diagnostic accuracy. This meta-analysis evaluates the diagnostic performance of deep learning (DL) and hand-crafted radiomics (HCR) models using MRI and ultrasound (US) for ALNM prediction. Literature search was conducted across four databases up to November 2024. Studies assessing DL or HCR models for ALNM prediction using MRI or US, with histopathological confirmation as the reference standard, were included. Diagnostic accuracy metrics were pooled using bivariate random-effects meta-analysis. Heterogeneity was assessed using Higgins I², and subgroup analyses explored its potential sources. Across 41 included studies, pooled sensitivity was 0.79 (95% CI: 0.74-0.84) and specificity 0.78 (95% CI: 0.75-0.81) for internal validation, with AUC 0.84. External validation demonstrated sensitivity of 0.78 and specificity of 0.74, with an AUC of 0.82. Likelihood ratio analysis (LR+ 3.0, LR- 0.33) indicated limited standalone clinical utility. Ensemble approaches combining DL and HCR showed higher diagnostic performance (AUC = 0.88 in MRI and AUC = 0.92 in US) compared to individual methods. Models incorporating both intratumoral and peritumoral regions yielded higher AUCs (0.81 vs 0.75) than intratumoral alone. AI models demonstrate moderate diagnostic accuracy but limited standalone clinical utility. These tools may serve adjunctive roles in risk stratification and treatment planning. Ensemble approaches combining DL and HCR achieve superior performance. Methodological standardization and validation across diverse populations are essential before clinical implementation. Question Can artificial intelligence models applied to US and MRI accurately predict axillary lymph-node metastasis in breast cancer? Findings Across 41 studies, AI models demonstrated good diagnostic performance for predicting nodal metastasis. DL, HCR, and combined approaches each achieved clinically meaningful accuracy, with integrated DL + HCR models showing the highest pooled diagnostic effect and lowest heterogeneity. Clinical relevance AI-enhanced imaging can assist in non-invasively stratifying nodal status and may reduce reliance on invasive procedures such as sentinel lymph-node biopsy. Consistent imaging protocols, standardized model development, and external validation are needed before clinical adoption.

Topics

Journal Article

Ready to Sharpen Your Edge?

Subscribe to join 11k+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.