Back to all papers

Machine learning and deep learning approaches in MRI for quantifying and staging fatty liver disease: A systematic review.

Authors

Elhaie M,Koozari A,Koohi H,Alqurain QT

Affiliations (4)

  • Department of Medical Physics, School of Medicine, Isfahan University of Medical Sciences, Isfahan, Iran. Electronic address: [email protected].
  • Department of Medical Physics, School of Medicine, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran.
  • Department of Medical Physics, School of Medicine, Isfahan University of Medical Sciences, Isfahan, Iran.
  • Department of Cardiovascular Magnetic Resonance Imaging, College of Applied Medical Sciences, University of Hail, Hai'l, Saudi Arabia.

Abstract

Fatty liver disease, encompassing non-alcoholic fatty liver disease (NAFLD) and alcohol-related liver disease (ALD), affects ∼25% of adults globally. Magnetic resonance imaging (MRI), particularly proton density fat fraction (PDFF), is the non-invasive gold standard for hepatic steatosis quantification, but its clinical use is limited by cost, protocol variability, a analysis time. Machine learning (ML) and deep learning (DL) techniques, including convolutional neural networks (CNNs) and generative adversarial networks (GANs), show promise in enhancing MRI-based quantification and staging. To systematically review the diagnostic accuracy, reproducibility, and clinical utility of ML and DL techniques applied to MRI for quantifying and staging hepatic steatosis in fatty liver disease. This systematic review was registered in PROSPERO (CRD420251121056) and adhered to PRISMA guidelines, searching PubMed, Cochrane Library, Scopus, IEEE Xplore, Web of Science, Google Scholar, and grey literature for studies on ML/DL applications in MRI for fatty liver disease. Eligible studies involved human participants with suspected/confirmed NAFLD, NASH, or ALD, using ML/DL (e.g., CNNs, GANs) on MRI data (e.g., PDFF, Dixon MRI). Outcomes included diagnostic accuracy (sensitivity, specificity, area under the curve (AUC)), reproducibility (intraclass correlation coefficient (ICC), Dice), and clinical utility (e.g., treatment planning). Two reviewers screened studies, extracted data, and assessed risk of bias using QUADAS-2. Narrative synthesis and meta-analysis (where feasible) were conducted. From 2550 records, 15 studies (n = 25-1038) were included, using CNNs, GANs, radiomics, and dictionary learning on PDFF, chemical shift-encoded MRI, or Dixon MRI. Diagnostic accuracy was high (AUC 0.85-0.97, r = 0.94-0.99 vs. biopsy/MRS), with reproducibility metrics robust (ICC 0.94-0.99, Dice 0.87-0.94). Efficiency improved significantly (e.g., processing <0.16 s/slice, scan time <1 min). Clinical utility included virtual biopsies, surgical planning, and treatment monitoring. Limitations included small sample sizes, single-center designs, and vendor variability. ML and DL enhance MRI-based hepatic steatosis assessment, offering high accuracy, reproducibility, and efficiency. CNNs excel in segmentation/PDFF quantification, while GANs and radiomics aid free-breathing MRI and NASH staging. Multi-center studies and standardization are needed for clinical integration.

Topics

Journal ArticleReview

Ready to Sharpen Your Edge?

Join hundreds of your peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.