Back to all papers

Multicenter evaluation of three-shot SAM2 segmentation for group-level quantification of lumbar paraspinal muscles at the L4/L5 level on multi-sequence MRI.

April 27, 2026pubmed logopapers

Authors

Zhang Z,Hides J,De Martino E,Tuxworth G

Affiliations (4)

  • School of Information and Communication Technology, Griffith University, Nathan, QLD 4111, Australia. Electronic address: [email protected].
  • School of Allied Health, Sport and Social Work, Griffith University, Nathan, QLD 4111, Australia.
  • Center for Neuroplasticity and Pain (CNAP), Department of Health Science and Technology, Aalborg University, Gistrup, 9260 North Jutland, Denmark.
  • School of Information and Communication Technology, Griffith University, Nathan, QLD 4111, Australia.

Abstract

To evaluate a data-efficient segmentation approach for quantifying lumbar paraspinal muscle (LPM) at the single-slice L4/L5 level across multi-sequence MRI, minimizing annotation burden while achieving reliable group-level quantification. A total of 943 MRI scans from 486 participants were retrospectively analyzed, with a single axial slice at the L4/L5 disc level selected from each scan. The images included T1-weighted, T2-weighted, and Dixon MRI. A training-free Segment Anything Model 2 (SAM2) was applied for dataset-level LPM segmentation. For each dataset, one, two, or three ground-truth masks were used as prompt labels (4, 8, and 12 images in total across all four datasets), with the remaining images used for testing (939, 935, and 931 images, respectively). Segmentation accuracy was evaluated using the Dice similarity coefficient (DSC). Automated quantifications were assessed using two one-sided tests (TOST), intraclass correlation coefficients (ICC), and Bland-Altman analysis. With three-shot prompting, SAM2 achieved the highest segmentation accuracy for the left and right lumbar erector spinae and multifidus muscles (DSC: 0.871 ± 0.128 to 0.944 ± 0.063). Automated measurements were statistically equivalent to manual references within a 5% margin for bilateral muscle area and a 10% margin for bilateral fat ratio, except for the fat ratio of left erector spinae, which required a 14.14% margin. ICCs ranged from 0.822 to 0.856 for muscle area and 0.771-0.927 for fat ratio. L4/L5 slice-level LPM segmentation and quantification are achieved using only three labeled examples per dataset, with multicenter evaluation and publicly available code and data.

Topics

Journal ArticleReview

Ready to Sharpen Your Edge?

Subscribe to join 11k+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.