SpineMamba: Enhancing 3D spinal segmentation in clinical imaging through residual visual Mamba layers and shape priors.

Authors

Zhang Z,Liu T,Fan G,Li N,Li B,Pu Y,Feng Q,Zhou S

Affiliations (8)

  • Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China; University of Chinese Academy of Sciences, Beijing, 100049, China.
  • Institute of Unconventional Oil & Gas Research, Northeast Petroleum University, Street 15, Daqing, 163318, China.
  • College Of Information Science and Engineering, Northeastern University, Liaoning, 110819, China.
  • Department of Biomedical Engineering, Guangdong Medical University, Dongguan, 523808, China.
  • Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China.
  • Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Hong Kong, China.
  • School of Biomedical Engineering, Southern Medical University, Guangzhou, 510515, China. Electronic address: [email protected].
  • Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China. Electronic address: [email protected].

Abstract

Accurate segmentation of three-dimensional (3D) clinical medical images is critical for the diagnosis and treatment of spinal diseases. However, the complexity of spinal anatomy and the inherent uncertainties of current imaging technologies pose significant challenges for the semantic segmentation of spinal images. Although convolutional neural networks (CNNs) and Transformer-based models have achieved remarkable progress in spinal segmentation, their limitations in modeling long-range dependencies hinder further improvements in segmentation accuracy. To address these challenges, we propose a novel framework, SpineMamba, which incorporates a residual visual Mamba layer capable of effectively capturing and modeling the deep semantic features and long-range spatial dependencies in 3D spinal data. To further enhance the structural semantic understanding of the vertebrae, we also propose a novel spinal shape prior module that captures specific anatomical information about the spine from medical images, significantly enhancing the model's ability to extract structural semantic information of the vertebrae. Extensive comparative and ablation experiments across three datasets demonstrate that SpineMamba outperforms existing state-of-the-art models. On two computed tomography (CT) datasets, the average Dice similarity coefficients achieved are 94.40±4% and 88.28±3%, respectively, while on a magnetic resonance (MR) dataset, the model achieves a Dice score of 86.95±10%. Notably, SpineMamba surpasses the widely recognized nnU-Net in segmentation accuracy, with a maximum improvement of 3.63 percentage points. These results highlight the precision, robustness, and exceptional generalization capability of SpineMamba.

Topics

Imaging, Three-DimensionalSpineNeural Networks, ComputerJournal Article

Ready to Sharpen Your Edge?

Join hundreds of your peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.