MsM-DPM: Multiscale Mamba Diffusion Probabilistic Model for Medical Image Segmentation.
Authors
Abstract
Diffusion probabilistic models (DPMs) have recently demonstrated promising performance in medical image segmentation. However, traditional DPM has difficulty handling the irregular structure of images and the inherent similarity between lesions and surrounding tissues. To overcome these challenges, we propose an innovative architecture, the multiscale Mamba DPM (MsM-DPM), designed to enhance medical image segmentation. Specifically, MsM-DPM introduces a multiscale attention fusion module (MSAFM) in a multiscale denoising UNet (Ms-DU) to capture lesion deformations from multilevel features, thereby enhancing the model's robustness to shape and scale variations. Furthermore, in the segmentation network, a multilayer axial feature module (MLAFM) is used to adaptively aggregate the global context features from the Mamba encoder to enhance the expression of features in the spatial dimension by capturing axial multiscale features. The multilevel global context (MLGC) module is then used to reconstruct skip connections using graph convolutional network inference, and the enhanced features are assigned to each layer in the decoder to capture the contextual relationship of features. Finally, the feature fusion module (FFM) integrates deep features with upsampled features in the decoder, enhancing the network's ability to capture lesion boundary details. Our MsM-DPM effectively encodes the semantic difference between lesions and background to improve the representation of their internal features. Extensive experiments on six datasets, LUNA16, ATM22, COVID-19, Self-collected datasets, Pancreas, and BT-MSD, show that the proposed MsM-DPM outperforms existing segmentation methods. Our code is publicly available at https://github.com/suhuaqiang/deep-learning.