Frequency disentanglement with State space gating network for medical image segmentation.
Authors
Affiliations (6)
- Center for Medical Artificial Intelligence, Shandong University of Traditional Chinese Medicine, Qingdao Traditional Chinese Medicine Inheritance and Innovation Base, East Side of Fenghe Road, Qingdao High-tech Industrial Development Zone, Qingdao, 266112, Shandong, China.
- Qingdao Academy of Chinese Medical Sciences, Shandong University of Traditional Chinese Medicine, East Side of Fenghe Road, Qingdao High-tech Industrial Development Zone, Qingdao, 266112, Shandong, China.
- Center for Medical Artificial Intelligence, Shandong University of Traditional Chinese Medicine, Qingdao Traditional Chinese Medicine Inheritance and Innovation Base, East Side of Fenghe Road, Qingdao High-tech Industrial Development Zone, Qingdao, 266112, Shandong, China. [email protected].
- Qingdao Academy of Chinese Medical Sciences, Shandong University of Traditional Chinese Medicine, East Side of Fenghe Road, Qingdao High-tech Industrial Development Zone, Qingdao, 266112, Shandong, China. [email protected].
- Center for Medical Artificial Intelligence, Shandong University of Traditional Chinese Medicine, Qingdao Traditional Chinese Medicine Inheritance and Innovation Base, East Side of Fenghe Road, Qingdao High-tech Industrial Development Zone, Qingdao, 266112, Shandong, China. [email protected].
- Qingdao Academy of Chinese Medical Sciences, Shandong University of Traditional Chinese Medicine, East Side of Fenghe Road, Qingdao High-tech Industrial Development Zone, Qingdao, 266112, Shandong, China. [email protected].
Abstract
Precise automated segmentation of anatomical structures is a prerequisite for computer-aided diagnosis, radiotherapy planning, and quantitative medical analysis. However, existing models, whether based on convolutional neural networks (CNNs) or transformer architectures, focus primarily on extracting and processing spatial features. This focus leads to spectral feature entanglement, in which low-frequency global structures, mid-frequency contours, and high-frequency textures are mixed indiscriminately. The entanglement degrades segmentation accuracy, particularly at the object boundaries that are critical for clinical delineation. To address this, we introduce FD-SSGNet, a framework that performs frequency disentanglement with state-space gating. Our model first employs the Fast Fourier Transform (FFT) to explicitly decompose feature maps into low-, mid-, and high-frequency components. It then uses the Shift Bidirectional Selective Gate Mamba (SBSGM), whose parallel, heterogeneously configured pathways model the long-range dependencies specific to each frequency band. Finally, a dynamic fusion module adaptively reintegrates the processed multi-band features to produce a refined segmentation map. Extensive experiments on the challenging BTCV multi-organ and ACDC cardiac segmentation datasets demonstrate that FD-SSGNet achieves new state-of-the-art performance, validating the benefits of explicit frequency-domain modeling for robust and accurate medical image analysis in clinical workflows. Our implementation is available at https://github.com/singinghz/FD-SSGNet.
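The FFT-based decomposition step described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `split_frequency_bands`, the use of hard radial masks, and the cutoff values `low_cut` and `high_cut` are all assumptions made for illustration, and the abstract does not state how FD-SSGNet defines its band boundaries.

```python
import numpy as np


def split_frequency_bands(feature, low_cut=0.1, high_cut=0.3):
    """Split a 2D map of shape (H, W) into low-, mid-, and high-frequency parts.

    low_cut and high_cut are hypothetical radial cutoffs in cycles per sample;
    they are illustrative values, not taken from the paper.
    """
    h, w = feature.shape
    spectrum = np.fft.fft2(feature)

    # Radial frequency of every FFT bin, in cycles per sample.
    fy = np.fft.fftfreq(h)[:, None]
    fx = np.fft.fftfreq(w)[None, :]
    radius = np.sqrt(fx**2 + fy**2)

    # Three disjoint masks that together cover the whole spectrum.
    masks = (
        radius < low_cut,
        (radius >= low_cut) & (radius < high_cut),
        radius >= high_cut,
    )

    # Each mask depends only on |frequency|, so a real input gives real bands
    # (up to rounding); .real drops the negligible imaginary residue.
    return tuple(np.fft.ifft2(spectrum * mask).real for mask in masks)


# Usage on a random stand-in for a single-channel feature map.
rng = np.random.default_rng(0)
x = rng.standard_normal((32, 32))
low, mid, high = split_frequency_bands(x)
```

Because the three masks partition the spectrum, the bands sum back to the input, so the split itself loses no information. In the framework described above, each band would then be processed by its own pathway before fusion; a learned or soft mask could replace the hard cutoffs used here.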