Back to all papers

MS-STFNN: A multi-scale spatio-temporal fusion neural network for fMRI-based depression diagnosis.

February 9, 2026pubmed logopapers

Authors

Zhou M,Wang M,Mi R,Niu Y,Cui X,Gao R,Li Y,Wen X,Xiang J

Affiliations (5)

  • School of Software, Taiyuan University of Technology, Taiyuan, 030024, China; Research Center for Medical Artificial Intelligence, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China.
  • College of Computer Science and Technology (College of Data Science), Taiyuan University of Technology, Taiyuan, 030024, China; College of Artificial Intelligence, Taiyuan University of Technology, Taiyuan, 030024, China.
  • College of Computer Science and Technology (College of Data Science), Taiyuan University of Technology, Taiyuan, 030024, China.
  • School of Software, Taiyuan University of Technology, Taiyuan, 030024, China. Electronic address: [email protected].
  • College of Computer Science and Technology (College of Data Science), Taiyuan University of Technology, Taiyuan, 030024, China. Electronic address: [email protected].

Abstract

Major depressive disorder (MDD) is a common mental disorder. Current clinical diagnosis primarily relies on subjective scales, underscoring an urgent need to develop objective neuroimaging diagnostic methods. Brain network analysis and deep learning techniques based on functional magnetic resonance imaging (fMRI) have been proven effective in uncovering abnormal spatial patterns associated with depression. However, the temporal dynamics of brain activity are also equally critical for understanding the neural mechanisms of MDD. Despite this importance, existing studies still suffer from insufficient characterization of spatio-temporal information. This study proposes a novel multi-scale spatio-temporal fusion neural network (MS-STFNN) that captures multi-granularity spatial features of the brain from local to global levels, while incorporating dynamic functional connectivity (DFC) and raw fMRI sequences to characterize time-varying properties at multiple resolution levels. Finally, effective classification of different depression subtypes is achieved through multi-scale feature fusion (MSFF). The results show that the proposed model outperforms baseline models across multiple subtype classification tasks. Ablation study further demonstrates the effectiveness of multi-granularity spatial feature extraction, multi-resolution temporal representation, and the spatio-temporal fusion strategy. This study provides a reliable framework for the objective diagnosis of depression, addressing the limitations of traditional subjective assessments, while also offering new insights into the efficient utilization of spatio-temporal features in brain network analysis.

Topics

Journal Article

Ready to Sharpen Your Edge?

Subscribe to join 11k+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.