Spec2VolCAMU-Net: a spectrogram-to-volume model for EEG-to-fMRI reconstruction based on multi-directional time-frequency convolutional attention encoder and Vision-Mamba U-Net.

October 21, 2025


DOI: 10.1088/1741-2552/ae15bf PMID: 41119961

Authors

He D, Li S, Jiang B, Yan H

Affiliations (2)

Chongqing University of Technology, No. 459 Pufu Avenue, Liangjiang New District, Chongqing, Chongqing, 400050, CHINA.
College of Artificial Intelligence, Chongqing University of Technology, No. 459 Pufu Avenue, Liangjiang New District, Chongqing, Chongqing, 401135, CHINA.

Abstract

High-resolution functional magnetic resonance imaging (fMRI) is essential for mapping human brain activity; however, it remains costly and logistically challenging. If comparable volumes could be generated directly from widely available scalp electroencephalography (EEG), advanced neuroimaging would become significantly more accessible. Existing EEG-to-fMRI generators rely either on plain convolutional neural networks (CNNs) that fail to capture cross-channel time-frequency cues or on heavy transformer or generative adversarial network (GAN) decoders that strain memory and training stability. To address these limitations, we propose Spec2VolCAMU-Net, a lightweight architecture featuring a Multi-directional Time-Frequency Convolutional Attention Encoder for rich feature extraction and a Vision-Mamba U-Net decoder that uses linear-time state-space blocks for efficient long-range spatial modelling. We frame the goal of this work as establishing a new state of the art in the spatial fidelity of single-volume reconstruction, a foundational prerequisite for the ultimate aim of generating temporally coherent fMRI time series. Trained end-to-end with a hybrid SSI-MSE loss, Spec2VolCAMU-Net achieves state-of-the-art fidelity on three public benchmarks, recording Structural Similarity Index (SSIM) scores of 0.693 on NODDI, 0.725 on Oddball, and 0.788 on CN-EPFL, improvements of 14.5%, 14.9%, and 16.9%, respectively, over the previous best SSIM scores. It also achieves competitive Peak Signal-to-Noise Ratio (PSNR) scores, most notably on the CN-EPFL dataset, where it improves on the previous best PSNR by 4.6%, thus striking a better balance in reconstruction quality. The proposed model is lightweight and efficient, making it suitable for real-time applications in clinical and research settings. The code is available at https://github.com/hdy6438/Spec2VolCAMU-Net.
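The abstract reports SSIM and PSNR and mentions a hybrid loss without giving its form. The sketch below is only an illustration of how such quantities are commonly computed, not the authors' implementation. It assumes that the hybrid loss combines a structural-similarity term with mean squared error, that voxel intensities are scaled to [0, 1], and that SSIM is evaluated once over the whole flattened volume rather than with the usual sliding local window. The function names and the weight `alpha` are hypothetical.

```python
import math


def mse(x, y):
    """Mean squared error between two equally sized flat sequences."""
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((a - b) ** 2 for a, b in zip(x, y)) / len(x)


def psnr(x, y, data_range=1.0):
    """Peak signal-to-noise ratio in dB; infinite for identical inputs."""
    err = mse(x, y)
    if err == 0:
        return math.inf
    return 10.0 * math.log10(data_range ** 2 / err)


def global_ssim(x, y, data_range=1.0, k1=0.01, k2=0.03):
    """SSIM over the whole volume as a single window (a simplification)."""
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(x)
    mu_x = sum(x) / n
    mu_y = sum(y) / n
    var_x = sum((a - mu_x) ** 2 for a in x) / n
    var_y = sum((b - mu_y) ** 2 for b in y) / n
    cov = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / n
    # Stabilising constants from the standard SSIM definition.
    c1 = (k1 * data_range) ** 2
    c2 = (k2 * data_range) ** 2
    numerator = (2 * mu_x * mu_y + c1) * (2 * cov + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


def hybrid_loss(pred, target, alpha=0.5, data_range=1.0):
    """Assumed form: alpha * (1 - SSIM) + (1 - alpha) * MSE.

    alpha is a hypothetical weight; the abstract does not state one.
    """
    structural = 1.0 - global_ssim(pred, target, data_range)
    return alpha * structural + (1.0 - alpha) * mse(pred, target)


if __name__ == "__main__":
    target = [0.1, 0.4, 0.6, 0.9]   # toy flattened "volume"
    pred = [0.2, 0.4, 0.5, 0.8]
    print("PSNR (dB):", psnr(pred, target))
    print("SSIM:", global_ssim(pred, target))
    print("hybrid loss:", hybrid_loss(pred, target))
```

A perfect reconstruction gives an SSIM of 1 and a loss of 0 under this formulation. Published SSIM figures are normally averaged over local windows, so values from this single-window version are not directly comparable to the numbers quoted in the abstract.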


Topics

Journal Article
