Back to all papers

MTA-Swin: A Multi-Token Attention Swin Transformer for Brain Tumor Classification with Leakage-Free MRI Benchmarking.

June 22, 2026pubmed logopapers

Authors

Lu D,Zhang Y,Chaudhary D

Affiliations (2)

  • College of Engineering, Northeastern University, 401 Terry Ave N, Seattle, WA, 98109, United States.
  • Khoury College of Computer Sciences, Northeastern University, 401 Terry Ave N, Seattle, WA, 98109, United States. [email protected].

Abstract

Brain tumors represent a major global health challenge, and accurate classification of brain tumors is essential for effective diagnosis and treatment. Magnetic resonance imaging (MRI) is the most commonly used and reliable modality in early brain tumor detection, and numerous studies have leveraged MRI datasets to train deep learning models for classification. However, many widely adopted brain tumor MRI datasets suffer from duplicate-induced data leakage, which can lead to artificially inflated performance metrics and unreliable model evaluation. In this study, we systematically analyze this issue and develop an automated data cleaning pipeline capable of identifying and removing duplicate scans. By applying this pipeline to a widely used public dataset, we obtain a leakage-free benchmark dataset containing 3,522 unique MRI scans, which is used for all comparative experiments. Additionally, we propose MTA-Swin, an enhanced Swin Transformer that incorporates Multi-Token Attention by re-designing the attention computation within Swin blocks. The design refines attention logits to enrich local context and enables explicit cross-head information exchange in deeper layers, while preserving Swin's hierarchical stages and windowing scheme. MTA-Swin is first pre-trained on ImageNet-1K, and then fine-tuned on our leakage-free dataset. Experimental results show that MTA-Swin reaches an overall accuracy of 98.57% over three random seeds, outperforming thirteen representative baselines. Additional stratified cross-validation and Grad-CAM analyses further support the robustness and interpretability of the proposed model. These results indicate that MTA-Swin can serve as a practical computer-aided diagnostic support model for brain tumor MRI classification.

Topics

Brain NeoplasmsMagnetic Resonance ImagingDeep LearningJournal Article

Ready to Sharpen Your Edge?

Subscribe to join 11k+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.