Back to all papers

CAHA-Net: A novel MR image classification model based on DenseNet incorporating coordinate attention and hybrid augmentation.

May 28, 2026pubmed logopapers

Authors

Qi H,Guo R,He J,Wang X,Han Y,Jiang X,Liu C

Affiliations (8)

  • School of Information Engineering, Sanming University, Sanming, 365004, China.
  • School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan, 430073, China.
  • Department of Statistics, University of Warwick, Coventry, CV4 7AL, UK.
  • School of Mathematics, Sun Yat-sen University, Guangzhou, 510275, China.
  • School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan, 430073, China. [email protected].
  • School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan, 430073, China. [email protected].
  • Fujian Key Lab of Agriculture IOT Application, Sanming University, Sanming, 365004, China.
  • Fujian Provincial Universities Key Laboratory of Industrial Big Data Analysis and Application, Sanming University, Sanming, 365004, China.

Abstract

Traditional diagnostic approaches are time-consuming and labor-intensive, and the field currently lacks a comprehensive evaluation of mainstream models that addresses their complementary strengths. Computer-aided diagnosis methods can significantly increase the accuracy and efficiency of MRI-based image classification, thereby providing useful support for brain tumor computer-aided diagnosis research. To address these challenges, this study utilizes a brain tumor MRI dataset comprising 4,264 images from 333 subjects to conduct a comprehensive evaluation of various mainstream Convolutional Neural Networks and Vision Transformers under a repeated patient-level validation protocol. In addition, representative Transformer models are further evaluated under a pretrained partial fine-tuning setting. The results reveal that DenseNet-121 provides a favorable balance between diagnostic performance and computational efficiency under the no-external-pretraining setting, whereas pretrained Transformer models remain competitive when external visual pretraining is available. Building upon these findings, we propose CAHA-Net (Coordinate Attention and Hybrid Augmentation Enhanced DenseNet), an improved architecture based on DenseNet-121. This method employs a hybrid augmentation strategy combining Contrast Limited Adaptive Histogram Equalization and FMix, aiming to enhance local contrast and increase tumor-background variation during training. Furthermore, by integrating a Coordinate Attention mechanism, the model simultaneously captures spatial positional information in both the horizontal and vertical directions, promoting more lesion-sensitive feature representations during classification. The experimental results demonstrate that CAHA-Net achieves the best average performance among the tested ablation variants in multi-class classification tasks, achieving a Macro-F1 score of 94.14% ± 1.16%, an AUC of 98.97% ± 0.48%, and a false positive rate as low as 1.66% ± 0.39%. The systematic ablation results indicate that FMix is the main independent contributor to performance improvement, whereas coordinate attention helps stabilize feature representation within the complete augmentation-attention pipeline. The proposed model provides preliminary evidence of robustness and cross-dataset transferability, offering an efficient and useful reference for computer-aided diagnostic research on brain tumors.

Topics

Journal Article

Ready to Sharpen Your Edge?

Subscribe to join 11k+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.