Optimized deep learning for brain tumor detection: a hybrid approach with attention mechanisms and clinical explainability.
Authors
Affiliations (5)
Affiliations (5)
- School of Computer Science Engineering and Applications, D Y Patil International University (DYPIU), Akrudi, Pune, 411044, Maharashtra, India.
- School of Computer Science Engineering and Applications, D Y Patil International University (DYPIU), Akrudi, Pune, 411044, Maharashtra, India. [email protected].
- Department of Applied Sciences, Symbiosis Institute of Technology, Symbiosis International (Deemed University), Lavale, Pune, 412115, Maharashtra, India. [email protected].
- Symbiosis Centre for Applied AI, Symbiosis Institute of Technology, Symbiosis International (Deemed University), Lavale, Pune, 412115, Maharashtra, India.
- School of Engineering, Swinburne University of Technology, Hawthorn, Australia.
Abstract
Brain tumor classification (BTC) from Magnetic Resonance Imaging (MRI) is a critical diagnosis task, which is highly important for treatment planning. In this study, we propose a hybrid deep learning (DL) model that integrates VGG16, an attention mechanism, and optimized hyperparameters to classify brain tumors into different categories as glioma, meningioma, pituitary tumor, and no tumor. The approach leverages state-of-the-art preprocessing techniques, transfer learning, and Gradient-weighted Class Activation Mapping (Grad-CAM) visualization on a dataset of 7023 MRI images to enhance both performance and interpretability. The proposed model achieves 99% test accuracy and impressive precision and recall figures and outperforms traditional approaches like Support Vector Machines (SVM) with Histogram of Oriented Gradients (HOG), Local Binary Pattern (LBP) and Principal Component Analysis (PCA) by a significant margin. Moreover, the model eliminates the need for manual labelling-a common challenge in this domain-by employing end-to-end learning, which allows the proposed model to derive meaningful features hence reducing human input. The integration of attention mechanisms further promote feature selection, in turn improving classification accuracy, while Grad-CAM visualizations show which regions of the image had the greatest impact on classification decisions, leading to increased transparency in clinical settings. Overall, the synergy of superior prediction, automatic feature extraction, and improved predictability confirms the model as an important application to neural networks approaches for brain tumor classification with valuable potential for enhancing medical imaging (MI) and clinical decision-making.