Interpretable Deep Learning for Enhanced Multi-Class Classification of Gastrointestinal Endoscopic Images.
Authors
Affiliations (6)
Affiliations (6)
- Information Technology, Atal Bihari Vajpayee Indian Institute of Information Technology and Management, Gwalior, Gwalior, Madhya Pradesh, 474015, INDIA.
- Atal Bihari Vajpayee Indian Institute of Information Technology and Management, ABV-IIITM Gwalior, Morena Link road, Gwalior, Gwalior, Gwalior, MP, 474015, INDIA.
- Information Technology, Atal Bihari Vajpayee Indian Institute of Information Technology and Management, IIITM Campus, Gwalior, Gwalior, Madhya Pradesh, 474015, INDIA.
- Atal Bihari Vajpayee Indian Institute of Information Technology and Management, IIITM Campus, Gwalior, Gwalior, Madhya Pradesh, 474015, INDIA.
- ECE, SRM University AP - Amaravati, SRM University AP, Andhra Pradesh,, Neerukonda, Mangalagiri Mandal, Guntur District, Mangalagiri, Andhra Pradesh 522240, Guntur, Andhra Pradesh, 522240, INDIA.
- Massachusetts General Hospital, USA, Boston, Massachusetts, 02114, UNITED STATES.
Abstract
Gastrointestinal (GI) endoscopy serves as a vital tool for assessing the GI tract and diagnosing related disorders. Recent progress in deep learning has shown significant improvements in identifying anomalies using sophisticated models and data augmentation strategies. This study introduces an enhanced approach to improve classification accuracy using 8,000 labeled endoscopic images from the Kvasir dataset, categorized into eight distinct classes. Leveraging EfficientNetB3 as the backbone, our proposed architecture eliminates the reliance on data augmentation while maintaining moderate model complexity. Our model achieves a test accuracy of 94.25%, alongside precision and recall of 94.29% and 94.24%, respectively. Furthermore, Local Interpretable Model-agnostic Explanation (LIME) saliency maps are employed to enhance interpretability by highlighting critical regions in the images that influence model predictions. To facilitate real-world usability, a user-friendly interface was developed using Gradio, enabling users to upload images, generate predictions, view confidence levels, and maintain a history of past results. This work underscores the importance of integrating high classification accuracy, interpretability, and accessibility in advancing medical imaging applications.