Residual self-attention vision transformer for detecting acquired vitelliform lesions and age-related macular drusen.

Authors

Powroznik P,Skublewska-Paszkowska M,Nowomiejska K,Gajda-Deryło B,Brinkmann M,Concilio M,Toro MD,Rejdak R

Affiliations (8)

  • Department of Computer Science, Lublin University of Technology, Nadbystrzycka 36B, 20-618, Lublin, Poland. [email protected].
  • Department of Computer Science, Lublin University of Technology, Nadbystrzycka 36B, 20-618, Lublin, Poland.
  • Chair and Department of General and Pediatric Ophthalmology, Medical University of Lublin, Chmielna 1, 20-400, Lublin, Poland.
  • Department of Ophthalmology, Klinikum Klagenfurt, 9020, Klagenfurt, Austria.
  • Department of Ophthalmology, Universitaetsklinikum Schleswig-Holstein, 23564, Luebeck, Germany.
  • Department of Medicine and Health Sciences "V.Tiberio", University of Molise, 86100, Campobasso, Italy.
  • Eye Clinic, Department of Public Health, Federico II University, Naples, Italy.
  • Department of Special Surgery and Ophthalmology, University of Jordan, Amman, Jordan.

Abstract

Retinal diseases recognition is still a challenging task. Many deep learning classification methods and their modifications have been developed for medical imaging. Recently, Vision Transformers (ViT) have been applied for classification of retinal diseases with great success. Therefore, in this study a novel method was proposed, the Residual Self-Attention Vision Transformer (RS-A ViT), for automatic detection of acquired vitelliform lesions (AVL), macular drusen as well as distinguishing them from healthy cases. The Residual Self-Attention module instead of Self-Attention was applied in order to improve model's performance. The new tool outperforms the classical deep learning methods, like EfficientNet, InceptionV3, ResNet50 and VGG16. The RS-A ViT method also exceeds the ViT algorithm, reaching 96.62%. For the purpose of this research a new dataset was created that combines AVL data gathered from two research centers and drusen as well as normal cases from the OCT dataset. The augmentation methods were applied in order to enlarge the samples. The Grad-CAM interpretability method indicated that this model analyses the appropriate areas in optical coherence tomography images in order to detect retinal diseases. The results proved that the presented RS-A ViT model has a great potential in classification retinal disorders with high accuracy and thus may be applied as a supportive tool for ophthalmologists.

Topics

Retinal DrusenVitelliform Macular DystrophyMacular DegenerationJournal Article
Get Started

Upload your X-ray image and get interpretation.

Upload now →

Disclaimer: X-ray Interpreter's AI-generated results are for informational purposes only and not a substitute for professional medical advice. Always consult a healthcare professional for medical diagnosis and treatment.