Role of large language models for etiological classification of brain stroke based on MRI brain reports: a feasibility study.
Authors
Affiliations (4)
Affiliations (4)
- Neurorradiología Diagnostica e Intervencionista, HT Médica Ávila, Spain. Electronic address: [email protected].
- NLP department, HT medica, Jaén, Spain. Electronic address: [email protected].
- MRI unit, Radiology department, HT medica, Jaén, Spain. Electronic address: [email protected].
- MRI unit, Radiology department, HT medica, Jaén, Spain. Electronic address: [email protected].
Abstract
Ischemic stroke, a leading cause of global disability and mortality, demands precise etiological classification for effective management. The variability in the use of existing stroke classification systems, along with the challenges in manual etiological labeling from brain MRI radiological reports, calls for an innovative approach. This study aims to develop and evaluate a Natural Language Processing (NLP) algorithm using transformer-based models for the extraction and classification of ischemic stroke types from MRI reports, enhancing diagnostic efficiency and stroke management. We built a dataset comprising 635 brain MRI reports, annotated for four distinct ischemic stroke types. All were clinically consistent with focal neurologic impairment due to stroke. The study involved evaluating two pre-trained models BERT (Bert clinical and Beto) and two models RoBERTa (Roberta clinical trials and Roberta biomedical), focusing on their ability to accurately classify stroke subtypes. The Roberta biomedical model emerged as the most effective, demonstrating superior performance with an accuracy of 76.7 % with statistically significant results. This model also achieved the highest precision, recall, and F1 scores across all stroke types, indicating its robustness in stroke subtype classification. The study highlights the potential of NLP algorithms in automating stroke classification from MRI reports, which could significantly aid in diagnostic processes and streamline stroke management strategies.