Deep learning-driven multi-class classification of brain strokes using computed tomography: A step towards enhanced diagnostic precision.
Authors
Affiliations (4)
Affiliations (4)
- Department of Radiological Sciences, School of Human Health Sciences, Tokyo Metropolitan University, Japan.
- Department of Radiology, Faculty of Medicine, University of Peradeniya, Sri Lanka.
- Department of Statistics and Computer Science, Faculty of Science, University of Peradeniya, Sri Lanka.
- Department of Radiological Sciences, School of Human Health Sciences, Tokyo Metropolitan University, Japan. Electronic address: [email protected].
Abstract
To develop and validate deep learning models leveraging CT imaging for the prediction and classification of brain stroke conditions, with the potential to enhance accuracy and support clinical decision-making. This retrospective, bi-center study included data from 250 patients, with a dataset of 8186 CT images collected from 2017 to 2022. Two AI models were developed using the Expanded ResNet101 deep learning framework as a two-step model. Model performance was evaluated using confusion matrices, supplemented by external validation with an independent dataset. External validation was conducted by an expert and two external members. Overall accuracy, confidence intervals, Cohen's Kappa value, and McNemar's test P-values were calculated. A total of 8186 CT images were incorporated, with 6386 images used for the training and 900 datasets for testing and validation in Model 01. Further, 1619 CT images were used for training and 600 datasets for testing and validation in Model 02. The average accuracy, precision, and F1 score for both models were assessed: Model 01 achieved 99.6 %, 99.4 %, and 99.6 % respectively, whereas Model 02 achieved 99.2 %, 98.8 %, and 99.1 %. The external validation accuracies were 78.6 % (95 % CI: 0.73,0.83; P < 0.001) and 60.2 % (95 % CI: 0.48,0.70; P < 0.001) for Models 01 and 02 respectively, as evaluated by the expert. Deep learning models demonstrated high accuracy, precision, and F1 scores in predicting outcomes for brain stroke patients. With larger cohort and diverse radiologic mimics, these models could support clinicians in prognosis and decision-making.