Automated Classification of Cervical Spinal Stenosis using Deep Learning on CT Scans.
Zhang YL, Huang JW, Li KY, Li HL, Lin XX, Ye HB, Chen YH, Tian NF
•papers•Jun 3 2025Retrospective study. To develop and validate a computed tomography-based deep learning(DL) model for diagnosing cervical spinal stenosis(CSS). Although magnetic resonance imaging (MRI) is widely used for diagnosing CSS, its inherent limitations, including prolonged scanning time, limited availability in resource-constrained settings, and contraindications for patients with metallic implants, make computed tomography (CT) a critical alternative in specific clinical scenarios. The development of CT-based DL models for CSS detection holds promise in transcending the diagnostic efficacy limitations of conventional CT imaging, thereby serving as an intelligent auxiliary tool to optimize healthcare resource allocation. Paired CT/MRI images were collected. CT images were divided into training, validation, and test sets in an 8:1:1 ratio. The two-stage model architecture employed: (1) a Faster R-CNN-based detection model for localization, annotation, and extraction of regions of interest (ROI); (2) comparison of 16 convolutional neural network (CNN) models for stenosis classification to select the best-performing model. The evaluation metrics included accuracy, F1-score, and Cohen's κ coefficient, with comparisons made against diagnostic results from physicians with varying years of experience. In the multiclass classification task, four high-performing models (DL1-b0, DL2-121, DL3-101, and DL4-26d) achieved accuracies of 88.74%, 89.40%, 89.40%, and 88.08%, respectively. All models demonstrated >80% consistency with senior physicians and >70% consistency with junior physicians.In the binary classification task, the models achieved accuracies of 94.70%, 96.03%, 96.03%, and 94.70%, respectively. All four models demonstrated consistency rates slightly below 90% with junior physicians. However, when compared with senior physicians, three models (excluding DL4-26d) exhibited consistency rates exceeding 90%. The DL model developed in this study demonstrated high accuracy in CT image analysis of CSS, with a diagnostic performance comparable to that of senior physicians.