Automated lumbar spine segmentation in MRI using an enhanced U-Net with inception module and dual-output mechanism.
Authors
Affiliations (2)
Affiliations (2)
- Department of Data Science and Computer Applications, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, Karnataka, 576104, India.
- Department of Data Science and Computer Applications, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, Karnataka, 576104, India. [email protected].
Abstract
Accurate segmentation of spinal structures, including vertebrae, intervertebral discs (IVDs), and the spinal canal, is crucial for diagnosing lumbar spine disorders. Deep learning-based semantic segmentation has significantly improved accuracy in medical imaging. This study proposes an enhanced U-Net incorporating an Inception module for multi-scale feature extraction and a dual-output mechanism for improved training stability and feature refinement. The model is trained on the SPIDER lumbar spine MRI dataset and evaluated using Accuracy, Precision, Recall, F1-score, and mean Intersection over Union (mIoU). Comparative analysis with the baseline models-U-Net, ResUNet, Attention U-Net, and TransUNet-shows that the proposed model achieves superior segmentation accuracy, with improved boundary delineation and better handling of class imbalance. An evaluation of loss functions identified Dice loss as the most effective, enabling the model to achieve an mIoU of 0.8974, an accuracy of 0.9742, a precision of 0.9417, a recall of 0.9470, and an F1-score of 0.9444, outperforming all four baseline models. The Inception module enhances feature extraction at multiple scales, while the dual-output mechanism improves gradient flow and segmentation consistency. Initially focused on binary segmentation, the approach was extended to multiclass segmentation, enabling separate identification of vertebrae, IVDs, and the spinal canal. These enhancements offer a more precise and efficient solution for automated lumbar spine segmentation in MRI, thereby supporting enhanced diagnostic workflows in medical imaging.