Enhanced pulmonary nodule detection with U-Net, YOLOv8, and swin transformer.
Authors
Affiliations (5)
Affiliations (5)
- Department of Cardiology, Tianjin Chest Hospital, Tianjin, 300000, China.
- School of Medical Imaging, Tianjin Medical University, No. 1 Guangdong Road, Tianjin, 300203, China.
- The First Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, 116011, China.
- Department of Cardiology, Tianjin Chest Hospital, Tianjin, 300000, China. [email protected].
- School of Medical Imaging, Tianjin Medical University, No. 1 Guangdong Road, Tianjin, 300203, China. [email protected].
Abstract
Lung cancer remains the leading cause of cancer-related mortality worldwide, emphasizing the critical need for early pulmonary nodule detection to improve patient outcomes. Current methods encounter challenges in detecting small nodules and exhibit high false positive rates, placing an additional diagnostic burden on radiologists. This study aimed to develop a two-stage deep learning model integrating U-Net, Yolov8s, and the Swin transformer to enhance pulmonary nodule detection in computer tomography (CT) images, particularly for small nodules, with the goal of improving detection accuracy and reducing false positives. We utilized the LUNA16 dataset (888 CT scans) and an additional 308 CT scans from Tianjin Chest Hospital. Images were preprocessed for consistency. The proposed model first employs U-Net for precise lung segmentation, followed by Yolov8s augmented with the Swin transformer for nodule detection. The Shape-aware IoU (SIoU) loss function was implemented to improve bounding box predictions. For the LUNA16 dataset, the model achieved a precision of 0.898, a recall of 0.851, and a mean average precision at 50% IoU (mAP50) of 0.879, outperforming state-of-the-art models. The Tianjin Chest Hospital dataset has a precision of 0.855, a recall of 0.872, and an mAP50 of 0.862. This study presents a two-stage deep learning model that leverages U-Net, Yolov8s, and the Swin transformer for enhanced pulmonary nodule detection in CT images. The model demonstrates high accuracy and a reduced false positive rate, suggesting its potential as a useful tool for early lung cancer diagnosis and treatment.