Abdominal multi-organ segmentation on 3D negative-contrast CT cholangiopancreatography: a comparative study of deep learning methods.
Authors
Affiliations (6)
Affiliations (6)
- Department of Radiology, Jiangnan University Medical Center, Wuxi No.2 People's Hospital, Wuxi, China.
- Department of Automation, School of Information Science and Engineering, East China University of Science and Technology, Shanghai, China.
- Center for Intelligent Medical Imaging and Health, Research Institute of Tsinghua University in Shenzhen, Shenzhen, China.
- Tsimage Medical Technology (Shenzhen) Co., Ltd, Shenzhen, China.
- Suumi Research Institute, Atami, Japan.
- Department of Radiology, Jiangnan University Medical Center, Wuxi No.2 People's Hospital, Wuxi, China. [email protected].
Abstract
This study aims to automate segmentation of the biliary and pancreatic systems on 3D negative-contrast CT cholangiopancreatography (3D-nCTCP) to improve preoperative planning and diagnosis. We retrospectively collected dual-phase enhanced CT data from 111 patients with malignant low biliary obstruction. Portal phase data were processed and annotated by an expert radiologist. The dataset, comprising 25,700 images, was split into 91 patients for training/validation and 20 patients for testing. Four state-of-the-art segmentation models, namely TransUNet 2D, nnU-Net 2D, Swin-UNETR 2D, and Swin-UNETR 3D, were implemented and quantitatively compared. Model performance was evaluated using the Dice Similarity Coefficient (DSC) and the Average Symmetric Surface Distance (ASSD), with the inter-observer variability (IOV) serving as the clinical benchmark. Across all models, segmentation performance exhibited high accuracy for the liver, with notably lower accuracy for smaller, more complex organs (duodenum, pancreas, biliary system). The Swin-UNETR 3D model demonstrated superior overall segmentation performance, particularly for challenging organs, with competitive stability. Swin-UNETR 3D achieved a mean DSC of 96.12% ± 1.09% and ASSD of 4.60 mm ± 8.25 mm for the liver, mean DSC of 75.31% ± 11.31% and ASSD of 4.42 mm ± 5.84 mm for the duodenum, mean DSC of 81.00% ± 6.33% and ASSD of 2.07 mm ± 1.70 mm for the pancreas, and mean DSC of 88.64% ± 6.74% and ASSD of 5.47 mm ± 12.13 mm for the biliary system. The 3D volumetric approach in Swin-UNETR 3D outperformed its 2D counterparts (TransUNet 2D, nnU-Net 2D, Swin-UNETR 2D) in most metrics, particularly for the duodenum, where it achieved the highest mean DSC and improved boundary localization compared to 2D models. The comparative analysis demonstrates the superiority of 3D volumetric models (Swin-UNETR 3D) over 2D models for accurate and stable abdominal multi-organ segmentation on 3D-nCTCP, reduces manual annotation time, and may aid broader clinical adoption.