Liver cancer segmentator: Metadata-guided confidence scoring for reliable segmentation of colorectal liver metastases in CT.
Authors
Affiliations (8)
Affiliations (8)
- School of Computing, Queen's University, Kingston, ON, Canada; Department of Electrical Engineering, Qa.C., Islamic Azad University, Qazvin, Iran.
- Department of Radiology, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
- Department of Biomedical and Molecular Sciences, Queen's University, Kingston, ON, Canada.
- School of Computing, Queen's University, Kingston, ON, Canada.
- Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
- Department of Surgical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
- Department of Abdominal Imaging, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
- School of Computing, Queen's University, Kingston, ON, Canada; Department of Biomedical and Molecular Sciences, Queen's University, Kingston, ON, Canada. Electronic address: [email protected].
Abstract
This study introduces the liver cancer segmentator (LCS), a deep learning model designed for automatic and robust segmentation of liver parenchyma and tumors in abdominal contrast-enhanced computed tomography images from patients with colorectal liver metastases. The primary aim was to enhance confidence scoring for more reliable clinical segmentation assessment. In this retrospective study, 446 abdominal contrast-enhanced computed tomography examinations were collected; 355 (80%) were used for training and 91 for testing. Data originated from routine clinical cases at two institutions, representing diverse disease stages and treatment settings. A state-of-the-art neural network segmentation framework was trained on these cases, with performance evaluated using the Dice score and the normalized surface distance. An iterative training process, supported by an integrated annotation workflow, was employed to refine the training set. The final model was applied to the 91 test examinations to assess the impact of tumor volume and slice thickness on confidence scoring. Reliability was quantified through pairwise Dice score for failure detection and the area under the risk coverage curve. The LCS achieved a Dice score of 0.9707 (95% CI: 0.9663-0.9751) for liver parenchyma and 0.7695 (95% CI: 0.7166-0.8224) for tumors. Normalized surface distance values at a 3-millimeter tolerance were 0.9605 (95% CI: 0.9539-0.9671) for parenchyma and 0.8412 (95% CI: 0.7928-0.8896) for tumors. Confidence scoring analysis demonstrated strong correlations between tumor volume, slice thickness, and segmentation reliability, reducing the area under the risk coverage curve from 16.7 to 10.3. The LCS achieved high segmentation accuracy in patients with colorectal liver metastases. Incorporating tumor volume and slice thickness into the confidence scoring process improved failure detection, enhanced reliability, and provided valuable insights for refining clinical deployment of automated segmentation algorithms.