Evaluation of a commercial deep-learning-based contouring software for CT-based gynecological brachytherapy.
Authors
Affiliations (6)
Affiliations (6)
- Schulich School of Medicine and Dentistry, Western University, London, Ontario, Canada; Department of Medical Biophysics, Western University, London, Ontario, Canada; Robarts Research Institute, Western University, London, Ontario, Canada.
- Department of Radiation Oncology, London Health Sciences Centre, London, Ontario, Canada; Department of Oncology, Western University, London, Ontario, Canada; Department of Medical Biophysics, Western University, London, Ontario, Canada.
- Department of Radiation Oncology, London Health Sciences Centre, London, Ontario, Canada; Department of Oncology, Western University, London, Ontario, Canada.
- Department of Radiation Oncology, London Health Sciences Centre, London, Ontario, Canada.
- Department of Medical Biophysics, Western University, London, Ontario, Canada; Robarts Research Institute, Western University, London, Ontario, Canada.
- Department of Radiation Oncology, London Health Sciences Centre, London, Ontario, Canada; Department of Oncology, Western University, London, Ontario, Canada; Department of Medical Biophysics, Western University, London, Ontario, Canada. Electronic address: [email protected].
Abstract
To evaluate a commercial deep-learning based auto-contouring software specifically trained for high-dose-rate gynecological brachytherapy. We collected CT images from 30 patients treated with gynecological brachytherapy (19.5-28 Gy in 3-4 fractions) at our institution from January 2018 to December 2022. Clinical and artificial intelligence (AI) generated contours for bladder, bowel, rectum, and sigmoid were obtained. Five patients were randomly selected from the test set and manually re-contoured by 4 radiation oncologists. Contouring was repeated 2 weeks later using AI contours as the starting point ("AI-assisted" approach). Comparisons amongst clinical, AI, AI-assisted, and manual retrospective contours were made using various metrics, including Dice similarity coefficient (DSC) and unsigned D2cc difference. Between clinical and AI contours, DSC was 0.92, 0.79, 0.62, 0.66, for bladder, rectum, sigmoid, and bowel, respectively. Rectum and sigmoid had the lowest median unsigned D2cc difference of 0.20 and 0.21 Gy/fraction respectively between clinical and AI contours, while bowel had the largest median difference of 0.38 Gy/fraction. Agreement between fully automated AI and clinical contours was generally not different compared to agreement between AI-assisted and clinical contours. AI-assisted interobserver agreement was better than manual interobserver agreement for all organs and metrics. The median time to contour all organs for manual and AI-assisted approaches was 14.8 and 6.9 minutes/patient (p < 0.001), respectively. The agreement between AI or AI-assisted contours against the clinical contours was similar to manual interobserver agreement. Implementation of the AI-assisted contouring approach could enhance clinical workflow by decreasing both contouring time and interobserver variability.