A deep learning model for predicting radiation-induced xerostomia in patients with head and neck cancer based on multi-channel fusion.
Authors
Affiliations (6)
Affiliations (6)
- Guangdong Pharmaceutical University, Guangzhou, Guangdong, 510006, China.
- The Second Clinical College of Guangzhou University of Chinese Medicine, Guangzhou, Guangdong, 510120, China.
- The Second Affiliated Hospital of Guangzhou University of Chinese Medicine, Guangzhou, Guangdong, 510120, China.
- Guangdong Pharmaceutical University, Guangzhou, Guangdong, 510006, China. [email protected].
- State Key Laboratory of Traditional Chinese Medicine Syndrome, The Second Affiliated Hospital of Guangzhou University of Chinese Medicine, Guangzhou, Guangdong, 510120, China. [email protected].
- The Second Affiliated Hospital of Guangzhou University of Chinese Medicine, Guangzhou, Guangdong, 510120, China. [email protected].
Abstract
Radiation-induced xerostomia is a common sequela in patients who undergo head and neck radiation therapy. This study aims to develop a three-dimensional deep learning model to predict xerostomia by fusing data from the gross tumor volume primary (GTVp) channel and parotid glands (PGs) channel. Retrospective data were collected from 180 head and neck cancer patients. Xerostomia was defined as xerostomia of grade ≥ 2 occurring in the 6th month of radiation therapy. The dataset was split into 137 cases (58.4% xerostomia, 41.6% non-xerostomia) for training and 43 (55.8% xerostomia, 44.2% non-xerostomia) for testing. XeroNet was composed of GNet, PNet, and a Naive Bayes decision fusion layer. GNet processed data from the GTVp channel (CT, dose distributions corresponding and the GTVp contours). PNet processed data from the PGs channel (CT, dose distributions and the PGs contours). The Naive Bayes decision fusion layer was used to integrate the results from GNet and PNet. Model performance was evaluated using accuracy, F-score, sensitivity, specificity, and area under the receiver operator characteristic curve (AUC). The proposed model achieved promising prediction results. The accuracy, AUC, F-score, sensitivity and specificity were 0.779, 0.858, 0.797, 0.777, and 0.782, respectively. Features extracted from the CT and dose distributions in the GTVp and PGs regions were used to construct machine learning models. However, the performance of these models was inferior to our method. Compared with recent studies on xerostomia prediction, our method also showed better performance. The proposed model could effectively extract features from the GTVp and PGs channels, achieving good performance in xerostomia prediction.