Semi-supervised temporal attention network for lung 4D CT ventilation estimation.
Authors
Affiliations (5)
Affiliations (5)
- School of Mechanical, Electrical and Information Engineering, Shandong University, Weihai, 264209, China.
- School of Computer Science and Engineering, Southeast University, Nanjing, 210000, China.
- Department of Control Science and Engineering, School of Electronics and Information Engineering, Tongji University, Shanghai, 201804, China.
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, 30332, USA.
- School of Mechanical, Electrical and Information Engineering, Shandong University, Weihai, 264209, China. Electronic address: [email protected].
Abstract
Computed tomography (CT)-derived ventilation estimation, also known as CT ventilation imaging (CTVI), is emerging as a potentially crucial tool for designing functional avoidance radiotherapy treatment plans and evaluating therapy responses. However, most conventional CTVI methods are highly dependent on deformation fields from image registration to track volume variations, making them susceptible to registration errors and resulting in low estimation accuracy. In addition, existing deep learning-based CTVI methods typically have the issue of requiring a large amount of labeled data and cannot fully utilize temporal characteristics of 4D CT images. To address these issues, we propose a semi-supervised temporal attention (S<sup>2</sup>TA) network for lung 4D CT ventilation estimation. Specifically, the semi-supervised learning framework involves a teacher model for generating pseudo-labels from unlabeled 4D CT images, to train a student model that takes both labeled and unlabeled 4D CT images as input. The teacher model is updated as the moving average of the instantly trained student, to prevent it from being abruptly impacted by incorrect pseudo-labels. Furthermore, to fully exploit the temporal information of 4D CT images, a temporal attention architecture is designed to effectively capture the temporal relationships across multiple phases in 4D CT image sequence. Extensive experiments on three publicly available thoracic 4D CT datasets show that our proposed method can achieve higher estimation accuracy than state-of-the-art methods, which could potentially be used for lung functional avoidance radiotherapy and treatment response modeling.