Back to all papers

Multi-model study of fast VMAT segment dose calculation with deep learning.

April 23, 2026pubmed logopapers

Authors

Xiao F,Wahl N,Belka C,Kurz C,Dedes G,Landry G

Affiliations (4)

  • Department of Radiation Oncology, LMU University Hospital, Marchioninistrasse 15, Munich, Bayern, 81377, Germany.
  • Department of Medical Physics in Radiation Oncology, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 280, Heidelberg, 69120, Germany.
  • Department of Medical Physics, Ludwig-Maximilians-Universitaet Muenchen, Am Coulombwall 1, Garching, Munich, Bayern, 85748 , Germany.
  • Department of Radiation Oncology, LMU University Hospital, Marchioninistr. 15, Munich, 81377, Germany.

Abstract

Deep learning (DL) methods enable photon dose calculation under two main coordinate representations: Beam's Eye View (BEV) and patient coordinates. We evaluate dose calculation accuracy and speed under these coordinate paradigms and with representative DL models within a unified dataset and pipeline, and introduce two lightweight models for fast photon dose calculation. Planning computed tomography (CT) scans and volumetric modulated arc therapy (VMAT) plans from 24 prostate cancer patients were used. Monte Carlo (MC) simulation generated 5940, 540, and 3053 segment doses for training (11 patients), validation (3), and testing (10), respectively. For BEV, we used a combination of convolutional neural network (CNN) and convolutional long short-term memory network (ConvLSTM) called CNN-ConvLSTM, a CNN-Mamba combination (CNN-Mamba), a transformer-based architecture (DoTA), and a cascaded 3D UNet (C3D). These were trained on CT and segment-projection BEV cuboids. For patient coordinates, the DeepDose individual segment dose prediction framework implemented with C3D (DeepDose-C3D) was trained on cropped CT volumes with four physical inputs. Segment and plan dose accuracy were assessed using local gamma passing rates γPR (2%/3 mm and 1%/3 mm) and dose-volume histogram metrics. Dose calculation times (inference plus pre/post-processing) were measured on three different GPUs. All five models achieved mean local γPR values ≥91.0% (2%/3 mm) for segment doses and ≥99.0% (1%/3 mm) for plan doses. Mean per-segment dose calculation times were 79, 67, 298, 490, 356 ms for CNN-ConvLSTM, CNN-Mamba, DoTA, C3D, and DeepDose-C3D, respectively. On the latest-generation GPU avaliable, the corresponding per-plan (average 305 segments) dose calculation times were 5.5, 6.2, 33.6, 38.7 35.4 s. Both BEV- and patient-coordinate DL methods achieved accurate photon plan dose calculation, with BEV-based approaches showing more robust segment performance. CNN-ConvLSTM and CNN-Mamba retain comparable accuracy at lower computational cost, enabling fast photon dose calculation.

Topics

Journal Article

Ready to Sharpen Your Edge?

Subscribe to join 11k+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.