Diagnosis of Pulmonary Hypertension by Integrating Multimodal Data with a Hybrid Graph Convolutional and Transformer Network.
Authors
Affiliations (8)
Affiliations (8)
- School of Computer Science and Technology, Zhengzhou University of Light Industry, Zhengzhou, 450001, Henan, China.
- State Key Laboratory for Innovation and Transformation of Luobing Theory, Department of Cardiology, the First Affiliated Hospital of Nanjing Medical University, Nanjing, 210029, Jiangsu, China.
- Department of Computer Science, Kennesaw State University, Marietta, GA, USA.
- Department of Integrated Traditional Chinese and Western Clinical Medicine, Hebei Medical University, Shijiazhuang, Hebei, China.
- Department of Radiology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, 210029, China.
- State Key Laboratory for Innovation and Transformation of Luobing Theory, Department of Cardiology, the First Affiliated Hospital of Nanjing Medical University, Nanjing, 210029, Jiangsu, China. [email protected].
- Department of Applied Computing, Michigan Technological University, Houghton, MI, USA.
- Center for Biocomputing and Digital Health, Institute of Computing and Cybersystems, and Health Research Institute, Michigan Technological University, Houghton, MI, USA.
Abstract
Early and accurate diagnosis of pulmonary hypertension (PH), including differentiating pre-capillary from post-capillary PH, is crucial for guiding effective clinical management. This study developed and validated a deep learning-based diagnostic model to classify patients into non-PH, pre-capillary PH, or post-capillary PH categories. A retrospective dataset from 204 patients (112 pre-capillary PH, 32 post-capillary PH, and 60 non-PH controls) was collected at the First Affiliated Hospital of Nanjing Medical University, with diagnoses confirmed by right heart catheterization (RHC). Patients were randomly divided into training (186 patients, 90%) and testing sets (18 patients, 10%) stratified by diagnostic category. We trained and evaluated the model using 35 repeated random splits. The proposed deep learning model combined graph convolutional networks (GCN), convolutional neural networks (CNN), and Transformers to analyze multimodal data, including cine short-axis (SAX) sequences, four-chamber (4CH) sequences, and clinical parameters. Across test splits, the model achieved an overall area under the receiver operating characteristic curve (AUC) of 0.814 ± 0.06 and accuracy (ACC) of 0.734 ± 0.06 (mean ± SD). Class-specific AUCs were 0.745 ± 0.11 for non-PH, 0.863 ± 0.06 for pre-capillary PH, and 0.834 ± 0.10 for post-capillary PH, indicating good discriminative ability. This study demonstrated three-class PH classification using multimodal inputs. By fusing imaging and clinical data, the model may support accurate and timely clinical decision-making in PH.