Photoacoustic device fingerprints induce bias in deep learning models.
Authors
Affiliations (21)
Affiliations (21)
- Division of Intelligent Medical Systems (IMSY), German Cancer Research Center (DKFZ) Heidelberg, Heidelberg, Germany. [email protected].
- Medical Faculty, Heidelberg University, Heidelberg, Germany. [email protected].
- Division of Intelligent Medical Systems (IMSY), German Cancer Research Center (DKFZ) Heidelberg, Heidelberg, Germany.
- Faculty of Mathematics and Computer Science, Heidelberg University, Heidelberg, Germany.
- Faculty of Physics and Astronomy, Heidelberg University, Heidelberg, Germany.
- Division of Medical Image Computing (MIC), German Cancer Research Center (DKFZ) Heidelberg, Heidelberg, Germany.
- Helmholtz Imaging, German Cancer Research Center (DKFZ), Heidelberg, Germany.
- Department of Vascular Surgery, University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Erlangen, Germany.
- Institute of Biological and Medical Imaging, Helmholtz Zentrum München, Neuherberg, Germany.
- Chair of Biological Imaging, Central Institute for Translational Cancer Research (TranslaTUM), School of Medicine and Health, Technical University of Munich, Munich, Germany.
- Department of Medical Physics, Memorial Sloan Kettering Cancer Center, New York, USA.
- Department of Pediatrics and Adolescent Medicine, University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Erlangen, Germany.
- National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and University Hospital Heidelberg, Heidelberg, Germany.
- Division of Intelligent Medical Systems (IMSY), German Cancer Research Center (DKFZ) Heidelberg, Heidelberg, Germany. [email protected].
- Medical Faculty, Heidelberg University, Heidelberg, Germany. [email protected].
- Faculty of Mathematics and Computer Science, Heidelberg University, Heidelberg, Germany. [email protected].
- Helmholtz Imaging, German Cancer Research Center (DKFZ), Heidelberg, Germany. [email protected].
- National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and University Hospital Heidelberg, Heidelberg, Germany. [email protected].
- Surgical AI Research Group, Heidelberg University Hospital, Surgical Clinic, Heidelberg, Germany. [email protected].
- HIDSS4Health - Helmholtz Information and Data Science School for Health, Karlsruhe/Heidelberg, Germany. [email protected].
- Mohamed Bin Zayed University of Artificial Intelligence (MBZUAI), Abu Dhabi, UAE. [email protected].
Abstract
Deep learning (DL) models developed for established medical imaging modalities have shown increasing performance and reliability as a result of scaling efforts. In contrast, model development for emerging modalities such as photoacoustic imaging (PAI) remains challenged by data sparsity, which limits model generalizability and raises the susceptibility to bias. While recent studies in PAI have started to investigate subject-related confounders, the impact of hardware-related confounders remains unexplored, posing a critical risk for failure in multicentric deployment scenarios. We are the first to provide a multicentric analysis of hardware-induced bias in PAI. We analyzed device-specific characteristics in images from four device instances and two peripheral artery disease studies, and trained DL models to classify device origin and disease under varying levels of device-health correlations in the data. We showed that 1) multiple instances of the same PAI device type embed identifiable fingerprints in the images, 2) that DL models can leverage these fingerprints to reach [Formula: see text] accuracy in device detection and critically, 3) when a correlation between device instance and health status is present, models trained for disease diagnosis exploit these device-specific signatures as shortcuts, thereby producing biased and clinically misleading predictions. This research highlights the risk of overestimating algorithm performance when such confounding is overlooked, emphasizing the importance of bias evaluation and explainable artificial intelligence methods to identify potential shortcuts, finally enabling multicentric PAI studies.