CausalMixNet: A mixed-attention framework for causal intervention in robust medical image diagnosis.

Authors

Zhang Y, Huang YA, Hu Y, Liu R, Wu J, Huang ZA, Tan KC

Affiliations (5)

  • Department of Data Science and Artificial Intelligence, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China.
  • School of Computer Science, Northwestern Polytechnical University, Xi'an, China.
  • Department of Data Science and Artificial Intelligence, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China; Department of Computing, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China; Research Center on Data Sciences and Artificial Intelligence, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China.
  • Department of Computer Science, City University of Hong Kong (Dongguan), Dongguan, China. Electronic address: [email protected].
  • Department of Data Science and Artificial Intelligence, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China; Research Center on Data Sciences and Artificial Intelligence, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China.

Abstract

Confounding factors inherent in medical images can significantly impair the causal reasoning capabilities of deep learning models, resulting in compromised accuracy and diminished generalization performance. In this paper, we present CausalMixNet, a methodology that employs query-mixed intra-attention and key&value-mixed inter-attention to probe causal relationships between input images and labels. To mitigate unobservable confounding factors, CausalMixNet integrates a non-local reasoning module (NLRM) and key&value-mixed inter-attention (KVMIA) to conduct a front-door adjustment strategy. Furthermore, CausalMixNet incorporates a patch-masked ranking module (PMRM) and query-mixed intra-attention (QMIA) to enhance mediator learning, thereby facilitating causal intervention. The patch-mixing mechanism applied to the query and key&value features within QMIA and KVMIA specifically targets lesion-related feature enhancement and average causal effect inference. CausalMixNet consistently outperforms existing methods, achieving superior accuracy and F1-scores across in-domain and out-of-domain scenarios on multiple datasets, with an average improvement of 3% over the closest competitor. It is robust to noise, gender bias, and attribute bias, handles unobservable confounders well, and maintains stable performance even in challenging conditions.
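
The abstract does not give implementation details, so the following is only a minimal sketch, in PyTorch, of what a "query-mixed" attention block could look like: queries computed from an original view and a patch-masked auxiliary view are blended before standard multi-head attention. All names here (QueryMixedAttention, mix_ratio, x_aux) are hypothetical and are not taken from the paper. The front-door adjustment mentioned above is the standard identity P(Y | do(X)) = Σ_m P(m | X) Σ_{x'} P(Y | x', m) P(x'), with the mediator m presumably corresponding to the lesion-related features learned via QMIA/PMRM.

    # Minimal, assumption-laden sketch of a "query-mixed" attention block.
    # Names (QueryMixedAttention, mix_ratio, x_aux) are hypothetical; the
    # paper's QMIA/KVMIA modules are not specified in this abstract.
    import torch
    import torch.nn as nn

    class QueryMixedAttention(nn.Module):
        """Self-attention whose queries blend two views of the same image.

        Assumed behaviour: queries from an original image and a patch-masked
        (or otherwise perturbed) counterpart are blended before standard
        scaled dot-product attention. (The paper presumably mixes more
        selectively, e.g. guided by PMRM rankings; a fixed ratio keeps the
        sketch simple.)
        """
        def __init__(self, dim: int, num_heads: int = 8, mix_ratio: float = 0.5):
            super().__init__()
            self.q = nn.Linear(dim, dim)
            self.kv = nn.Linear(dim, dim * 2)
            self.proj = nn.Linear(dim, dim)
            self.num_heads = num_heads
            self.mix_ratio = mix_ratio

        def forward(self, x: torch.Tensor, x_aux: torch.Tensor) -> torch.Tensor:
            # x, x_aux: (batch, num_patches, dim) token sequences of the two views
            B, N, D = x.shape
            H = self.num_heads
            # Blend the two query streams (the assumed "query-mix").
            q = self.mix_ratio * self.q(x) + (1.0 - self.mix_ratio) * self.q(x_aux)
            k, v = self.kv(x).chunk(2, dim=-1)
            # Reshape to (B, H, N, D // H) for multi-head attention.
            q = q.view(B, N, H, D // H).transpose(1, 2)
            k = k.view(B, N, H, D // H).transpose(1, 2)
            v = v.view(B, N, H, D // H).transpose(1, 2)
            attn = (q @ k.transpose(-2, -1)) / (D // H) ** 0.5
            out = attn.softmax(dim=-1) @ v
            out = out.transpose(1, 2).reshape(B, N, D)
            return self.proj(out)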

Topics

  • Deep Learning
  • Image Interpretation, Computer-Assisted
  • Journal Article
