Back to all papers

Deep learning based medical image compression using cross attention learning and wavelet transform.

November 14, 2025pubmed logopapers

Authors

Dai F

Affiliations (1)

  • School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, Shaanxi, China. [email protected].

Abstract

Efficient compression of medical images is vital for telemedicine and cloud-based healthcare, where bandwidth and storage constraints pose significant challenges. Conventional lossless approaches provide limited compression, whereas lossy techniques risk compromising diagnostic accuracy. To address these limitations, we introduce a novel hybrid compression framework that combines Discrete Wavelet Transform (DWT) with a deep Cross-Attention Learning (CAL) module to preserve clinically relevant details while reducing redundant information. The proposed pipeline first decomposes input images into multi-resolution sub-bands via DWT, followed by a CAL-driven encoder that emphasizes high-information regions through dynamic feature weighting. A lightweight Variational Autoencoder (VAE) refines feature representation prior to entropy coding for final compression. Extensive experiments on benchmark datasets, including LIDC-IDRI, LUNA16, and MosMed, demonstrate that our approach achieves superior performance in terms of PSNR, SSIM, and MSE compared to state-of-the-art codecs such as JPEG2000 and BPG. These results highlight the method's potential for real-time medical image transmission and long-term storage without sacrificing diagnostic integrity.

Topics

Journal Article

Ready to Sharpen Your Edge?

Subscribe to join 7,600+ peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.