Enhancing image retrieval through optimal barcode representation.

Authors

Khosrowshahli R,Kheiri F,Asilian Bidgoli A,Tizhoosh HR,Makrehchi M,Rahnamayan S

Affiliations (5)

  • Faculty of Mathematics and Science, Brock University, St. Catharines, ON, L2S 3A1, Canada.
  • Faculty of Engineering and Applied Sciences, University of Ontario Institute of Technology, Oshawa, ON, L1G 0C5, Canada.
  • Faculty of Science, Wilfrid Laurier University, Waterloo, ON, N2L 3C5, Canada. [email protected].
  • Kimia Lab, Mayo Clinic, Rochester, MN, 55905, USA.
  • Department of Engineering, Brock University, St. Catharines, ON, L2S 3A1, Canada.

Abstract

Data binary encoding has proven to be a versatile tool for optimizing data processing and memory efficiency in various machine learning applications. This includes deep barcoding, generating barcodes from deep learning feature extraction for image retrieval of similar cases among millions of indexed images. Despite the recent advancement in barcode generation methods, converting high-dimensional feature vectors (e.g., deep features) to compact and discriminative binary barcodes is still an urgent necessity and remains an unresolved problem. Difference-based binarization of features is one of the most efficient binarization methods, transforming continuous feature vectors into binary sequences and capturing trend information. However, the performance of this method is highly dependent on the ordering of the input features, leading to a significant combinatorial challenge. This research addresses this problem by optimizing feature sequences based on retrieval performance metrics. Our approach identifies optimal feature orderings, leading to substantial improvements in retrieval effectiveness compared to arbitrary or default orderings. We assess the performance of the proposed approach in various medical and non-medical image retrieval tasks. This evaluation includes medical images from The Cancer Genome Atlas (TCGA), a comprehensive publicly available dataset, as well as COVID-19 Chest X-rays dataset. In addition, we evaluate the proposed approach on non-medical benchmark image datasets, such as CIFAR-10, CIFAR-100, and Fashion-MNIST. Our findings demonstrate the importance of optimizing binary barcode representation to significantly enhance accuracy for fast image retrieval across a wide range of applications, highlighting the applicability and potential of barcodes in various domains.

Topics

Journal Article

Ready to Sharpen Your Edge?

Join hundreds of your peers who rely on RadAI Slice. Get the essential weekly briefing that empowers you to navigate the future of radiology.

We respect your privacy. Unsubscribe at any time.