Publication Type
Journal Article
Version
acceptedVersion
Publication Date
1-2010
Abstract
Similarity measurement is a critical component in content-based image retrieval systems, and learning a good distance metric can significantly improve retrieval performance. However, despite extensive study, there are several major shortcomings with the existing approaches for distance metric learning that can significantly affect their application to medical image retrieval. In particular, "similarity" can mean very different things in image retrieval: resemblance in visual appearance (e.g., two images that look like one another) or similarity in semantic annotation (e.g., two images of tumors that look quite different yet are both malignant). Current approaches for distance metric learning typically address only one goal without consideration of the other. This is problematic for medical image retrieval where the goal is to assist doctors in decision making. In these applications, given a query image, the goal is to retrieve similar images from a reference library whose semantic annotations could provide the medical professional with greater insight into the possible interpretations of the query image. If the system were to retrieve images that did not look like the query, then users would be less likely to trust the system; on the other hand, retrieving images that appear superficially similar to the query but are semantically unrelated is undesirable because that could lead users toward an incorrect diagnosis. Hence, learning a distance metric that preserves both visual resemblance and semantic similarity is important. We emphasize that, although our study is focused on medical image retrieval, the problem addressed in this work is critical to many image retrieval systems. We present a boosting framework for distance metric learning that aims to preserve both visual and semantic similarities. The boosting framework first learns a binary representation using side information, in the form of labeled pairs, and then computes the distance as a weighted Hamming distance using the learned binary representation. A boosting algorithm is presented to efficiently learn the distance function. We evaluate the proposed algorithm on a mammographic image reference library with an Interactive Search-Assisted Decision Support (ISADS) system and on the medical image data set from ImageCLEF. Our results show that the boosting framework compares favorably to state-of-the-art approaches for distance metric learning in retrieval accuracy, with much lower computational cost. Additional evaluation with the COREL collection shows that our algorithm works well for regular image data sets.
Keywords
Boosting, Distance Metric Learning, Image/video retrieval, Machine learning, boosting, distance metric learning, image retrieval
Discipline
Artificial Intelligence and Robotics | Theory and Algorithms
Research Areas
Data Science and Engineering
Publication
IEEE Transactions on Pattern Analysis Machine Intelligence (TPAMI)
Volume
32
Issue
1
First Page
30
Last Page
44
ISSN
0162-8828
Identifier
10.1109/TPAMI.2008.273
Publisher
IEEE
Citation
LIU, Yang; JIN, Rong; Mummert, Lily; Sukthankar, Rahul; Goode, Adam; ZHENG, Bin; HOI, Steven C. H.; and Satyanarayanan, Mahadev.
A boosting framework for visuality-preserving distance metric learning and its application to medical image retrieval. (2010). IEEE Transactions on Pattern Analysis Machine Intelligence (TPAMI). 32, (1), 30-44.
Available at: https://ink.library.smu.edu.sg/sis_research/2315
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1109/TPAMI.2008.273