Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
12-2015
Abstract
A fundamental problem in image search is learning the ranking function, i.e., the similarity between a query and an image. Research on this topic has evolved through two paradigms: the feature-based vector model and image ranker learning. The former relies on the text surrounding an image, while the latter learns a ranker from human-labeled query-image pairs. Each paradigm has its own limitation. The vector model is sensitive to the quality of the text descriptions, and the learning paradigm is difficult to scale up because human labels are expensive to obtain. We demonstrate in this paper that both limitations can be mitigated by jointly exploring subspace learning and click-through data. Specifically, we propose a novel Ranking Canonical Correlation Analysis (RCCA) for learning query and image similarities. RCCA first finds a common subspace between the query and image views by maximizing their correlation, and then simultaneously learns a bilinear query-image similarity function and adjusts the subspace to preserve the preference relations implicit in the click-through data. Once the subspace is finalized, query-image similarity is computed by applying the bilinear similarity function to the mappings of the query and image in this subspace. On a large-scale click-based image dataset with 11.7 million queries and one million images, RCCA outperforms several state-of-the-art methods on both keyword-based and query-by-example image search tasks.
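As a rough illustration of the scoring step described in the abstract above, the sketch below projects a query vector and an image vector into a shared subspace and scores them with a bilinear form. It also includes a pairwise hinge loss that penalizes a clicked image scoring below a non-clicked one. This is a minimal sketch under stated assumptions, not the authors' formulation: the names `Wq`, `Wv`, `S`, `bilinear_similarity`, and `pairwise_hinge_loss` are hypothetical, the projections are random rather than learned by CCA, and the hinge loss is only one common way to encode a click preference.

```python
import numpy as np


def bilinear_similarity(q, v, Wq, Wv, S):
    """Score a query/image pair in a shared subspace.

    q, v   : raw query and image feature vectors (assumed inputs)
    Wq, Wv : projection matrices into the common subspace (hypothetical;
             in practice these would come from a CCA-style fit)
    S      : square matrix defining the bilinear form (hypothetical)
    """
    q_sub = q @ Wq  # query mapped into the subspace
    v_sub = v @ Wv  # image mapped into the subspace
    return float(q_sub @ S @ v_sub)


def pairwise_hinge_loss(q, v_clicked, v_other, Wq, Wv, S, margin=1.0):
    """Zero when the clicked image outscores the other one by at least `margin`."""
    s_pos = bilinear_similarity(q, v_clicked, Wq, Wv, S)
    s_neg = bilinear_similarity(q, v_other, Wq, Wv, S)
    return max(0.0, margin - (s_pos - s_neg))


# Toy data: random features and projections, for illustration only.
rng = np.random.default_rng(0)
dq, dv, d = 6, 8, 3                 # query dim, image dim, subspace dim
Wq = rng.standard_normal((dq, d))
Wv = rng.standard_normal((dv, d))
S = np.eye(d)                       # identity: plain dot product in the subspace
q = rng.standard_normal(dq)
v_a = rng.standard_normal(dv)       # stands in for a clicked image
v_b = rng.standard_normal(dv)       # stands in for a non-clicked image

loss = pairwise_hinge_loss(q, v_a, v_b, Wq, Wv, S)
```

With `S` set to the identity, the score reduces to a dot product of the two projected vectors; a learned `S` would let the model reweight and mix subspace dimensions when comparing a query with an image.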
Discipline
Data Storage Systems | Graphics and Human Computer Interfaces
Research Areas
Intelligent Systems and Optimization
Publication
Proceedings of the 15th IEEE International Conference on Computer Vision, ICCV 2015, Santiago, Chile, December 7-13
First Page
28
Last Page
36
ISBN
9781467383912
Identifier
10.1109/ICCV.2015.12
Publisher
Institute of Electrical and Electronics Engineers Inc.
City or Country
Santiago, Chile
Citation
YAO, Ting; MEI, Tao; and NGO, Chong-wah.
Learning query and image similarities with ranking canonical correlation analysis. (2015). Proceedings of the 15th IEEE International Conference on Computer Vision, ICCV 2015, Santiago, Chile, December 7-13. 28-36.
Available at: https://ink.library.smu.edu.sg/sis_research/6519
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.