Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
9-2009
Abstract
The problem of polysemy in keyword-based image search arises mainly from the inherent ambiguity in user queries. We propose a latent-model-based approach that resolves user search ambiguity by allowing sense-specific diversity in search results. Given a query keyword and the images retrieved by issuing the query to an image search engine, we first learn a latent visual sense model of these polysemous images. Next, we use Wikipedia to disambiguate the word sense of the original query, and issue these Wiki-senses as new queries to retrieve sense-specific images. A sense-specific image classifier is then learnt by combining information from the latent visual sense model, and used to cluster and re-rank the polysemous images from the original query keyword into their specific senses. Results on a ground-truth set of 17K images returned by 10 keyword searches and their 62 word senses provide empirical indications that our method can improve upon existing keyword-based search engines. Our method learns the visual word sense models in a totally unsupervised manner, effectively filters out irrelevant images, and is able to mine the long tail of image search.
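The cluster-and-re-rank step described in the abstract can be illustrated with a small sketch. This is not the authors' latent model: it substitutes a plain nearest-centroid rule over cosine similarity, and it assumes every image has already been reduced to a numeric feature vector. The function name `rerank_by_sense`, the `min_sim` threshold, and the toy feature vectors are all hypothetical, introduced only for illustration.

```python
import math
from collections import defaultdict


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def centroid(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def rerank_by_sense(query_images, sense_images, min_sim=0.5):
    """Group the images from the ambiguous query by their closest sense.

    query_images: {image_id: feature_vector} for the original keyword query.
    sense_images: {sense: [feature_vector, ...]} for images retrieved with
                  sense-specific queries (the "Wiki-senses" in the abstract).
    min_sim:      hypothetical cut-off; images whose best similarity falls
                  below it are treated as irrelevant.

    Assumption: a nearest-centroid rule stands in for the paper's
    sense-specific classifier.
    """
    centroids = {sense: centroid(vecs) for sense, vecs in sense_images.items()}
    groups = defaultdict(list)
    rejected = []
    for image_id, features in query_images.items():
        best_sense, best_sim = max(
            ((sense, cosine(features, c)) for sense, c in centroids.items()),
            key=lambda pair: pair[1],
        )
        if best_sim < min_sim:
            rejected.append(image_id)
        else:
            groups[best_sense].append((best_sim, image_id))
    # Within each sense, list the most similar images first.
    ranked = {
        sense: [image_id for _, image_id in sorted(items, key=lambda p: -p[0])]
        for sense, items in groups.items()
    }
    return ranked, rejected


if __name__ == "__main__":
    # Toy 3-dimensional features for an ambiguous query such as "jaguar".
    senses = {
        "animal": [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0]],
        "car": [[0.0, 1.0, 0.0], [0.1, 0.9, 0.0]],
    }
    results = {
        "a1": [1.0, 0.05, 0.0],
        "a2": [0.8, 0.3, 0.0],
        "c1": [0.0, 1.0, 0.1],
        "junk": [0.0, 0.0, 1.0],
    }
    print(rerank_by_sense(results, senses))
```

In this toy setup the image orthogonal to every sense centroid is filtered out, while the remaining images are grouped under their closest sense and ordered by similarity. A real system would replace the centroid rule with the learnt classifier and the toy vectors with actual visual features.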
Discipline
Databases and Information Systems | Graphics and Human Computer Interfaces
Research Areas
Data Science and Engineering
Publication
Proceedings of the 2009 British Machine Vision Conference, BMVC 2009, London, UK, September 7-10
First Page
1
Last Page
9
Identifier
10.5244/C.23.67
City or Country
London, United Kingdom
Citation
WAN, Kong-Wah; TAN, Ah-hwee; LIM, Joo-Hwee; CHIA, Liang-Tien; and ROY, Sujoy.
A latent model for visual disambiguation of keyword-based image search. (2009). Proceedings of the 2009 British Machine Vision Conference, BMVC 2009, London, UK, September 7-10. 1-9.
Available at: https://ink.library.smu.edu.sg/sis_research/6745
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Included in
Databases and Information Systems Commons, Graphics and Human Computer Interfaces Commons