Publication Type
Journal Article
Version
publishedVersion
Publication Date
12-2015
Abstract
While content-based landmark image search has recently received a lot of attention and became a very active domain, it still remains a challenging problem. Among the various reasons, high diverse visual content is the most significant one. It is common that for the same landmark, images with a wide range of visual appearances can be found from different sources and different landmarks may share very similar sets of images. As a consequence, it is very hard to accurately estimate the similarities between the landmarks purely based on single type of visual feature. Moreover, the relationships between landmark images can be very complex and how to develop an effective modeling scheme to characterize the associations still remains an open question. Motivated by these concerns, we propose multimodal hypergraph (MMHG) to characterize the complex associations between landmark images. In MMHG, images are modeled as independent vertices and hyperedges contain several vertices corresponding to particular views. Multiple hypergraphs are firstly constructed independently based on different visual modalities to describe the hidden high-order relations from different aspects. Then, they are integrated together to involve discriminative information from heterogeneous sources. We also propose a novel content-based visual landmark search system based on MMHG to facilitate effective search. Distinguished from the existing approaches, we design a unified computational module to support query-specific combination weight learning. An extensive experiment study on a large-scale test collection demonstrates the effectiveness of our scheme over state-of-the-art approaches.
Keywords
Content-based visual landmark search, high-order relations, multimodal hypergraph (MMHG), visual diversity
Discipline
Computer Sciences | Databases and Information Systems
Publication
IEEE Transactions on Cybernetics
Volume
45
Issue
12
First Page
2756
Last Page
2769
ISSN
2168-2267
Identifier
10.1109/TCYB.2014.2383389
Publisher
IEEE
Citation
ZHU, Lei; SHEN, Jialie; JIN, Hai; ZHENG, Ran; and XIE, Liang.
Content-Based Visual Landmark Search via Multimodal Hypergraph Learning. (2015). IEEE Transactions on Cybernetics. 45, (12), 2756-2769.
Available at: https://ink.library.smu.edu.sg/sis_research/2465
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
http://doi.org/10.1109/TCYB.2014.2383389
Comments
Formerly IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics