Publication Type

Journal Article

Version

publishedVersion

Publication Date

12-2009

Abstract

This paper presents an efficient and effective solution for retrieving image near-duplicate (IND) from image database. We introduce the coherent phrase model which incorporates the coherency of local regions to reduce the quantization error of the bag-of-words (BoW) model. In this model, local regions are characterized by visual phrase of multiple descriptors instead of visual word of single descriptor. We propose two types of visual phrase to encode the coherency in feature and spatial domain, respectively. The proposed model reduces the number of false matches by using this coherency and generates sparse representations of images. Compared to other method, the local coherencies among multiple descriptors of every region improve the performance and preserve the efficiency for IND retrieval. The proposed method is evaluated on several benchmark datasets for IND retrieval. Compared to the state-of-the-art methods, our proposed model has been shown to significantly improve the accuracy of IND retrieval while maintaining the efficiency of the standard bag-of-words model. The proposed method can be integrated with other extensions of BoW.

Keywords

Bag-of-word (BoW), image near-duplicate (IND), quantization, retrieval, TRECVID

Discipline

Computer Engineering | Databases and Information Systems | Software Engineering

Research Areas

Data Science and Engineering

Publication

IEEE Transactions on Multimedia

Volume

11

Issue

8

First Page

1434

Last Page

1445

ISSN

1520-9210

Identifier

10.1109/TMM.2009.2032676

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Additional URL

https://doi.org/10.1109/TMM.2009.2032676

Share

COinS