Publication Type

Journal Article

Version

submittedVersion

Publication Date

1-2011

Abstract

Near-duplicate keyframe (NDK) retrieval techniques are critical to many real-world multimedia applications. Over the last few years, we have witnessed a surge of attention on studying near-duplicate image/keyframe retrieval in multimedia community. To facilitate an effective approach to NDK retrieval on large-scale data, we suggest an effective Multi-Level Ranking (MLR) scheme that effectively retrieves NDKs in a coarse-to-fine manner. One key stage of the MLR ranking scheme is how to learn an effective ranking function with extremely small training examples in a near-duplicate detection task. To attack this challenge, we employ a semi-supervised learning method, semi-supervised support vector machines, which is able to significantly improve the retrieval performance by exploiting unlabeled data. Another key stage of the MLR scheme is to perform a fine matching among a subset of keyframe candidates retrieved from the previous coarse ranking stage. In contrast to previous approaches based on either simple heuristics or rigid matching models, we propose a novel Nonrigid Image Matching (NIM) approach to tackle near-duplicate keyframe retrieval from real-world video corpora in order to conduct an effective fine matching. Compared with the conventional methods, the proposed NIM approach can recover explicit mapping between two near-duplicate images with a few deformation parameters and find out the correct correspondences from noisy data simultaneously. To evaluate the effectiveness of our proposed approach, we performed extensive experiments on two benchmark testbeds extracted from the TRECVID2003 and TRECVID2004 corpora. The promising results indicate that our proposed method is more effective than other state-of-the-art approaches for near-duplicate keyframe retrieval.

Keywords

Image copy detection, Near-duplicate keyframe, nonrigid image matching, semi-supervised learning

Discipline

Computer Sciences | Databases and Information Systems

Research Areas

Data Science and Engineering

Publication

ACM Transactions on Multimedia Computing, Communications, and Applications

Volume

7

Issue

1

First Page

4-1

Last Page

24

ISSN

1551-6857

Identifier

10.1145/1870121.1870125

Publisher

ACM

Copyright Owner and License

Authors

Additional URL

https://doi.org/10.1145/1870121.1870125

Share

COinS