Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
10-2009
Abstract
Following the exponential growth of social media, there now exist huge repositories of videos online. Among the huge volumes of videos, there exist large numbers of near-duplicate videos. Most existing techniques either focus on the fast retrieval of full copies or near-duplicates, or consider localization in a heuristic manner. This paper considers the scalable detection and localization of partial near-duplicate videos by jointly considering visual similarity and temporal consistency. Temporal constraints are embedded into a network structure as directed edges. Through the structure, partial alignment is novelly converted into a network flow problem where highly efficient solutions exist. To precisely decide the boundaries of the overlapping segments, pair-wise constraints generated from keypoint matching can be added to the network to iteratively refine the localization result. We demonstrate the effectiveness of partial alignment for three different tasks. The first task links partial segments in fulllength movies to videos crawled from YouTube. The second task performs fast web video search, while the third performs near-duplicate shot and copy detection. The experimental result demonstrates the effectiveness and efficiency of the proposed method compared to state-of-the-art techniques.
Keywords
Network flow, Partial near-duplicate, Temporal graph
Discipline
Databases and Information Systems | Data Storage Systems | Graphics and Human Computer Interfaces
Research Areas
Intelligent Systems and Optimization
Publication
Proceedings of the 17th ACM international conference on Multimedia, MM'09, Beijing, China, October 19-24
First Page
145
Last Page
154
ISBN
9781605586083
Identifier
10.1145/1631272.1631295
Publisher
ACM
City or Country
Beijing, China
Citation
TAN, Hung-Khoon; NGO, Chong-wah; HONG, Richang; and CHUA, Tat-Seng.
Scalable detection of partial near-duplicate videos by visual-temporal consistency. (2009). Proceedings of the 17th ACM international conference on Multimedia, MM'09, Beijing, China, October 19-24. 145-154.
Available at: https://ink.library.smu.edu.sg/sis_research/6530
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Included in
Databases and Information Systems Commons, Data Storage Systems Commons, Graphics and Human Computer Interfaces Commons