Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
7-2009
Abstract
The massive amount of near-duplicate and duplicate web videos has presented both challenge and opportunity to multimedia computing. On one hand, browsing videos on Internet becomes highly inefficient for the need to repeatedly fast-forward videos of similar content. On the other hand, the tremendous amount of somewhat duplicate content also makes some traditionally difficult vision tasks become simple and easy. For example, annotating pictures can be as simple as recycling the tags of Internet images retrieved from image search engines. Such tasks, of either to eliminate or to recycle near-duplicates, can usually be achieved by the nearest neighbor search of videos from Internet. The fundamental problem lies on the scalability of a search technique, in face of the intractable volume of videos which keep rolling on the web. In this paper, we investigate scalability of several well-known features including color signature and visual keywords for web-based retrieval. Indexing these features based on embedding technique for scalable retrieval is also presented. On an Internet video dataset of more than 700 hours collected during years 2006 to 2008, we show some preliminary insights to the challenge of scalable retrieval.
Discipline
Data Storage Systems | Graphics and Human Computer Interfaces
Research Areas
Intelligent Systems and Optimization
Publication
Proceedings of 2009 IEEE International Conference on Multimedia and Expo, ICME 2009, New York, June 28 - July 3
First Page
1624
Last Page
1627
ISBN
9781424442911
Identifier
10.1109/ICME.2009.5202830
Publisher
IEEE
City or Country
New York
Citation
ZHAO, Wan-Lei; TAN, Song; and NGO, Chong-wah.
Large-scale near-duplicate web video search: Challenge and opportunity. (2009). Proceedings of 2009 IEEE International Conference on Multimedia and Expo, ICME 2009, New York, June 28 - July 3. 1624-1627.
Available at: https://ink.library.smu.edu.sg/sis_research/6641
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.