Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
10-2006
Abstract
This paper proposes a new approach for the similarity measure and ranking of audio clips by graph modeling and matching. Instead of using frame-based or salient-based features to measure the acoustical similarity of audio clips, segment-based similarity is proposed. The novelty of our approach lies in two aspects: segment-based representation, and the similarity measure and ranking based on four kinds of similarity factors. In segmentbased representation, segments not only capture the change property of audio clip, but also keep and present the change relation and temporal order of audio features. In the similarity measure and ranking, four kinds of similarity factors: acoustical, granularity, temporal order and interference are progressively and jointly measured by optimal matching and dynamic programming, which guarantee the comprehensive and sufficient similarity measure between two audio clips. The experimental result shows that the proposed approach is better than some existing methods in terms of retrieval and ranking capabilities.
Keywords
Audio retrieval, Audio similarity measure
Discipline
Databases and Information Systems | Theory and Algorithms
Research Areas
Intelligent Systems and Optimization
Publication
Proceedings of the 14th ACM international conference on Multimedia, MM 2006, Santa Barbara, California, October 23-27
First Page
603
Last Page
606
ISBN
9781595934475
Identifier
10.1145/1180639.1180763
Publisher
ACM
City or Country
Santa Barbara, California
Citation
PENG, Yuxin; NGO, Chong-wah; FANG, Cuihua; CHEN, Xiaoou; and XIAO, Jianguo.
Audio similarity measure by graph modeling and matching. (2006). Proceedings of the 14th ACM international conference on Multimedia, MM 2006, Santa Barbara, California, October 23-27. 603-606.
Available at: https://ink.library.smu.edu.sg/sis_research/6493
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.