Publication Type

Conference Proceeding Article

Version

publishedVersion

Publication Date

10-2004

Abstract

This paper investigates the issues in polyphonic popular song retrieval. The problems that we consider include singing voice extraction, melodic curve representation, and database indexing. Initially, polyphonic songs are decomposed into singing voices and instruments sounds in both time and frequency domains based on SVM and ICA. The extracted singing voices are represented as two melodic curves that model the statistical mean and neighborhood similarity of notes. To speed up the matching between songs and query, we further adopt proportional transportation distance to index the songs as vantage point trees. Encouraging results have been obtained through experiments.

Keywords

ICA, Melodic Curve, Proportional Transportation Distance

Discipline

Graphics and Human Computer Interfaces | Theory and Algorithms

Research Areas

Intelligent Systems and Optimization

Publication

Proceedings of the 12th ACM International Conference on Multimedia, New York, 2004 October 10-16

First Page

308

Last Page

311

ISBN

9781581138931

Identifier

10.1145/1027527.1027598

Publisher

ACM

City or Country

New York

Share

COinS