Publication Type

Conference Proceeding Article

Version

publishedVersion

Publication Date

4-2007

Abstract

One critical task in content-based video retrieval is to rank search results with combinations of multimodal resources effectively. This paper proposes a novel multimodal and multilevel ranking framework for content-based video retrieval. The main idea of our approach is to represent videos by graphs and learn harmonic ranking functions through fusing multimodal resources over these graphs smoothly. We further tackle the efficiency issue by a multilevel learning scheme, which makes the semi-supervised ranking method practical for large-scale applications. Our empirical evaluations on TRECVID 2005 dataset show that the proposed multimodal and multilevel ranking framework is effective and promising for content-based video retrieval.

Keywords

video retrieval, multimodal fusion, multilevel ranking, semi-supervised learning, performance evaluation

Discipline

Databases and Information Systems

Research Areas

Data Science and Engineering

Publication

IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2007: Honolulu, Hawaii, 15-20 April 2007: Proceedings

First Page

1225

Last Page

1228

ISBN

9781424407286

Identifier

10.1109/ICASSP.2007.367297

Publisher

IEEE

City or Country

Piscataway, NJ

Additional URL

https://doi.org/10.1109/ICASSP.2007.367297

Share

COinS