Publication Type

Conference Paper

Version

publishedVersion

Publication Date

10-2012

Abstract

The Violent Scene Detection task offers a very practical challenge in detecting complex and diverse violent video clips in movies. In this working note paper, we will briefly describe our system and discuss the results, which achieved top performance in mAP@201 and runner-up in mAP@100, among all 35 submissions worldwide. The central component of our system is a set of features derived from the appearance and motion of local patch trajectories [2]. We use these features and SVM classifier as the baseline approach and add in a few other components to further improve the performance. Our findings indicate that the trajectory-based visual features already offer very competitive results. Other audio-visual features like SpatialTemporal Interest Points and MFCC do not significantly enhance the performance. In addition, smoothing detection scores of nearby shots leads to significant improvement. We conclude that—while audio feature may help marginally— good visual features are still the key factor in violent scene detection, and temporal information is very useful.

Keywords

Movie, Multi-modality, Temporal smoothing, Trajectory-based feature, Violent scene detection

Discipline

Databases and Information Systems | Graphics and Human Computer Interfaces

Research Areas

Intelligent Systems and Optimization

Publication

Multimedia Benchmark Workshop, MediaEval 2012, Pisa, Italy, October 4-5

Publisher

SpringerLink

City or Country

Pisa, Italy

Share

COinS