Publication Type

Journal Article

Version

acceptedVersion

Publication Date

10-2015

Abstract

The Internet has brought about fundamental changes in the way peoples generate and exchange media information. Over the last decade, unsolicited message images (image spams) have become one of the most serious problems for Internet service providers (ISPs), business firms and general end users. In this paper, we report a novel system called RoBoTs (Robust BoosTrap based spam detector) to support accurate and robust image spam filtering. The system is developed based on multiple visual properties extracted from different levels of granularity, aiming to capture more discriminative contents for effective spam image identification. In addition, a resampling based learning framework is developed to effectively integrate random forest and linear discriminative analysis (LDA) to generate comprehensive signature of spam images. It can facilitate more accurate and robust spam classification process with very limited amount of initial training examples. Using three public available test collections, the proposed system is empirically compared with the state-of-the-art techniques. Our results demonstrate its significantly higher performance from different perspectives.

Keywords

Algorithm, Security, Experimentation, Spam

Discipline

Databases and Information Systems | Information Security

Publication

Pattern Recognition

Volume

48

Issue

10

First Page

3227

Last Page

3238

ISSN

0031-3203

Identifier

10.1016/j.patcog.2015.02.027

Publisher

Elsevier

Copyright Owner and License

Authors

Additional URL

https://doi.org/10.1016/j.patcog.2015.02.027

Share

COinS