Publication Type

Conference Proceeding Article

Publication Date

4-2015

Abstract

Social media have been popular not only for individuals to share contents, but also for organizations to engage users and spread information. Given the trait differences between personal and organization accounts, the ability to distinguish between the two account types is important for developing better search/recommendation engines, marketing strategies, and information dissemination platforms. However, such task is non-trivial and has not been well studied thus far. In this paper, we present a new generic framework for classifying personal and organization accounts, based upon which comprehensive and systematic investigation on a rich variety of content, social, and temporal features can be carried out. In addition to generic feature transformation pipelines, the framework features a gradient boosting classifier that is accurate/robust and facilitates good data understanding such as the importance of different features. We demonstrate the efficacy of our approach through extensive experiments on Twitter data from Singapore, by which we discover several discriminative content, social, and temporal features.

Keywords

account type classification, gradient boosting, social media

Discipline

Computer Sciences | Social Media

Research Areas

Data Management and Analytics

Publication

Advances in Information Retrieval: 37th European Conference on IR Research, ECIR 2015, Vienna, Austria, March 29 - April 2, 2015: Proceedings

Volume

9022

First Page

465

Last Page

476

ISBN

9783319163543

Identifier

10.1007/978-3-319-16354-3_51

Publisher

Springer Verlag

City or Country

Cham

Additional URL

http://dx.doi.org/10.1007/978-3-319-16354-3_51

Share

COinS