Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
4-2015
Abstract
Social media have been popular not only for individuals to share contents, but also for organizations to engage users and spread information. Given the trait differences between personal and organization accounts, the ability to distinguish between the two account types is important for developing better search/recommendation engines, marketing strategies, and information dissemination platforms. However, such task is non-trivial and has not been well studied thus far. In this paper, we present a new generic framework for classifying personal and organization accounts, based upon which comprehensive and systematic investigation on a rich variety of content, social, and temporal features can be carried out. In addition to generic feature transformation pipelines, the framework features a gradient boosting classifier that is accurate/robust and facilitates good data understanding such as the importance of different features. We demonstrate the efficacy of our approach through extensive experiments on Twitter data from Singapore, by which we discover several discriminative content, social, and temporal features.
Keywords
Account type classification, Gradient boosting, Social media
Discipline
Computer Sciences | Social Media
Publication
Advances in Information Retrieval: 37th European Conference on IR Research, ECIR 2015, Vienna, Austria, March 29 - April 2, 2015: Proceedings
Volume
9022
First Page
465
Last Page
476
ISBN
9783319163543
Identifier
10.1007/978-3-319-16354-3_51
Publisher
Springer Verlag
City or Country
Cham
Citation
OENTARYO, Richard Jayadi; LOW, Jia-Wei; and LIM, Ee Peng.
Chalk and cheese in Twitter: Discriminating personal and organization accounts. (2015). Advances in Information Retrieval: 37th European Conference on IR Research, ECIR 2015, Vienna, Austria, March 29 - April 2, 2015: Proceedings. 9022, 465-476.
Available at: https://ink.library.smu.edu.sg/sis_research/2623
Copyright Owner and License
LARC
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
http://doi.org/10.1007/978-3-319-16354-3_51