Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
4-2011
Abstract
Twitter as a new form of social media can potentially contain much useful information, but content analysis on Twitter has not been well studied. In particular, it is not clear whether as an information source Twitter can be simply regarded as a faster news feed that covers mostly the same information as traditional news media. In This paper we empirically compare the content of Twitter with a traditional news medium, New York Times, using unsupervised topic modeling. We use a Twitter-LDA model to discover topics from a representative sample of the entire Twitter. We then use text mining techniques to compare these Twitter topics with topics from New York Times, taking into consideration topic categories and types. We also study the relation between the proportions of opinionated tweets and retweets and topic categories and types. Our comparisons show interesting and useful findings for downstream IR or DM applications.
Keywords
Twitter, microblogging, topic modeling
Discipline
Databases and Information Systems | Numerical Analysis and Scientific Computing | Social Media
Research Areas
Data Science and Engineering
Publication
Advances in Information Retrieval: 33rd European Conference on IR Research, ECIR 2011, Dublin, Ireland, April 18-21: Proceedings
First Page
338
Last Page
349
ISBN
9783642201608
Identifier
10.1007/978-3-642-20161-5_34
Publisher
Springer
City or Country
Cham
Citation
Zhao, Wayne X., Jiang Jing, Weng Jianshu, He Jing, Lim Ee-Peng, Yan Hongfei and Li Xiaoming. 2011. Comparing Twitter and Traditional Media Using Topic Models. In Advances in Information Retrieval: 33rd European Conference on IR Research, ECIR 2011, Dublin, Ireland, April 18-21. Proceedings. Cham: Springer.
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1007/978-3-642-20161-5_34
Included in
Databases and Information Systems Commons, Numerical Analysis and Scientific Computing Commons, Social Media Commons