SMU Research Data

Twitter-LDA

Wayne Xin ZHAO, Peking University
Jing JIANG, Singapore Management UniversityFollow
Jianshu WENG, Peking University
Jing HE, Peking University
Ee Peng LIM, Singapore Management UniversityFollow
Hongfei YAN, Peking University
Xiaoming LI, Peking University

Publication Type

Data Set

Year

4-2011

Research Area

Data Management and Analytics

School/Department

School of Information Systems

Description/Abstract

Latent Dirichlet Allocation (LDA) has been widely used in textual analysis. The original LDA is used to find hidden "topics" in the documents, where a topic is a subject like "arts" or "education" that is discussed in the documents. The original setting in LDA, where each word has a topic label, may not work well with Twitter as tweets are short and a single tweet is more likely to talk about one topic. Hence, Twitter-LDA (T-LDA) has been proposed to address this issue. T-LDA also addresses the noisy nature of tweets, where it captures background words in tweets. As experiments in [7] have shown that T-LDA could capture more meaningful topics than LDA in Microblogs.

The original setting in Latent Dirichlet Allocation (LDA), where each word has a topic label, may not work well with Twitter as tweets are short and a single tweet is more likely to talk about one topic. Hence, Twitter-LDA (T-LDA) has been proposed to address this issue. T-LDA also addresses the noisy nature of tweets, where it captures background words in tweets.

Related Publication

Zhao, W. X., Jiang, J., Weng, J., He, J., Lim, E. P., Yan, H., & Li, X. (2011). Comparing twitter and traditional media using topic models. In Advances in Information Retrieval (pp. 338-349). http://dx.doi.org/10.1007/978-3-642-20161-5_34

Disciplines

Computer Sciences | Databases and Information Systems

Citation

Zhao, W. X., Jiang, J., Weng, J., He, J., Lim, E. P., Yan, H., & Li, X. (2011). Twitter-LDA [data set]. Available in Github: https://github.com/minghui/Twitter-LDA

Link to Full Text

COinS

SMU Research Data

Twitter-LDA

Publication Type

Year

Research Area

School/Department

Description/Abstract

Related Publication

Disciplines

Citation

Search

Links

Browse

Links

SMU Research Data

Twitter-LDA

Authors

Publication Type

Year

Research Area

School/Department

Description/Abstract

Related Publication

Disciplines

Citation

Share

Search

Links

Browse

Links