Conference Proceeding Article
Topic modeling has traditionally been studied for single text collections and applied to social media data represented in the form of text documents. With the emergence of many social media platforms, users find themselves using different social media for posting content and for social interaction. While many topics may be shared across social media platforms, users typically show preferences of certain social media platform(s) over others for certain topics. Such platform preferences may even be found at the individual level. To model social media topics as well as platform preferences of users, we propose a new topic model known as MultiPlatform-LDA (MultiLDA). Instead of just merging all posts from different social media platforms into a single text collection, MultiLDA keeps one text collection for each social media platform but allowing these platforms to share a common set of topics. MultiLDA further learns the user-specific platform preferences for each topic. We evaluate MultiLDA against TwitterLDA, the state-of-the-art method for social media content modeling, on two aspects: (i) the effectiveness in modeling topics across social media platforms, and (ii) the ability to predict platform choices for each post. We conduct experiments on three real-world datasets from Twitter, Instagram and Tumblr sharing a set of common users. Our experiments results show that the MultiLDA outperforms in both topic modeling and platform choice prediction tasks. We also show empirically that among the three social media platforms, "Daily matters" and "Relationship matters" are dominant topics in Twitter, "Social gathering", "Outing" and "Fashion" are dominant topics in Instagram, and "Music", "Entertainment" and "Fashion" are dominant topics in Tumblr.
user preference, multiple social networks, topic modeling
Databases and Information Systems | Social Media | Theory and Algorithms
Data Management and Analytics
WWW '17 Proceedings of the 26th International Conference on World Wide Web
City or Country
New York, USA
LEE, Roy Ka Wei; HOANG, Tuan Anh; and Ee-peng LIM.
On analyzing user topic-specific platform preferences across multiple social media sites. (2017). WWW '17 Proceedings of the 26th International Conference on World Wide Web. 1351-1359. Research Collection School Of Information Systems.
Available at: http://ink.library.smu.edu.sg/sis_research/3651
Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License.