Publication Type

Journal Article

Version

publishedVersion

Publication Date

12-2013

Abstract

Social media data are produced continuously by a large and uncontrolled number of users. The dynamic nature of such data requires the sentiment and topic analysis model to be also dynamically updated, capturing the most recent language use of sentiments and topics in text. We propose a dynamic Joint Sentiment-Topic model (dJST) which allows the detection and tracking of views of current and recurrent interests and shifts in topic and sentiment. Both topic and sentiment dynamics are captured by assuming that the current sentiment-topic-specific word distributions are generated according to the word distributions at previous epochs. We study three different ways of accounting for such dependency information: (1) sliding window where the current sentiment-topic word distributions are dependent on the previous sentiment-topic-specific word distributions in the last S epochs; (2) skip model where history sentiment topic word distributions are considered by skipping some epochs in between; and (3) multiscale model where previous long- and short- timescale distributions are taken into consideration. We derive efficient online inference procedures to sequentially update the model with newly arrived data and show the effectiveness of our proposed model on the Mozilla add-on reviews crawled between 2007 and 2011.

Discipline

Databases and Information Systems

Research Areas

Data Science and Engineering

Publication

ACM Transactions on Intelligent Systems and Technology

Volume

5

Issue

1

First Page

1

Last Page

21

ISSN

2157-6904

Identifier

10.1145/2542182.2542188

Publisher

Association for Computing Machinery (ACM)

Additional URL

https://doi.org/10.1145/2542182.2542188

Share

COinS