Collaborative Topic Regression (CTR) combines ideas of probabilistic matrix factorization (PMF) and topic modeling (such as LDA) for recommender systems, which has gained increasing success in many applications. Despite enjoying many advantages, the existing Batch Decoupled Inference algorithm for the CTR model has some critical limitations: First of all, it is designed to work in a batch learning manner, making it unsuitable to deal with streaming data or big data in real-world recommender systems. Secondly, in the existing algorithm, the item-specific topic proportions of LDA are fed to the downstream PMF but the rating information is not exploited in discovering the low-dimensional representation of documents and this can result in a sub-optimal representation for prediction. In this paper, we propose a novel inference algorithm, called the Online Bayesian Inference algorithm for CTR model, which is efficient and scalable for learning from data streams. Furthermore, we jointly optimize the combined objective function of both PMF and LDA in an online learning fashion, in which both PMF and LDA tasks can reinforce each other during the online learning process. Our encouraging experimental results on real-world data validate the effectiveness of the proposed method.
Topic modeling, Online learning, Recommender systems, Collaborative filtering
Databases and Information Systems
Data Management and Analytics
Springer Verlag (Germany)
LIU, Chenghao; JIN, Tao; HOI, Steven C. H.; ZHAO, Peilin; and SUN, Jianling.
Collaborative topic regression for online recommender systems: an online and Bayesian approach. (2017). Machine Learning. 106, (5), 651-670. Research Collection School Of Information Systems.
Available at: http://ink.library.smu.edu.sg/sis_research/3703
Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License.