上海交通大学学报(英文版) ›› 2015, Vol. 20 ›› Issue (5): 535-539.doi: 10.1007/s12204-015-1618-7

• • 上一篇    下一篇

A Bandit Method Using Probabilistic Matrix Factorization in Recommendation

TU Shi-tao* (涂世涛), ZHU Lan-juan (朱兰娟)   

  1. (Key Laboratory of System Control and Information Processing of Ministry of Education; Department of Automation, Shanghai Jiaotong University, Shanghai 200240, China)
  • 出版日期:2015-10-28 发布日期:2015-10-29
  • 通讯作者: TU Shi-tao (涂世涛) E-mail: tushitao@126.com

A Bandit Method Using Probabilistic Matrix Factorization in Recommendation

TU Shi-tao* (涂世涛), ZHU Lan-juan (朱兰娟)   

  1. (Key Laboratory of System Control and Information Processing of Ministry of Education; Department of Automation, Shanghai Jiaotong University, Shanghai 200240, China)
  • Online:2015-10-28 Published:2015-10-29
  • Contact: TU Shi-tao (涂世涛) E-mail: tushitao@126.com

摘要: In recommendation system, sparse data and cold-start user have always been a challenging problem. Using a linear upper confidence bound (UCB) bandit approach as the item selection strategy based on the user historical ratings and user-item context, we model the recommendation problem as a multi-arm bandit (MAB) problem in this paper. Enabling the engine to recommend while it learns, we adopt probabilistic matrix factorization (PMF) in this strategy learning phase after observing the payoff. In particular, we propose a new approach to get the upper bound statistics out of latent feature matrix. In the experiment, we use two public datasets (Netfilx and MovieLens) to evaluate our proposed model. The model shows good results especially on cold-start users.

关键词: recommend, matrix factorization, bandit

Abstract: In recommendation system, sparse data and cold-start user have always been a challenging problem. Using a linear upper confidence bound (UCB) bandit approach as the item selection strategy based on the user historical ratings and user-item context, we model the recommendation problem as a multi-arm bandit (MAB) problem in this paper. Enabling the engine to recommend while it learns, we adopt probabilistic matrix factorization (PMF) in this strategy learning phase after observing the payoff. In particular, we propose a new approach to get the upper bound statistics out of latent feature matrix. In the experiment, we use two public datasets (Netfilx and MovieLens) to evaluate our proposed model. The model shows good results especially on cold-start users.

Key words: recommend, matrix factorization, bandit

中图分类号: