Journal of shanghai Jiaotong University (Science) ›› 2015, Vol. 20 ›› Issue (5): 535-539.doi: 10.1007/s12204-015-1618-7

Previous Articles     Next Articles

A Bandit Method Using Probabilistic Matrix Factorization in Recommendation

A Bandit Method Using Probabilistic Matrix Factorization in Recommendation

TU Shi-tao* (涂世涛), ZHU Lan-juan (朱兰娟)   

  1. (Key Laboratory of System Control and Information Processing of Ministry of Education; Department of Automation, Shanghai Jiaotong University, Shanghai 200240, China)
  2. (Key Laboratory of System Control and Information Processing of Ministry of Education; Department of Automation, Shanghai Jiaotong University, Shanghai 200240, China)
  • Online:2015-10-28 Published:2015-10-29
  • Contact: TU Shi-tao (涂世涛) E-mail: tushitao@126.com

Abstract: In recommendation system, sparse data and cold-start user have always been a challenging problem. Using a linear upper confidence bound (UCB) bandit approach as the item selection strategy based on the user historical ratings and user-item context, we model the recommendation problem as a multi-arm bandit (MAB) problem in this paper. Enabling the engine to recommend while it learns, we adopt probabilistic matrix factorization (PMF) in this strategy learning phase after observing the payoff. In particular, we propose a new approach to get the upper bound statistics out of latent feature matrix. In the experiment, we use two public datasets (Netfilx and MovieLens) to evaluate our proposed model. The model shows good results especially on cold-start users.

Key words: recommend| matrix factorization| bandit

摘要: In recommendation system, sparse data and cold-start user have always been a challenging problem. Using a linear upper confidence bound (UCB) bandit approach as the item selection strategy based on the user historical ratings and user-item context, we model the recommendation problem as a multi-arm bandit (MAB) problem in this paper. Enabling the engine to recommend while it learns, we adopt probabilistic matrix factorization (PMF) in this strategy learning phase after observing the payoff. In particular, we propose a new approach to get the upper bound statistics out of latent feature matrix. In the experiment, we use two public datasets (Netfilx and MovieLens) to evaluate our proposed model. The model shows good results especially on cold-start users.

关键词: recommend| matrix factorization| bandit

CLC Number: