Journal of Shanghai Jiao Tong University ›› 2021, Vol. 55 ›› Issue (5): 586-597. doi: 10.16183/j.cnki.jsjtu.2020.187

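Special Issue: Journal of Shanghai Jiao Tong University, compilation of special topics from the 12 issues of 2021; Journal of Shanghai Jiao Tong University, 2021 special topic on "Automation Technology, Computer Technology"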


A Heterogeneous Network Representation Method Based on Variational Inference and Meta-Path Decomposition

YUAN Ming, LIU Qun, SUN Haichao, TAN Hongsheng

  1. College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
  • Received: 2020-06-18 Online: 2021-05-28 Published: 2021-06-01
  • Contact: LIU Qun E-mail: liuqun@cqupt.edu.cn

Abstract:

Traditional meta-path random walks in heterogeneous network representation cannot accurately describe the heterogeneous network structure and fail to capture the true distribution of network nodes. To address this problem, a heterogeneous network representation method based on variational inference and meta-path decomposition, named HetVAE, is proposed. First, drawing on the idea of path similarity, a node selection strategy is designed to improve the meta-path random walk. Next, variational theory is introduced to effectively sample the latent variables in the original distribution. A personalized attention mechanism is then implemented to weight the node vector representations of the different sub-networks obtained by decomposition, and the proposed model fuses these node vectors so that the final node vector representation carries richer semantic information. Finally, experiments on several network tasks are performed on three real-world data sets: DBLP, AMiner, and Yelp. The results verify the effectiveness of the model. In node classification and node clustering tasks, compared with some state-of-the-art algorithms, Micro-F1 and normalized mutual information (NMI) increase by 1.12% to 4.36% and 1.35% to 18%, respectively. These results indicate that HetVAE can effectively capture the heterogeneous network structure and learn node vector representations that conform more closely to the true distribution.
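
For readers unfamiliar with the baseline that the abstract sets out to improve, a plain meta-path-guided random walk can be sketched as follows. This is only an illustrative sketch of the conventional uniform walk, not the path-similarity node selection strategy of HetVAE, which the abstract does not specify. The toy author-paper graph and the names `ADJ`, `NODE_TYPE`, and `metapath_walk` are hypothetical and chosen for illustration.

```python
import random

# Hypothetical toy heterogeneous graph: authors (A) linked to papers (P).
ADJ = {
    "a1": ["p1"],
    "a2": ["p1", "p2"],
    "a3": ["p2"],
    "p1": ["a1", "a2"],
    "p2": ["a2", "a3"],
}
NODE_TYPE = {"a1": "A", "a2": "A", "a3": "A", "p1": "P", "p2": "P"}


def metapath_walk(adj, node_type, start, metapath, walk_length, rng):
    """Uniform meta-path-guided random walk (conventional baseline).

    `metapath` is a symmetric type sequence such as ["A", "P", "A"];
    it is repeated cyclically until `walk_length` nodes are collected
    or no neighbor of the required type exists.
    """
    if metapath[0] != metapath[-1]:
        raise ValueError("meta-path must start and end with the same type")
    if node_type[start] != metapath[0]:
        raise ValueError("start node type must match the meta-path head")

    pattern = metapath[:-1]  # one period of the cyclic type pattern
    walk = [start]
    while len(walk) < walk_length:
        wanted = pattern[len(walk) % len(pattern)]
        # Keep only neighbors whose type matches the next meta-path step.
        candidates = [n for n in adj[walk[-1]] if node_type[n] == wanted]
        if not candidates:
            break  # dead end: no neighbor of the required type
        walk.append(rng.choice(candidates))  # uniform choice among candidates
    return walk


if __name__ == "__main__":
    rng = random.Random(0)
    print(metapath_walk(ADJ, NODE_TYPE, "a1", ["A", "P", "A"], 7, rng))
```

In this sketch every admissible neighbor is chosen with equal probability; the method described in the abstract replaces that uniform choice with a selection strategy informed by path similarity.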

Key words: heterogeneous network, network representation, variational autoencoder, random walk, attention mechanism
