J Shanghai Jiaotong Univ Sci ›› 2024, Vol. 29 ›› Issue (3): 557-565.doi: 10.1007/s12204-022-2543-1

• • 上一篇    下一篇

不平衡图多尺度融合节点分类方法

张静克1,何新林2,戚宗锋1,马 超2,李建勋2   

  1. (1.电子信息系统复杂电磁环境效应国家重点实验室,河南 洛阳471003;2. 上海交通大学 电子信息与电气工程学院,上海200240)
  • 收稿日期:2021-08-26 接受日期:2021-09-10 出版日期:2024-05-28 发布日期:2024-05-28

Unbalanced Graph Multi-Scale Fusion Node Classification Method

ZHANG Jingke1 (张静克), HE Xinlin2 (何新林), QI Zongfeng1 (戚宗锋), MA Chao2 (马 超), LI Jianxun2 (李建勋)   

  1. (1. State Key Laboratory of CEMEE, Luoyang 471003, Henan, China; 2. School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China)
  • Received:2021-08-26 Accepted:2021-09-10 Online:2024-05-28 Published:2024-05-28

摘要: 图是一种描述事物间复杂关系的数据结构。基于图网络的节点分类方法在实际应用中有重要作用。现有的图节点分类方法均没有考虑节点标签分布不均衡的情况。本文针对不平衡图节点分布设计了有向图上的图卷积算法,实现基于多尺度融合图卷积网络的节点分类。该方法根据数据集上标签的不平衡性为每个类别设计了不同的传播深度,基于类别传播深度和图邻接矩阵在图卷积网络各层设计了不同的聚合函数,相对地扩大正样本的信息传播范围,从而提高不平衡图节点分类的准确性。最后在公开文本分类数据集上验证了算法的有效性。

关键词: 节点分类,不平衡学习,文本分类

Abstract: Graphs are used as a data structure to describe complex relationships between things. The node classification method based on graph network plays an important role in practical applications. None of the existing graph node classification methods consider the uneven distribution of node labels. In this paper, a graph convolution algorithm on a directed graph is designed for the distribution of unbalanced graph nodes to realize node classification based on multi-scale fusion graph convolution network. This method designs different propagation depths for each class according to the unbalance ratio on the data set, and different aggregation functions are designed at each layer of the graph convolutional network based on the class propagation depth and the graph adjacency matrix. The scope of information dissemination of positive samples is expanded relatively, thereby improving the accuracy of classification of unbalanced graph nodes. Finally, the effectiveness of the algorithm is verified through experiments on the public text classification datasets.

Key words: node classification, unbalanced learning, text classification

中图分类号: