J Shanghai Jiaotong Univ Sci ›› 2024, Vol. 29 ›› Issue (1): 120-130.doi: 10.1007/s12204-022-2537-z

• Medicine-Engineering Interdisciplinary Research • Previous Articles     Next Articles

Weighted Heterogeneous Graph-Based Incremental Automatic Disease Diagnosis Method

基于加权异构图谱的增量式疾病自动诊断方法

TIAN Yuanyuan (田圆圆), JIN Yanrui (金衍瑞), LI Zhiyuan (李志远), LIU Jinlei (刘金磊), LIU Chengliang (刘成良)   

  1. (State Key Laboratory of Mechanical System and Vibration, School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China)
  2. (上海交通大学 机械系统与振动国家重点实验室,上海200240)
  • Accepted:2022-08-30 Online:2024-01-28 Published:2024-01-24

Abstract: The objective of this study is to construct a multi-department symptom-based automatic diagnosis model. However, it is difficult to establish a model to classify plenty of diseases and collect thousands of diseasesymptom datasets simultaneously. Inspired by the thought of “knowledge graph is model”, this study proposes to build an experience-infused knowledge model by continuously learning the experiential knowledge from data, and incrementally injecting it into the knowledge graph. Therefore, incremental learning and injection are used to solve the data collection problem, and the knowledge graph is modeled and containerized to solve the large-scale multi-classification problems. First, an entity linking method is designed and a heterogeneous knowledge graph is constructed by graph fusion. Then, an adaptive neural network model is constructed for each dataset, and the data is used for statistical initialization and model training. Finally, the weights and biases of the learned neural network model are updated to the knowledge graph. It is worth noting that for the incremental process, we consider both the data and class increments. We evaluate the diagnostic effectiveness of the model on the current dataset and the anti-forgetting ability on the historical dataset after class increment on three public datasets. Compared with the classical model, the proposed model improves the diagnostic accuracy of the three datasets by 5%, 2%, and 15% on average, respectively. Meanwhile, the model under incremental learning has a better ability to resist forgetting.

Key words: knowledge graph, disease diagnosis, incremental learning, adaptive neural network, knowledge model

摘要: 该文目标是构建一个能实现多科室级别的基于症状的疾病自动诊断模型。但构建分类几千种疾病的模型、同时收集成千上万种疾病-症状数据集这两个任务是现有研究难以解决的。基于“知识图谱即是模型”的想法,提出了通过不断学习数据中的经验知识,增量式地注入到知识图谱中,以此来构建一个注入“经验”的知识模型。即通过增量式学习、注入来解决数据收集问题,通过将知识图谱模型化、容器化来解决超多分类问题。首先通过图谱融合构建了一份异构知识图谱并设计了一个实体链接方法。然后对于每份数据集构建一个自适应的神经网络模型,利用数据实现统计学初始化和模型训练。最后将学习完成的神经网络模型中权重和偏置更新到异构图谱中。对于增量过程,同时考虑了数据增量和类别增量两种情况。在三份公共数据集上评估了模型在当前数据集上的诊断效果,以及类别增量后对历史数据集的抗遗忘能力两个性能;与经典模型相比,诊断正确率分别平均提高了5%、2%和15%;同时模型在增量学习下具有较好的抗遗忘能力。

关键词: 知识图谱,疾病诊断,增量学习,自适应网络,知识模型

CLC Number: