J Shanghai Jiaotong Univ Sci, 2023, Vol. 28, Issue (4): 418-. doi: 10.1007/s12204-023-2584-0

Novel Scheme for Essential Proteins Identification Based on Improved Multicriteria Decision Making

LU Pengli1* (卢鹏丽), CHEN Yuntian1 (陈云天), LIAO Yonggang2 (廖永刚)

  (1. School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, China; 2. China Mobile Communications Group Gansu Co., Ltd., Lanzhou 730070, China)
  • Received: 2021-08-17; Accepted: 2022-03-07; Online: 2023-07-28; Published: 2023-07-31

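Abstract (Chinese version): Identifying essential proteins from protein-protein interaction networks is of great significance for biological evolution and new drug development. Many current criteria for judging protein essentiality consider only a single attribute of a protein, which causes information loss. To address this problem, this paper proposes a more comprehensive and effective essential protein identification method based on improved multicriteria decision making (EPI-TOPSIS). First, taking different protein attributes into account, protein importance is evaluated from three aspects: gene-degree centrality, based on expression sequences; and subcellular-neighbor-degree centrality and subcellular-complex in-degree centrality, based on localization information and protein complexes. Betweenness centrality is then considered together with these three measures as the attribute criteria of the multicriteria decision-making model. The analytic hierarchy process assigns a weight to each criterion, protein essentiality is computed from the closeness to the ideal solution, and the proteins are ranked by priority. Finally, experiments on the YDIP, YMIPS, Krogan and BioGRID networks show that EPI-TOPSIS outperforms the compared algorithms.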

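Key words (Chinese version): protein-protein interaction network, essential proteins, multicriteria decision making, biological information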

Abstract: Identifying essential proteins from protein-protein interaction networks is important for studies on biological evolution and new drug development. Most existing criteria for prioritizing essential proteins focus on only a single attribute of the proteins in the network and therefore suffer from information loss. To overcome this problem, a relatively comprehensive and effective method for essential protein identification based on improved multicriteria decision making (MCDM), called essential proteins identification-technique for order preference by similarity to ideal solution (EPI-TOPSIS), is proposed. First, considering different attributes of proteins, we propose three measures that evaluate the significance of proteins from different aspects: gene-degree centrality (GDC), based on gene expression sequences; and subcellular-neighbor-degree centrality (SNDC) and subcellular-indegree centrality (SIDC), based on subcellular location information and protein complexes. Then, betweenness centrality (BC) and these three measures are taken together as the criteria of the decision-making model. The analytic hierarchy process is used to assign a weight to each criterion, and the proteins are prioritized by their closeness to the ideal solution of MCDM, i.e., TOPSIS. Experiments are conducted on the YDIP, YMIPS, Krogan and BioGRID networks. The results indicate that EPI-TOPSIS outperforms several state-of-the-art approaches for identifying essential proteins on the performance measures.

Key words: protein-protein interaction network, essential proteins, multicriteria decision making (MCDM), biological information
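
The weighting and ranking steps described in the abstract can be illustrated with a minimal sketch of textbook AHP and TOPSIS. This is not the authors' implementation: the abstract does not specify how their TOPSIS is "improved", so the code below shows only the standard procedure. The pairwise comparison matrix, the protein names P1 to P4, and all centrality values are hypothetical, and the row geometric mean is used as a common approximation to the AHP eigenvector weights. All four criteria (BC, GDC, SNDC, SIDC) are assumed to be benefit-type, i.e., larger is better.

```python
import math

def ahp_weights(pairwise):
    """Approximate AHP weights: normalized geometric mean of each row
    of a reciprocal pairwise comparison matrix."""
    n = len(pairwise)
    geo_means = [math.prod(row) ** (1.0 / n) for row in pairwise]
    total = sum(geo_means)
    return [g / total for g in geo_means]

def topsis_scores(decision, weights):
    """Standard TOPSIS relative closeness for benefit-type criteria.
    decision[i][j] is the value of criterion j for alternative i."""
    n_crit = len(weights)
    # Vector-normalize each criterion column (guard against all-zero columns).
    norms = [math.sqrt(sum(row[j] ** 2 for row in decision)) or 1.0
             for j in range(n_crit)]
    weighted = [[weights[j] * row[j] / norms[j] for j in range(n_crit)]
                for row in decision]
    # Ideal and anti-ideal solutions: best and worst value per criterion.
    ideal = [max(col) for col in zip(*weighted)]
    anti = [min(col) for col in zip(*weighted)]
    scores = []
    for row in weighted:
        d_pos = math.dist(row, ideal)   # distance to the ideal solution
        d_neg = math.dist(row, anti)    # distance to the anti-ideal solution
        total = d_pos + d_neg
        scores.append(d_neg / total if total > 0 else 0.0)
    return scores

# Hypothetical pairwise judgments over the criteria (BC, GDC, SNDC, SIDC).
pairwise = [
    [1.0, 1 / 2, 1 / 2, 1 / 3],
    [2.0, 1.0, 1.0, 1 / 2],
    [2.0, 1.0, 1.0, 1 / 2],
    [3.0, 2.0, 2.0, 1.0],
]

# Hypothetical centrality values for four proteins (rows) on the four criteria.
proteins = ["P1", "P2", "P3", "P4"]
decision = [
    [0.9, 0.8, 0.7, 0.9],
    [0.1, 0.2, 0.3, 0.1],
    [0.5, 0.5, 0.5, 0.5],
    [0.9, 0.1, 0.2, 0.3],
]

weights = ahp_weights(pairwise)
scores = topsis_scores(decision, weights)
# Proteins sorted from most to least essential under this toy scoring.
ranking = [p for _, p in sorted(zip(scores, proteins), reverse=True)]
print(ranking)
```

In this toy input P1 is at least as good as every other protein on every criterion, so it coincides with the ideal solution and is ranked first. With real data, the rows of the decision matrix would hold the four centrality scores computed for each protein in the network.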