上海交通大学学报(英文版) ›› 2015, Vol. 20 ›› Issue (2): 164-170.doi: 10.1007/s12204-015-1605-z

• • 上一篇    下一篇

Improved Kernel Possibilistic Fuzzy Clustering Algorithm Based on Invasive Weed Optimization

ZHAO Xiao-qiang* (赵小强), ZHOU Jin-hu (周金虎)   

  1. (College of Electrical Engineering and Information Engineering, Lanzhou University of Technology, Lanzhou 730050, China)
  • 发布日期:2015-04-02
  • 通讯作者: ZHAO Xiao-qiang (赵小强) E-mail:xqzhao@lut.cn

Improved Kernel Possibilistic Fuzzy Clustering Algorithm Based on Invasive Weed Optimization

ZHAO Xiao-qiang* (赵小强), ZHOU Jin-hu (周金虎)   

  1. (College of Electrical Engineering and Information Engineering, Lanzhou University of Technology, Lanzhou 730050, China)
  • Published:2015-04-02
  • Contact: ZHAO Xiao-qiang (赵小强) E-mail:xqzhao@lut.cn

摘要:

Fuzzy c-means (FCM) clustering algorithm is sensitive to noise points and outlier data, and the possibilistic fuzzy c-means (PFCM) clustering algorithm overcomes the problem well, but PFCM clustering algorithm has some problems: it is still sensitive to initial clustering centers and the clustering results are not good when the tested datasets with noise are very unequal. An improved kernel possibilistic fuzzy c-means algorithm based on invasive weed optimization (IWO-KPFCM) is proposed in this paper. This algorithm first uses invasive weed optimization (IWO) algorithm to seek the optimal solution as the initial clustering centers, and introduces kernel method to make the input data from the sample space map into the high-dimensional feature space. Then, the sample variance is introduced in the objection function to measure the compact degree of data. Finally, the improved algorithm is used to cluster data. The simulation results of the University of California-Irvine (UCI) data sets and artificial data sets show that the proposed algorithm has stronger ability to resist noise, higher cluster accuracy and faster convergence speed than the PFCM algorithm.

关键词: data mining, clustering algorithm, possibilistic fuzzy c-means (PFCM), kernel possibilistic fuzzy c-means algorithm based on invasive weed optimization (IWO-KPFCM)

Abstract: Fuzzy c-means (FCM) clustering algorithm is sensitive to noise points and outlier data, and the possibilistic fuzzy c-means (PFCM) clustering algorithm overcomes the problem well, but PFCM clustering algorithm has some problems: it is still sensitive to initial clustering centers and the clustering results are not good when the tested datasets with noise are very unequal. An improved kernel possibilistic fuzzy c-means algorithm based on invasive weed optimization (IWO-KPFCM) is proposed in this paper. This algorithm first uses invasive weed optimization (IWO) algorithm to seek the optimal solution as the initial clustering centers, and introduces kernel method to make the input data from the sample space map into the high-dimensional feature space. Then, the sample variance is introduced in the objection function to measure the compact degree of data. Finally, the improved algorithm is used to cluster data. The simulation results of the University of California-Irvine (UCI) data sets and artificial data sets show that the proposed algorithm has stronger ability to resist noise, higher cluster accuracy and faster convergence speed than the PFCM algorithm.

Key words: data mining, clustering algorithm, possibilistic fuzzy c-means (PFCM), kernel possibilistic fuzzy c-means algorithm based on invasive weed optimization (IWO-KPFCM)

中图分类号: