Journal of shanghai Jiaotong University (Science) ›› 2015, Vol. 20 ›› Issue (2): 164-170.doi: 10.1007/s12204-015-1605-z

Previous Articles     Next Articles

Improved Kernel Possibilistic Fuzzy Clustering Algorithm Based on Invasive Weed Optimization

Improved Kernel Possibilistic Fuzzy Clustering Algorithm Based on Invasive Weed Optimization

ZHAO Xiao-qiang* (赵小强), ZHOU Jin-hu (周金虎)   

  1. (College of Electrical Engineering and Information Engineering, Lanzhou University of Technology, Lanzhou 730050, China)
  2. (College of Electrical Engineering and Information Engineering, Lanzhou University of Technology, Lanzhou 730050, China)
  • Published:2015-04-02
  • Contact: ZHAO Xiao-qiang (赵小强) E-mail:xqzhao@lut.cn

Abstract: Fuzzy c-means (FCM) clustering algorithm is sensitive to noise points and outlier data, and the possibilistic fuzzy c-means (PFCM) clustering algorithm overcomes the problem well, but PFCM clustering algorithm has some problems: it is still sensitive to initial clustering centers and the clustering results are not good when the tested datasets with noise are very unequal. An improved kernel possibilistic fuzzy c-means algorithm based on invasive weed optimization (IWO-KPFCM) is proposed in this paper. This algorithm first uses invasive weed optimization (IWO) algorithm to seek the optimal solution as the initial clustering centers, and introduces kernel method to make the input data from the sample space map into the high-dimensional feature space. Then, the sample variance is introduced in the objection function to measure the compact degree of data. Finally, the improved algorithm is used to cluster data. The simulation results of the University of California-Irvine (UCI) data sets and artificial data sets show that the proposed algorithm has stronger ability to resist noise, higher cluster accuracy and faster convergence speed than the PFCM algorithm.

Key words: data mining|clustering algorithm| possibilistic fuzzy c-means (PFCM)| kernel possibilistic fuzzy c-means algorithm based on invasive weed optimization (IWO-KPFCM)

摘要:

Fuzzy c-means (FCM) clustering algorithm is sensitive to noise points and outlier data, and the possibilistic fuzzy c-means (PFCM) clustering algorithm overcomes the problem well, but PFCM clustering algorithm has some problems: it is still sensitive to initial clustering centers and the clustering results are not good when the tested datasets with noise are very unequal. An improved kernel possibilistic fuzzy c-means algorithm based on invasive weed optimization (IWO-KPFCM) is proposed in this paper. This algorithm first uses invasive weed optimization (IWO) algorithm to seek the optimal solution as the initial clustering centers, and introduces kernel method to make the input data from the sample space map into the high-dimensional feature space. Then, the sample variance is introduced in the objection function to measure the compact degree of data. Finally, the improved algorithm is used to cluster data. The simulation results of the University of California-Irvine (UCI) data sets and artificial data sets show that the proposed algorithm has stronger ability to resist noise, higher cluster accuracy and faster convergence speed than the PFCM algorithm.

关键词: data mining|clustering algorithm| possibilistic fuzzy c-means (PFCM)| kernel possibilistic fuzzy c-means algorithm based on invasive weed optimization (IWO-KPFCM)

CLC Number: