Journal of Shanghai Jiao Tong University ›› 2022, Vol. 56 ›› Issue (12): 1638-1648.doi: 10.16183/j.cnki.jsjtu.2021.292

Special Issue: 《上海交通大学学报》2022年“电子信息与电气工程”专题

• Electronic Information and Electrical Engineering • Previous Articles     Next Articles

K-means Hybrid Iterative Clustering Based on Memory Transfer Sailfish Optimization

HUANG Hea,b, XIONG Wua,b, WU Kuna,b, WANG Huifengb, RU Fenga,b, WANG Juna()   

  1. b. Xi’an Key Laboratory of Intelligent Expressway Information Fusion and Control, Chang’an University, Xi’an 710064, China
    b. School of Electronics and Control Engineering, Chang’an University, Xi’an 710064, China
  • Received:2021-08-05 Online:2022-12-28 Published:2023-01-05
  • Contact: WANG Jun E-mail:jwang@nwu.edu.cn.

Abstract:

Aimed at the problem that the existing K-means clustering (KMC) algorithm is greatly affected by initialization, and the randomly generated clustering center can easily make the clustering result fall into local optimum and stop iterating, resulting in low clustering accuracy and poor robustness, a K-means hybrid iterative clustering algorithm based on memory transfer sailfish optimization (MTSFO-HIKMC) is proposed. First, learning from the existing improvement ideas, the maximum and minimum distance product is introduced to initialize the KMC cluster center, to avoid the uncertainty caused by random initialization. At the same time, in the iterative process, the current optimal solution is made to locally perform adaptive memory transfer correction to solve the problem of poor global optimization ability and insufficient search accuracy caused by the single search path of the sailfish algorithm. Using the Iris, Seeds, CMC and Wine international standard data sets, the MTSFO-HIKMC, the sailfish optimized K-means hybrid iterative clustering (SFO-KMC) algorithm, the introduction of the improved Moth-to-fire K-means cross iterative clustering (IMFO-KMC) algorithm, the KMC algorithm, and the fuzzy C-means (FCM) algorithm are compared and tested. From the obtained convergence curves and performance indicators, it can be seen that the MTSFO-HIKMC algorithm proposed in this paper has a faster convergence speed than IMFO-KMC. Compared with the IMFO-KMC algorithm, the dimensional space has a higher search accuracy. Compared with the KMC algorithm and FCM, it has a higher search accuracy. Compared with the SFO-KMC algorithm, its convergence speed and search accuracy are significantly improved, especially in high-dimensional data sets.

Key words: sailfish algorithm, adaptive memory transfer correction strategy, K-means clustering (KMC), maximum and minimum distance product, UCI standard data set

CLC Number: