Journal of the Operations Research Society of China ›› 2024, Vol. 12 ›› Issue (2): 351-362.doi: 10.1007/s40305-022-00394-9

Previous Articles     Next Articles

Local search yields a PTAS for fixed-dimensional k-means problem with penalties

Fan Yuan1, Da-Chuan Xu1, Dong-Lei Du2, Dong-Mei Zhang3   

  1. 1 Beijing Institute for Scientific and Engineering Computing, Beijing University of Technology, Beijing 100124, China;
    2 Faculty of Management, University of New Brunswick, Fredericton, NB E3B 9Y2, Canada;
    3 School of Computer Science and Technology, Shandong Jianzhu University, Jinan 250101, Shandong, China
  • Received:2020-09-20 Revised:2021-11-03 Online:2024-06-30 Published:2024-06-12
  • Contact: Dong-Mei Zhang, Fan Yuan, Da-Chuan Xu, Dong-Lei Du E-mail:zhangdongmei@sdjzu.edu.cn;yuanfan@emails.bjut.edu.cn;xudc@bjut.edu.cn;ddu@unb.ca
  • Supported by:
    The first two authors are supported by the National Natural Science Foundation of China (No. 12131003) and Beijing Natural Science Foundation Project (No. Z200002). The third author is supported by the Natural Sciences and Engineering Research Council of Canada (No.06446), and the National Natural Science Foundation of China (Nos. 11771386 and 11728104). The fourth author is supported by the National Natural Science Foundation of China (No. 11871081).

Abstract: We study a problem called the k-means problem with penalties (k-MPWP), which is a natural generalization of the typical k-means problem. In this problem, we have a set D of client points in ℝd, a set F of possible centers in ℝd, and a penalty cost pj > 0 for each point jD. We are also given an integer k which is the size of the center point set. We want to find a center point set SF with size k, choose a penalized subset of clients PD, and assign every client in D\P to its open center. Our goal is to minimize the sum of the squared distances between every point in D\P to its assigned centre point and the sum of the penalty costs for all clients in P. By using the multi-swap local search technique and under the fixed-dimensional Euclidean space setting, we present a polynomial-time approximation scheme (PTAS) for the k-MPWP.

Key words: Approximation algorithm, k-means, Local search, Penalty

CLC Number: