Abstract:
Many practical applications pose the challenge of utilizing instance-level background knowledge (e.g., subsets of similar or dissimilar data points) within their input data to improve clustering results. In this work, we build on the widely adopted k-center clustering, modeling its input instance-level background knowledge as must-link (ML) and cannot-link (CL) constraint sets, and formulate the constrained k-center problem. Given the long-standing challenge of developing efficient algorithms for constrained clustering problems, we first derive an efficient approximation algorithm for constrained k-center that achieves the best possible approximation ratio of 2 via linear programming (LP) rounding. Recognizing the limitations of LP-rounding algorithms, including high runtime complexity and difficulty of parallelization, we then develop a greedy algorithm that does not rely on the LP and can be efficiently parallelized. This algorithm achieves the same approximation ratio of 2 with lower runtime complexity. Finally, we empirically evaluate our approximation algorithm against baselines on various real datasets, validating our theoretical findings and demonstrating significant advantages of our algorithm in clustering cost, quality, and runtime.
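For background, the sketch below shows the classic farthest-point (Gonzalez) greedy heuristic for the unconstrained k-center problem, which is the standard baseline attaining the same best-possible ratio of 2. It is not the paper's constrained algorithm and it ignores ML/CL constraints entirely; the function name, the Euclidean metric, and the toy data are illustrative assumptions.

```python
import numpy as np

def greedy_k_center(points: np.ndarray, k: int, seed: int = 0) -> list[int]:
    """Farthest-point (Gonzalez) greedy heuristic for unconstrained k-center.

    Returns the indices of k chosen centers. The maximum point-to-center
    distance is at most twice the optimum under any metric; Euclidean
    distance is used here for simplicity. This sketch does NOT handle
    must-link / cannot-link constraints.
    """
    rng = np.random.default_rng(seed)
    n = points.shape[0]
    centers = [int(rng.integers(n))]              # start from an arbitrary point
    # dist[i] = distance from point i to its nearest chosen center so far
    dist = np.linalg.norm(points - points[centers[0]], axis=1)
    for _ in range(1, k):
        nxt = int(np.argmax(dist))                # farthest point becomes the next center
        centers.append(nxt)
        dist = np.minimum(dist, np.linalg.norm(points - points[nxt], axis=1))
    return centers

if __name__ == "__main__":
    X = np.random.default_rng(1).random((200, 2))  # toy 2-D data
    print(greedy_k_center(X, k=4))
```

A common way to incorporate must-link constraints is to contract each must-link group into a single unit before clustering, with cannot-link pairs restricting assignments; how the paper's LP-rounding and parallelizable greedy algorithms handle both constraint types while preserving the ratio of 2 is described in the full text.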
Source: IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
ISSN: 2162-237X
Year: 2025
Impact Factor: 10.200 (JCR@2023)