Fast algorithm of cluster analysis k-medoids | Prikladnaya Diskretnaya Matematika - Applied Discrete Mathematics. 2018. № 39. DOI: 10.17223/20710410/39/11

The algorithm k-medoids (KM) is widely used for clustering graphs. The following modifications of it by adding some procedures have been researched in a computer experiments: CKM = KM + CLARA (Cluster LARge Application), LSKM = KM + + L-SPAR (Local SPARsification), BKM = KM + CLARA + L-SPAR, NKM = = KM + NH (New Heuristic choosing cluster center), CNKM = NKM + CLARA, LSNKM = NKM + L-SPAR, FKM = NKM + CLARA + L-SPAR. The experiment results show that L-SPAR procedure allows to decrease the average running time for LSKM by 5.3%, BKM by 6.1%, LSNKM by 8.1%, FKM by 8.3% and to improve the clustering quality by 2%, 2.1%, 3%, 3.3% respectively. Besides, the number of iterations in NKM is twice more than in KM, but the running time of NKM is an order less than of KM, the running time of CKM is two thirds of the running time of KM, the computer complexity of CNKM linearly depends on the graph size, and the application of CLARA does not diminish it appreciably. NH allows 16 times decreasing the computer complexity of KM without loosing the clustering quality. The experiments were conducted on data arrays that represented graphs of social networks YouTube, Live Journal, and the shop Amazon.
Download file
Counter downloads: 427
  • Title Fast algorithm of cluster analysis k-medoids
  • Headline Fast algorithm of cluster analysis k-medoids
  • Publesher Tomask State UniversityTomsk State University
  • Issue Prikladnaya Diskretnaya Matematika - Applied Discrete Mathematics 39
  • Date:
  • DOI 10.17223/20710410/39/11
Keywords
быстрый алгоритм кластерного анализа, PAM-реализация k-medoids, методика CLARA, прореживание L-SPAR, fast algorithm of cluster analysis, PAM, CLARA, Local Graph Sparsifi-cation
Authors
References
Fortunato S. Community detection in graphs // Physics Reports. 2010. V. 486. Iss.3-5. P. 75-174.
Kaufman L. and Rousseeuw P. J. Finding Groups in Data: An Introduction to Cluster Analysis. New Jersey: Hoboken, 2005.
Newman M. J. Modularity and community structure in networks // Proc. of the National Academy of Sciences of the USA. 2006. V. 103. No.23. P.8577-8582.
Satuluri V. M. Scalable Clustering of Modern Networks. Ohio State University Columbus, OH, USA, 2012.
Ovelgonne M. Scalable Algorithms for Community Detection in Very Large Graphs. Karlsruhe, 2011. 146 p.
http://snap.stanford.edu/data/ - Stanford Large Network Dataset Collection. 2017.
 Fast algorithm of cluster analysis k-medoids | Prikladnaya Diskretnaya Matematika - Applied Discrete Mathematics. 2018. № 39. DOI: 10.17223/20710410/39/11
Fast algorithm of cluster analysis k-medoids | Prikladnaya Diskretnaya Matematika - Applied Discrete Mathematics. 2018. № 39. DOI: 10.17223/20710410/39/11
Download full-text version
Counter downloads: 597