项目作者: vivekamin

项目描述 :
Implementation of K-Means and K-Means++ algorithm
高级语言: Python
项目地址: git://github.com/vivekamin/kmeans-clustering.git
创建时间: 2017-12-28T22:40:39Z
项目社区:https://github.com/vivekamin/kmeans-clustering

开源协议:

下载


kmeans-clustering

The data contains 7,195 instances and 21 features.

implement Kmeans algorithm in Python with the number of clusters k ranging from 2 to 10.Initialized the cluster centers by randomly picking them among all the instances. Then for each number of clusters, upon convergence of Kmeans, computed the objective function and ploted the objective function value vs. the number of clusters k.

This initialization scheme has some problems.So implement the Kmeans++ algorithm where initialized cluster centers to be far from each other. Used the same number of clusters ranging from 2 to 10, and then computed the objective function values versus different number of clusters and plot the figure as in Kmeans.