Distance based k-means clustering algorithm for determining number of clusters for high dimensional data

dc.contributor.authorMohamed Cassim Alibuhtto
dc.contributor.authorNor Idayu Mahat
dc.date.accessioned2019-11-25T14:51:10Z
dc.date.available2019-11-25T14:51:10Z
dc.date.issued2020-01
dc.description.abstractClustering is one of the most common unsupervised data mining classification techniques for splitting objects into a set of meaningful groups. However, the traditional k-means algorithm is not applicable to retrieve useful information / clusters, particularly when there is an overwhelming growth of multidimensional data. Therefore, it is necessary to introduce a new strategy to determine the optimal number of clusters. To improve the clustering task on high dimensional data sets, the distance based k-means algorithm is proposed. The proposed algorithm is tested using eighteen sets of normal and non-normal multivariate simulation data under various combinations. Evidence gathered from the simulation reveal that the proposed algorithm is capable of identifying the exact number of clusters.en_US
dc.identifier.citationAlibuhtto, M.C., & Mahat, N.I.(2020).Distance based k-means clustering algorithm for determining number of clusters for high dimensional data. Decision science letters. 9(1), 51-58.en_US
dc.identifier.urihttp://ir.lib.seu.ac.lk/handle/123456789/3905
dc.language.isoenen_US
dc.publisherGrowing Scienceen_US
dc.subjectClustering High Dimensional Data K-means algorithm Optimal Cluster Simulationen_US
dc.titleDistance based k-means clustering algorithm for determining number of clusters for high dimensional dataen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Distance based k-means.pdf
Size:
913.37 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: