Performance analysis of hard clustering techniques for big IoT data analytics
Document Type
Conference Proceeding
Publication Title
2019 Cybersecurity and Cyberforensics Conference (CCC)
Publisher
IEEE
School
School of Science
RAS ID
30514
Abstract
Data analytics for Internet of Things (IoT) is an important task in today's connected environment. In particular, identification of infrequent patterns from a huge amount of data is certainly a challenging task. Clustering is a well established technique to divulge the patterns from any given dataset. However, one of the impediments for clustering is to provide the number of clusters that most of the clustering algorithm requires, for example the famous k-means requires the value of k (number of clusters to be produced). GenClust++ and x-means clustering algorithms can automatically identify the number of clusters unlike other hard clustering algorithms. In this paper, we investigate the effectiveness of these two algorithms to identify infrequent patterns or the anomalous clusters. We experimented with seven benchmark IoT datasets and it is evident that the performance of x-means in terms of TPR, FPR is better than GenClust++. In addition to that, in terms of the computational efficiency, x-means outperforms the GenClust++.
DOI
10.1109/CCC.2019.000-8
Access Rights
free_to_read
Comments
Ahmed, M., & Barkat, A. (2019). Performance analysis of hard clustering techniques for big IoT data analytics. In 2019 Cybersecurity and Cyberforensics Conference (CCC) (pp. 62-66). Melbourne, Australia: IEEE.
Available here.