Performance analysis of hard clustering techniques for big IoT data analytics

Document Type

Conference Proceeding

Publication Title

2019 Cybersecurity and Cyberforensics Conference (CCC)

Publisher

IEEE

School

School of Science

RAS ID

30514

Comments

Ahmed, M., & Barkat, A. (2019). Performance analysis of hard clustering techniques for big IoT data analytics. In 2019 Cybersecurity and Cyberforensics Conference (CCC) (pp. 62-66). Melbourne, Australia: IEEE.


Abstract

Data analytics for the Internet of Things (IoT) is an important task in today's connected environment. In particular, identifying infrequent patterns in a huge amount of data is challenging. Clustering is a well-established technique for revealing the patterns in a given dataset. However, one impediment is that most clustering algorithms require the number of clusters as input; for example, the well-known k-means algorithm requires the value of k (the number of clusters to be produced). Unlike other hard clustering algorithms, GenClust++ and x-means can identify the number of clusters automatically. In this paper, we investigate the effectiveness of these two algorithms in identifying infrequent patterns, that is, anomalous clusters. We experimented with seven benchmark IoT datasets, and the results show that x-means performs better than GenClust++ in terms of true positive rate (TPR) and false positive rate (FPR). In addition, x-means outperforms GenClust++ in computational efficiency.
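The abstract reports TPR and FPR for anomalous clusters but does not say how a cluster is judged anomalous. The following is a minimal sketch of how such an evaluation could be computed, not the authors' procedure. It assumes a hypothetical rule in which every point in a cluster holding at most a given fraction of the data is flagged as anomalous; the function names, the `max_fraction` parameter, and the toy data are all illustrative assumptions.

```python
from collections import Counter


def flag_small_clusters(cluster_ids, max_fraction=0.1):
    """Hypothetical rule (an assumption, not from the paper): flag every point
    whose cluster holds at most max_fraction of all points."""
    sizes = Counter(cluster_ids)
    n = len(cluster_ids)
    return [sizes[c] / n <= max_fraction for c in cluster_ids]


def tpr_fpr(truth, predicted):
    """Return (TPR, FPR) for boolean anomaly labels of equal length."""
    tp = sum(t and p for t, p in zip(truth, predicted))
    fn = sum(t and not p for t, p in zip(truth, predicted))
    fp = sum((not t) and p for t, p in zip(truth, predicted))
    tn = sum((not t) and (not p) for t, p in zip(truth, predicted))
    tpr = tp / (tp + fn) if (tp + fn) else 0.0
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    return tpr, fpr


# Toy data (illustrative only): ten points, nine in cluster 0 and one in cluster 1.
cluster_ids = [0] * 9 + [1]
# Ground truth: the last two points are anomalies.
truth = [False] * 8 + [True, True]

predicted = flag_small_clusters(cluster_ids, max_fraction=0.1)
tpr, fpr = tpr_fpr(truth, predicted)
print(f"TPR={tpr:.2f} FPR={fpr:.2f}")
```

In this toy case only the singleton cluster is flagged, so one of the two true anomalies is caught and no normal point is flagged. The cluster assignments could come from any hard clustering algorithm, including ones that choose the number of clusters themselves.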

DOI

10.1109/CCC.2019.000-8

Access Rights

Free to read
