Concept drift for big data

Document Type

Book Chapter

Publication Title

Combating Security Challenges in the Age of Big Data

Publisher

Springer

School

School of Science / ECU Security Research Institute

RAS ID

31623

Comments

Seraj R., Ahmed M. (2020) Concept Drift for Big Data. In: Fadlullah Z., Khan Pathan AS. (eds) Combating Security Challenges in the Age of Big Data. Advanced Sciences and Technologies for Security Applications. Springer, Cham. https://doi.org/10.1007/978-3-030-35642-2_2

Abstract

The term “concept drift” refers to a change in statistical distribution of the data. In machine learning and predictive analysis, a fundamental assumption exits which reasons that the data is a random variable which is being generated independently from an underlying stationary distribution. In this chapter we present discussions on concept drifts that are inherent in the context big data. We discuss different forms of concept drifts that are evident in streaming data and outline different techniques for handling them. Handling concept drift is important for big data where the data flow occurs continuously causing existing learned models to lose their predictive accuracy. This chapter will serve as a reference to academicians and industry practitioners who are interested in the niche area of handling concept drift for big data applications.

DOI

10.1007/978-3-030-35642-2_2

Access Rights

subscription content

Share

 
COinS