Rashid, A. N. M. B., Ahmed, M., Sikos, L. F., & Haskell-Dowland, P. (2020). Cooperative co-evolution for feature selection in big data with random feature grouping. Journal of Big Data, 7, article 107. https://doi.org/10.1186/s40537-020-00381-y

Abstract

© 2020, The Author(s). A massive amount of data is generated with the evolution of modern technologies. This high-throughput data generation results in Big Data, which consist of many features (attributes). However, irrelevant features may degrade the classification performance of machine learning (ML) algorithms. Feature selection (FS) is a technique used to select a subset of relevant features that represent the dataset. Evolutionary algorithms (EAs) are widely used search strategies in this domain. A variant of EAs, called cooperative co-evolution (CC), which uses a divide-and-conquer approach, is a good choice for optimization problems. The existing solutions have poor performance because of some limitations, such as not considering feature interactions, dealing with only an even number of features, and decomposing the dataset statically. In this paper, a novel random feature grouping (RFG) has been introduced with its three variants to dynamically decompose Big Data datasets and to ensure the probability of grouping interacting features into the same subcomponent. RFG can be used in CC-based FS processes, hence called Cooperative Co-Evolutionary-Based Feature Selection with Random Feature Grouping (CCFSRFG). Experiment analysis was performed using six widely used ML classifiers on seven different datasets from the UCI ML repository and Princeton University Genomics repository with and without FS. The experimental results indicate that in most cases [i.e., with naïve Bayes (NB), support vector machine (SVM), k-Nearest Neighbor (k-NN), J48, and random forest (RF)] the proposed CCFSRFG-1 outperforms an existing solution (a CC-based FS, called CCEAFS) and CCFSRFG-2, and also when using all features in terms of accuracy, sensitivity, and specificity.

DOI

10.1186/s40537-020-00381-y

Related Publications

Rashid, A. N. M. B. (2021). Cooperative co-evolution-based feature selection for big data analytics. https://ro.ecu.edu.au/theses/2428

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Download

Included in

Computer Sciences Commons, Data Science Commons

COinS

Link to publisher version (DOI)

10.1186/s40537-020-00381-y

Research outputs 2014 to 2021

Cooperative co-evolution for feature selection in big data with random feature grouping

Author Identifier

Document Type

Publication Title

Volume

Issue

Publisher

School

RAS ID

Funders

Comments

Abstract

DOI

Related Publications

Creative Commons License

Included in

Link to publisher version (DOI)

Search

Links

Browse

Author Information

Article Locations

Research outputs 2014 to 2021

Cooperative co-evolution for feature selection in big data with random feature grouping

Authors

Author Identifier

Document Type

Publication Title

Volume

Issue

Publisher

School

RAS ID

Funders

Comments

Abstract

DOI

Related Publications

Creative Commons License

Included in

Share

Link to publisher version (DOI)

Search

Links

Browse

Author Information

Article Locations