A study of feature subset evaluators and feature subset searching methods for phishing classification

Document Type

Conference Proceeding

Faculty

Faculty of Computing, Health and Science

School

School of Computer and Security Science / Security Research Centre (secAU)

RAS ID

12291

Comments

Khonji, M., Jones, A. , & Iraqi, Y. (2011). A study of feature subset evaluators and feature subset searching methods for phishing classification. Paper presented at the Annual Collaboration, Electronic messaging, Anti-Abuse and Spam (CEAS) Conference. Perth, WA.

Abstract

Phishing is a semantic attack that aims to take advantage of the naivety of users of electronic services (e.g. e-banking). A number of solutions have been proposed to minimize the impact of phishing attacks. The most accurate email phishing classi ers, that are publicly known, use machine learning techniques. Previous work in phishing email classi cation via machine learning have primarily focused on enhancing the classi cation accuracy by studying the addition of novel features, ensembles, or classi cation algorithms. This study follows a di erent path by taking advantage of previously proposed features. The primary focus of this paper is to enhance the classi cation accuracy of phishing email classi- ers by nding an e ective feature subset out of a number of previously proposed features, by evaluating various feature selection methods. The selected feature subset in this study resulted in a classi cation model with an f1 score of 99.396% for 21 heuristic features and a single classi er.

DOI

10.1145/2030376.2030392

Access Rights

subscription content

Share

 
COinS
 

Link to publisher version (DOI)

10.1145/2030376.2030392