Title

Lexical URL analysis for discriminating phishing and legitimate websites

Document Type

Conference Proceeding

Publisher

ACM

Faculty

Faculty of Computing, Health and Science

School

School of Computer and Security Science / Security Research Centre (secAU)

RAS ID

12292

Comments

This article was originally published as: Khonji, M., Iraqi, Y., & Jones, A. (2011). Lexical URL analysis for discriminating phishing and legitimate websites. Paper presented at the Annual Collaboration, Electronic messaging, Anti-Abuse and Spam (CEAS) Conference. Perth, WA.

Abstract

A study that aims to evaluate the practical e ectiveness of website classi cation by lexically analyzing URL tokens in addition to a novel tokenization mechanism to increase prediction accuracy. The study analyzes over 70,000 legit- imate and phishing URLs collected over 6 months period from PhishTank1, Khalifa University HTTP logs and volun- teers using an experimental HTTP proxy server. A statisti- cal classi cation model is then constructed and evaluated to measure the practical e ectiveness of the lexical URL anal- ysis presented in this paper.

This document is currently not available here.

 
COinS