Phishing Web Page Detection using Optimised Machine Learning

Jordan Stobbs, Biju Issac, Seibu Mary Jacob

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

12 Citations (Scopus)
90 Downloads (Pure)

Abstract

Phishing is a type of social engineering attack that can affect any company or anyone. This paper explores the effect that different features and optimisation techniques have on the accuracy of intelligent phishing detection using machine learning algorithms. This paper explores both hyperparameter optimisation as well as feature selection optimisation. For hyperparameter tuning, both TPE (Tree-structured Parzen Estimator) and GA (Genetic Algorithm) were tested, with the best option being model dependent. For feature selection, GA, MFO (Moth Flame Optimisation) and PSO (Particle Swarm Optimisation) were used with PSO working best with a Random Forest model. This work used URL (Uniform Resource Locator), DOM (Document Object Model) structure, page rank and page information related features. This research found that the best combination was Random Forest using PSO for feature selection and TPE for hyperparameter optimisation, giving an accuracy of 99.33%.
Original languageEnglish
Title of host publication2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom)
Subtitle of host publication29 December 2020 – 1 January 2021 Guangzhou, China
EditorsGuojun Wang, Ryan Ko, Md Zakirul Alam Bhuiyan, Yi Pan
Place of PublicationPiscataway, NJ
PublisherIEEE
Pages483-490
Number of pages8
ISBN (Electronic)9781665403924
ISBN (Print)9781665403931
DOIs
Publication statusPublished - Dec 2020
Event19th IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom 2020): 4th International Workshop on Cyberspace Security (IWCSS 2020) - Guangzhou University, Guangzhou, China
Duration: 29 Dec 20201 Jan 2021
http://ieee-trustcom.org/TrustCom2020/

Conference

Conference19th IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom 2020)
Country/TerritoryChina
CityGuangzhou
Period29/12/201/01/21
Internet address

Keywords

  • phishing detection
  • bio-inspired optimisation
  • anti-phishing
  • optimisation

Fingerprint

Dive into the research topics of 'Phishing Web Page Detection using Optimised Machine Learning'. Together they form a unique fingerprint.

Cite this