Abstract
Phishing is a type of social engineering attack that can affect any company or individual. This paper explores the effect that different features and optimisation techniques have on the accuracy of intelligent phishing detection using machine learning algorithms, covering both hyperparameter optimisation and feature selection optimisation. For hyperparameter tuning, both TPE (Tree-structured Parzen Estimator) and GA (Genetic Algorithm) were tested, with the best option being model dependent. For feature selection, GA, MFO (Moth Flame Optimisation) and PSO (Particle Swarm Optimisation) were used, with PSO working best with a Random Forest model. The features used relate to the URL (Uniform Resource Locator), DOM (Document Object Model) structure, page rank and page information. The best combination found was Random Forest using PSO for feature selection and TPE for hyperparameter optimisation, giving an accuracy of 99.33%.
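To make the idea of PSO-based feature selection more concrete, the following is a minimal sketch of binary PSO in plain Python. It is not the authors' implementation. Several things are assumptions made only for illustration: the data are synthetic, a simple nearest-centroid classifier stands in for the Random Forest, the fitness includes a hypothetical penalty of 0.01 per selected feature, and all function names and parameter values (`make_data`, `centroid_accuracy`, `binary_pso`, `w`, `c1`, `c2`) are invented here.

```python
import math
import random

def make_data(rng, n=200, d=8, informative=3):
    """Synthetic two-class data; only the first `informative` features carry signal."""
    X, y = [], []
    for i in range(n):
        label = i % 2
        X.append([rng.gauss(2.0 * label if j < informative else 0.0, 1.0)
                  for j in range(d)])
        y.append(label)
    return X, y

def centroid_accuracy(X_tr, y_tr, X_te, y_te, mask):
    """Test accuracy of a nearest-centroid classifier using only masked features."""
    cols = [j for j, keep in enumerate(mask) if keep]
    if not cols:
        return 0.0  # an empty feature subset cannot classify anything
    cents = {}
    for label in (0, 1):
        rows = [x for x, t in zip(X_tr, y_tr) if t == label]
        cents[label] = [sum(r[j] for r in rows) / len(rows) for j in cols]
    correct = 0
    for x, t in zip(X_te, y_te):
        dist = {c: sum((x[j] - m) ** 2 for j, m in zip(cols, cents[c]))
                for c in cents}
        correct += min(dist, key=dist.get) == t
    return correct / len(y_te)

def binary_pso(fitness, d, rng, n_particles=12, n_iter=30, w=0.7, c1=1.5, c2=1.5):
    """Binary PSO: each particle is a 0/1 mask over d features."""
    pos = [[int(rng.random() < 0.5) for _ in range(d)] for _ in range(n_particles)]
    vel = [[0.0] * d for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_fit = [fitness(p) for p in pos]
    g = max(range(n_particles), key=lambda i: pbest_fit[i])
    gbest, gbest_fit = pbest[g][:], pbest_fit[g]
    for _ in range(n_iter):
        for i in range(n_particles):
            for j in range(d):
                r1, r2 = rng.random(), rng.random()
                # Velocity is pulled towards the personal and global best masks.
                v = (w * vel[i][j]
                     + c1 * r1 * (pbest[i][j] - pos[i][j])
                     + c2 * r2 * (gbest[j] - pos[i][j]))
                vel[i][j] = max(-4.0, min(4.0, v))  # clamp to keep sigmoid responsive
                # Sigmoid of the velocity gives the probability that the bit is 1.
                pos[i][j] = int(rng.random() < 1.0 / (1.0 + math.exp(-vel[i][j])))
            f = fitness(pos[i])
            if f > pbest_fit[i]:
                pbest[i], pbest_fit[i] = pos[i][:], f
                if f > gbest_fit:
                    gbest, gbest_fit = pos[i][:], f
    return gbest, gbest_fit

rng = random.Random(0)  # fixed seed so the run is repeatable
X, y = make_data(rng)
X_tr, y_tr, X_te, y_te = X[:140], y[:140], X[140:], y[140:]

def fitness(mask):
    # Assumed objective: held-out accuracy minus a small per-feature penalty.
    return centroid_accuracy(X_tr, y_tr, X_te, y_te, mask) - 0.01 * sum(mask)

best_mask, best_fit = binary_pso(fitness, d=8, rng=rng)
print(best_mask, round(best_fit, 3))
```

In a setting closer to the one described in the abstract, the stand-in classifier would be replaced by a Random Forest and the synthetic columns by URL, DOM, page rank and page information features. Hyperparameter search (for example with TPE) would be a separate step that this sketch does not cover.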
Original language | English |
---|---|
Title of host publication | 2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom) |
Subtitle of host publication | 29 December 2020 – 1 January 2021 Guangzhou, China |
Editors | Guojun Wang, Ryan Ko, Md Zakirul Alam Bhuiyan, Yi Pan |
Place of Publication | Piscataway, NJ |
Publisher | IEEE |
Pages | 483-490 |
Number of pages | 8 |
ISBN (Electronic) | 9781665403924 |
ISBN (Print) | 9781665403931 |
Publication status | Published - Dec 2020 |
Event | 19th IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom 2020): 4th International Workshop on Cyberspace Security (IWCSS 2020), Guangzhou University, Guangzhou, China. Duration: 29 Dec 2020 → 1 Jan 2021. http://ieee-trustcom.org/TrustCom2020/ |
Conference
Conference | 19th IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom 2020) |
---|---|
Country/Territory | China |
City | Guangzhou |
Period | 29/12/20 → 1/01/21 |
Keywords
- phishing detection
- bio-inspired optimisation
- anti-phishing
- optimisation