Phishing Detection Project

Group Members

Oğuz GÖZLÜ - 2200356018
Betül Sema MANAV - 2200356019

Supervisor: Dr. Dilmurod Vahabdjanov

Promotional Video

Poster

Project Explanation

In today’s world, phishing attacks are a common vector for a wide range of cyberattacks and are one of the most prominent causes of data breaches. As the number of attacks grows daily, it has become essential to use automated tools that leverage AI and machine learning methodologies to analyze potential phishing sources, one of which is email. We have proposed this research-themed project as a means to learn the machine learning algorithms used in this context, as well as to raise awareness about phishing attacks.

Our solution is a hybrid approach that uses both traditional machine learning algorithms and CNNs to improve phishing email detection. We use two datasets, Nazario and Enron, to train and evaluate our models

Implementation Details

Project Flowchart

Comparison With the State-Of-The-Art

Model	Accuracy	F1-score
k-Nearest Neighbors	98.42%	97.72%
Naïve Bayes	91.38%	91.88%
Logistic Regression	95.74%	95.57%
Support Vector Machine	98.68%	98.38%
Random Forest	97.71%	97.59%
CNN (batch size 64)	max: 99.00% avg: 98.38%	max: 89.85% avg: 71.48%
CNN (batch size 32)	max: 99.06% avg: 98.70%	max: 94.70% avg: 81.90%

Project Source Code

Download Source Code