Phishing Detection Project

Group Members

Supervisor: Dr. Dilmurod Vahabdjanov

Promotional Video

Poster

Download Poster

Project Explanation

In today’s world, phishing attacks are a common vector for a wide range of cyberattacks and are one of the most prominent causes of data breaches. As the number of attacks grows daily, it has become essential to use automated tools that leverage AI and machine learning methodologies to analyze potential phishing sources, one of which is email. We have proposed this research-themed project as a means to learn the machine learning algorithms used in this context, as well as to raise awareness about phishing attacks.

Our solution is a hybrid approach that uses both traditional machine learning algorithms and CNNs to improve phishing email detection. We use two datasets, Nazario and Enron, to train and evaluate our models

Implementation Details

Project Flowchart

Comparison With the State-Of-The-Art

Model Accuracy F1-score
k-Nearest Neighbors 98.42% 97.72%
Naïve Bayes 91.38% 91.88%
Logistic Regression 95.74% 95.57%
Support Vector Machine 98.68% 98.38%
Random Forest 97.71% 97.59%
CNN (batch size 64) max: 99.00% avg: 98.38% max: 89.85% avg: 71.48%
CNN (batch size 32) max: 99.06% avg: 98.70% max: 94.70% avg: 81.90%

Project Source Code

Download Source Code