NORMA eResearch @NCI Library

Email Spam Detection: Leveraging Fine-Tuned Transformer Models with Attention Mechanism

Shah, Samrat Sanjaykumar (2024) Email Spam Detection: Leveraging Fine-Tuned Transformer Models with Attention Mechanism. Masters thesis, Dublin, National College of Ireland.

[thumbnail of Master of Science]
Preview
PDF (Master of Science)
Download (1MB) | Preview
[thumbnail of Configuration Manual]
Preview
PDF (Configuration Manual)
Download (2MB) | Preview

Abstract

Due to ongoing threats to email security, it is becoming increasingly important to use advanced methods to consistently get rid of unwanted emails. To meet this need three advanced machine learning (ML) techniques DistilBERT, XLM-RoBERTa, and RoBERTa are tested to see how well they can find spam emails. Along with that pre-trained ML systems are tuned on the Enron-Spam dataset, which is a standard way to test how well spam identification works. Metrics like accuracy, precision, recall, and F1-score are used to test and analyze these improved systems in great depth to see how well they work. The research also investigates how focusing features built into these designs can make the models more accurate and clearer. The results show that the best method is the improved DistilBERT model, which is 96% accurate. The study shows that focusing mechanisms are important for making these models work better by helping with more accurate feature extraction and classification. Furthermore, this study adds to the progress in email security by showing how advanced ML can be used to find spam and how important narrowing methods are for making models work better. These findings are important for making spam filtering technologies better and more reliable. This will improve email security and the user experience in today's digital world.

Item Type: Thesis (Masters)
Supervisors:
Name
Email
Mustafa, Raza Ul
UNSPECIFIED
Uncontrolled Keywords: Email Spam Detection; Deep Learning; Transformer Models; RoBERTa; XLM-RoBERTa; DistilBERT; Attention Mechanisms
Subjects: Q Science > QA Mathematics > Electronic computers. Computer science
T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science
Q Science > QA Mathematics > Computer software > Computer Security
T Technology > T Technology (General) > Information Technology > Computer software > Computer Security
Z Bibliography. Library Science. Information Resources > ZA Information resources > ZA4150 Computer Network Resources > The Internet > Electronic Mail
T Technology > TK Electrical engineering. Electronics. Nuclear engineering > Telecommunications > The Internet > Electronic Mail
Q Science > Q Science (General) > Self-organizing systems. Conscious automata > Machine learning
Divisions: School of Computing > Master of Science in Cyber Security
Depositing User: Ciara O'Brien
Date Deposited: 05 Jun 2025 10:33
Last Modified: 05 Jun 2025 10:33
URI: https://norma.ncirl.ie/id/eprint/7751

Actions (login required)

View Item View Item