Telsoc
Published on Telsoc (https://telsoc.org)

Home > Improving Phishing Email Detection Using the Hybrid Machine Learning Approach

Improving Phishing Email Detection Using the Hybrid Machine Learning Approach

Naveen Palanichamy [1]

Multimedia University

Yoga Shri Murti [2]

Multimedia University


JTDE - Vol 11, No 3 - September 2023 [3]

[4]
59 [5]

Abstract

Phishing emails pose a severe risk to online users, necessitating effective identification methods to safeguard digital communication. Detection techniques are continuously researched to address the evolution of phishing strategies. Machine learning (ML) is a powerful tool for automated phishing email detection, but existing techniques like support vector machines and Naive Bayes have proven slow or ineffective in handling spam filtering. This study attempts to provide a phishing email detector and reliable classifier using a hybrid machine classifier with term frequency-inverse document frequency (TF-IDF) and an effective feature extraction technique (FET) on a real-world dataset from Kaggle. Exploratory data analysis is conducted to enhance understanding of the dataset and identify any conspicuous errors and outliers to facilitate the detection process. The FET converts the data text into a numerical representation that can be used for ML algorithms. The model’s performance is evaluated using accuracy, precision, recall, F1 score, receiver operating characteristic (ROC) curve and area under the ROC curve metrics. The research findings indicate that the hybrid model utilising TF-IDF achieved superior performance, with an accuracy of 87.5%. The paper offers valuable knowledge on using ML to identify phishing emails and highlights the importance of combining various models.
Article PDF: 
PDF icon 778-palanichamy-article-v11n3pp120-142.pdf [6]

Copyright notice:

Copyright is held by the Authors subject to the Journal Copyright notice. [7]

Cite this article as:

Naveen Palanichamy, Yoga Shri Murti . 2023. Improving Phishing Email Detection Using the Hybrid Machine Learning Approach . JTDE, Vol 11, No 3, Article 778. http://doi.org/10.18080/JTDE.v11n3.778 [8]. Published by Telecommunications Association Inc. ABN 34 732 327 053. https://telsoc.org [9]



Source URL:https://telsoc.org/journal/jtde-v11-n3/a778

Links
[1] https://telsoc.org/journal/author/naveen-palanichamy [2] https://telsoc.org/journal/author/yoga-shri-murti [3] https://telsoc.org/journal/jtde-v11-n3 [4] https://www.addtoany.com/share#url=https%3A%2F%2Ftelsoc.org%2Fjournal%2Fjtde-v11-n3%2Fa778&title=Improving%20Phishing%20Email%20Detection%20Using%20the%20Hybrid%20Machine%20Learning%20Approach%20 [5] https://telsoc.org/print/4139?rate=5y8yfBe5jRtqBPwNj-xWLh6ZemtxyeQSMt02t1PJtb8 [6] https://telsoc.org/sites/default/files/journal_article/778-palanichamy-article-v11n3pp120-142.pdf [7] https://telsoc.org/copyright [8] http://doi.org/10.18080/jtde.v11n3.778 [9] https://telsoc.org