NORMA eResearch @NCI Library

Using hybrid deep learning and word embedding based approach for advance cyberbullying detection

Bhatt, Jigar (2020) Using hybrid deep learning and word embedding based approach for advance cyberbullying detection. Masters thesis, Dublin, National College of Ireland.

[img]
Preview
PDF (Master of Science)
Download (584kB) | Preview
[img]
Preview
PDF (Configuration manual)
Download (1MB) | Preview

Abstract

The ever-increasing use of social media in the internet space have induced a number of problems like cyberbullying and cyberaggression over the internet. Researchers have made a commendable progress on the ongoing fight against cyberbullying but a lot of unresolved issues still persist that primarily motivates the purpose of the research. The paper aims to integrate recent advances in the field of word embedding like fastText, ELMo and stacked flair embeddings combined with a host of robust deep learning techniques to further the efficiency of detection over the state-of-art. Two distinct datasets Formspring and Wikipedia were requested and processed for the purpose of the research. A number of different combinations of word embedding with deep learning methods were tested and compared with CNN with ELMo embedding delivering the most promising results with an F1 score of 0.82 on both datasets. On the other hand, CNN with fastText obtained F1 score of 0.82 on Formspring and 0.64 on Wikipedia dataset but was computationally faster than the counterparts. Moreover, transfer learning was performed using the models to test and prove the robustness and efficacy of the models. The system performed considerably well with superior scores in precision, recall and F1 over the state-of-the-art across all the test cases performed.

Item Type: Thesis (Masters)
Subjects: Q Science > QA Mathematics > Electronic computers. Computer science
T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science

Q Science > QA Mathematics > Computer software
T Technology > T Technology (General) > Information Technology > Computer software
Divisions: School of Computing > Master of Science in Data Analytics
Depositing User: Dan English
Date Deposited: 18 Jan 2021 16:31
Last Modified: 18 Jan 2021 16:31
URI: http://norma.ncirl.ie/id/eprint/4384

Actions (login required)

View Item View Item