NORMA eResearch @NCI Library

Transfer Learning for Identification of disaster tweets using fine-tuning DistilBERT

Deshmukh, Nikhil Vishnupant (2023) Transfer Learning for Identification of disaster tweets using fine-tuning DistilBERT. Masters thesis, Dublin, National College of Ireland.

[thumbnail of Master of Science]
Preview
PDF (Master of Science)
Download (1MB) | Preview
[thumbnail of Configuration manual]
Preview
PDF (Configuration manual)
Download (911kB) | Preview

Abstract

This study presents an in-depth exploration of tweet classification for disaster-related content detection, addressing the challenges posed by the proliferation of social media during crises. Three distinct models were developed and evaluated: CNN + LSTM with Glove Embedding, RoBERTa Transformer, and FineTune DistilBERT Transformer. These models were rigorously assessed for their ability to distinguish between disaster and non-disaster-related tweets, with a focus on accuracy, sensitivity, and specificity. The CNN + LSTM model exhibited promising precision but lacked recall for disastrous tweets. RoBERTa demonstrated enhanced performance owing to its extensive training data and methodology. However, the FineTune DistilBERT model emerged as the standout performer, showcasing a balanced sensitivity-specificity trade-off and achieving an impressive 89% accuracy. Leveraging its DistilBERT architecture, this model offers a compact yet powerful solution for accurate tweet classification. The findings underline the potential of transformer-based models in crisis informatics, specifically for identifying disaster-related content in social media streams. This study contributes to advancing rapid and accurate crisis response tools, empowering humanitarian organizations with improved insights and aiding decision-making in disaster scenarios. Further research avenues may explore ensemble methods and domain-specific fine-tuning to enhance model performance across diverse disaster contexts.

Item Type: Thesis (Masters)
Supervisors:
Name
Email
Tomer, Vikas
UNSPECIFIED
Uncontrolled Keywords: Transformer-based models; FineTune DistilBERT Transformer; CNN+LSTM; RoBERTa; Disaster contexts
Subjects: Q Science > QA Mathematics > Electronic computers. Computer science
T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science
P Language and Literature > P Philology. Linguistics > Computational linguistics. Natural language processing
Z Bibliography. Library Science. Information Resources > ZA Information resources > ZA4150 Computer Network Resources > The Internet > World Wide Web > Websites > Online social networks
T Technology > TK Electrical engineering. Electronics. Nuclear engineering > Telecommunications > The Internet > World Wide Web > Websites > Online social networks
Divisions: School of Computing > Master of Science in Data Analytics
Depositing User: Tamara Malone
Date Deposited: 08 Nov 2024 13:49
Last Modified: 08 Nov 2024 13:49
URI: https://norma.ncirl.ie/id/eprint/7178

Actions (login required)

View Item View Item