NORMA eResearch @NCI Library

Classification of Airline Customer Sentiment Expressed in Twitter Tweets using Lexicons, Decision Tree, and Naïve Bayes

Higgins, Liam (2022) Classification of Airline Customer Sentiment Expressed in Twitter Tweets using Lexicons, Decision Tree, and Naïve Bayes. Masters thesis, Dublin, National College of Ireland.

[thumbnail of Master of Science]
Preview
PDF (Master of Science)
Download (822kB) | Preview
[thumbnail of Configuration manual]
Preview
PDF (Configuration manual)
Download (700kB) | Preview

Abstract

This paper describes Natural Language Processing (NLP) and Machine Learning approaches to Sentiment Analysis of Twitter Tweets relating to commercial passenger airlines. By extracting and analysing textual data obtained in real-time from the social media platform, Twitter, the research proposes a methodology to collect, process, and interpret emotional responses contained within Tweets. Two main approaches to classifying sentiment are described. Firstly, lexicon-based approaches using three valence lexicons (Syuzhet, Afinn, and Bing) and one emotion lexicon (NRC) to determine the semantic orientation of words found within Tweet text are discussed. Secondly, two supervised machine learning classification algorithms (Naïve Bayes and Decision Tree) are used to perform sentiment classification. The goal of the research is to provide a diverse and commercially useful method for airlines to monitor customer sentiment relating to their experiences of airline services. The importance and commercial application of obtaining customer insights from Tweets which have been posted online and describe customer experiences and attitudes is discussed. The paper aims to provide airlines with a means to improve service offerings, differentiate from competitors, and gain competitive advantage based on analysing customer sentiment to their services. A maximum accuracy of 71% was achieved using a Naïve Bayes classifier algorithm.

Item Type: Thesis (Masters)
Subjects: H Social Sciences > HE Transportation and Communications
Q Science > QA Mathematics > Electronic computers. Computer science
T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science
P Language and Literature > P Philology. Linguistics > Computational linguistics. Natural language processing
Q Science > Q Science (General) > Self-organizing systems. Conscious automata > Machine learning
Z Bibliography. Library Science. Information Resources > ZA Information resources > ZA4150 Computer Network Resources > The Internet > World Wide Web > Websites > Online social networks
T Technology > TK Electrical engineering. Electronics. Nuclear engineering > Telecommunications > The Internet > World Wide Web > Websites > Online social networks
Divisions: School of Computing > Master of Science in Data Analytics
Depositing User: Tamara Malone
Date Deposited: 26 Jan 2023 15:10
Last Modified: 03 Mar 2023 11:32
URI: https://norma.ncirl.ie/id/eprint/6131

Actions (login required)

View Item View Item