NORMA eResearch @NCI Library

Learning to detect fake online reviews using readability tests and text analytics

Shetty, Siddhanth Chandrahas (2019) Learning to detect fake online reviews using readability tests and text analytics. Masters thesis, Dublin, National College of Ireland.

[thumbnail of Master of Science]
Preview
PDF (Master of Science)
Download (1MB) | Preview

Abstract

A customer highly relies on reviews when buying any product online, hence playing a crucial part in the customer's decision-making process. With the rise of online communities and portals, millions of reviews are getting posted and determining the credibility of them with such a high volume data is difficult. Although it is essential to classify them, as it profoundly impacts the business. Due to its hidden nature, fake reviews are used by companies to increase their market strength, which is a matter of concern. Many studies have been conducted with respect to this domain, where different statistical and textual analysis was performed to identify fake and genuine reviews. In this research, we propose the use of readability tests as features in combination with other general ratings and textual features on restaurant reviews datasets from Yelp for online spam review detection. We use supervised machine learning techniques such as Naïve Bayes, XGBoost, AdaBoost, and Gradient Boosting Machine for the classification of reviews using the mentioned feature sets. The results by the models are promising and displays the effectiveness of the proposed models in detecting fake reviews.

Item Type: Thesis (Masters)
Subjects: Q Science > QA Mathematics > Electronic computers. Computer science
T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science
Q Science > QA Mathematics > Computer software
T Technology > T Technology (General) > Information Technology > Computer software
H Social Sciences > HF Commerce > Electronic Commerce
Divisions: School of Computing > Master of Science in Data Analytics
Depositing User: Caoimhe Ní Mhaicín
Date Deposited: 11 Oct 2019 14:20
Last Modified: 11 Oct 2019 14:20
URI: https://norma.ncirl.ie/id/eprint/3849

Actions (login required)

View Item View Item