Muthuraj, Diwakar (2024) Sentiment Analysis in Tamil-English Code-Mixed Data Using Hybrid Deep Learning Techniques. Masters thesis, Dublin, National College of Ireland.
Abstract
Sentiment Analysis (SA) is the process of classifying the sentiment expressed in data as positive, negative, or neutral. Important real-world applications include tracking social media trends, analysing customer feedback, following political discourse, and gathering market insights. Given this breadth of applications, SA plays a major role in Natural Language Processing. In social media posts, comments, e-commerce product reviews, and online forums, bilingual communities largely use code-mixed text, in which words from different languages are frequently interchanged to express opinions. Code-mixed text contains non-standard grammar, transliterations, and slang, and these complexities make sentiment analysis challenging. Tamil-English, one of the most widely used code-mixes, is chosen for this research project, which examines hyperparameter fine-tuning of hybrid models to classify sentiment in Tamil-English code-mixed data efficiently. In this project, various experiments are carried out with a base model, mBERT+TextGCN, using different tools and techniques to prepare the data for the model. These steps include preprocessing, handling class imbalance, feature engineering, and feature extraction. To further improve the efficiency of the proposed IndicBART+TextGCN model, its hyperparameters are fine-tuned, and the model is evaluated using accuracy, precision, recall, F1 score, and a confusion matrix. With these techniques, the IndicBART+TextGCN model achieved weighted-average precision of 0.71, recall of 0.68, and F1 score of 0.67. This result shows that preprocessing, handling class imbalance, feature engineering, and efficient fine-tuning of IndicBART+TextGCN have improved this hybrid model’s ability to classify sentiment in Tamil-English code-mixed data.
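The weighted-average metrics reported above can be illustrated with a minimal sketch. It assumes the usual definition, in which each class's precision, recall, and F1 score is weighted by that class's share of the true labels. The function name `weighted_scores` and the toy labels are hypothetical and are not taken from the thesis or its data.

```python
from collections import Counter


def weighted_scores(y_true, y_pred):
    """Return support-weighted (precision, recall, f1) for multi-class labels.

    Illustrative helper only; the thesis does not state how its metrics
    were computed.
    """
    labels = sorted(set(y_true) | set(y_pred))
    support = Counter(y_true)          # number of true examples per class
    total = len(y_true)
    precision = recall = f1 = 0.0
    for label in labels:
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
        predicted = sum(1 for p in y_pred if p == label)
        actual = support[label]
        p_c = tp / predicted if predicted else 0.0
        r_c = tp / actual if actual else 0.0
        f_c = 2 * p_c * r_c / (p_c + r_c) if (p_c + r_c) else 0.0
        weight = actual / total        # class share of the true labels
        precision += weight * p_c
        recall += weight * r_c
        f1 += weight * f_c
    return precision, recall, f1


# Made-up labels for a three-class sentiment task (not thesis data).
y_true = ["pos", "pos", "pos", "neg", "neg", "neu"]
y_pred = ["pos", "pos", "neg", "neg", "neu", "neu"]
p, r, f = weighted_scores(y_true, y_pred)
print(f"precision={p:.2f} recall={r:.2f} f1={f:.2f}")
```

Under this definition the weighted recall always equals overall accuracy, while weighted precision and F1 can differ from it when the classes are imbalanced.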