NORMA eResearch @NCI Library

Enhancing Customer Churn Prediction in the Telecom Sector Using Advance Machine Learning Techniques and Explainable AI

Bhushan, Shreyas Bhargav (2024) Enhancing Customer Churn Prediction in the Telecom Sector Using Advance Machine Learning Techniques and Explainable AI. Masters thesis, Dublin, National College of Ireland.

[thumbnail of Master of Science]
Preview
PDF (Master of Science)
Download (1MB) | Preview
[thumbnail of Configuration Manual]
Preview
PDF (Configuration Manual)
Download (1MB) | Preview

Abstract

In telecommunications industry, customer churn is one of biggest issues that is faced by the operators, it is phenomenon where the customers stop using their services or switch to other operators. Telecommunications companies profitability can be heavily impacted by high churn rates, as acquiring new customers is often easy when compared to keeping existing customers. The study proposes a robust and explainable machine learning model to predict and control the customer churn. The proposed framework integrates heterogenous multi-stacking of ensemble technique, combining base models like random Forest, XGBoost, k-Nearest Neighbors (KNN) with logistic regression as meta model. To select the significant features we have applied Recursive Feature Elimination (RFE), and Synthetic Minority Oversampling Technique (SMOTE) was implemented to handle the imbalance of the class. Stratified k-fold was applied to cross validate the performance of the models. The multi-stacked model outperformed all the base models with an accuracy of 81%, while maintaining balance between recall and precision. The evaluation metrics like Accuracy, Precision, Recall, F1-score, ROCAUC score and confusion matrix was used to validate the efficiency of the model. To address the “black box” nature of the ensemble model, the Explainable AI technique called as SHapley Additive exPlanations (SHAP) was used to improve the interpretability of the models, the technique provided insights for both global and local important features. SHAP helped to identify the significant features influencing the churn like contract type, tenure, and monthly charges. These insights help to gain the trust of the stakeholders and design targeted retention strategies.

Item Type: Thesis (Masters)
Supervisors:
Name
Email
Tomer, Vikas
UNSPECIFIED
Uncontrolled Keywords: Customer Churn Prediction; Random Forest; XGBoosting; Logistic Regression; k-Nearest Neighbor SHapley Additive exPlanations (SHAP)
Subjects: Q Science > QA Mathematics > Electronic computers. Computer science
T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science
Q Science > QH Natural history > QH301 Biology > Methods of research. Technique. Experimental biology > Data processing. Bioinformatics > Artificial intelligence
Q Science > Q Science (General) > Self-organizing systems. Conscious automata > Artificial intelligence
H Social Sciences > HF Commerce > Marketing > Consumer Behaviour
Q Science > Q Science (General) > Self-organizing systems. Conscious automata > Machine learning
T Technology > TK Electrical engineering. Electronics. Nuclear engineering > Telecommunications
Divisions: School of Computing > Master of Science in Data Analytics
Depositing User: Ciara O'Brien
Date Deposited: 01 Sep 2025 15:23
Last Modified: 01 Sep 2025 15:23
URI: https://norma.ncirl.ie/id/eprint/8683

Actions (login required)

View Item View Item