Comparative Study of Different Machine Learning Models for Customer Churn Analysis Using SMOTE and Feature Variation Along with Customer Segmentation

Mary Shana Thankam*, Neamat Elgayar

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Customer churn is a major issue faced by the companies in both the online and offline markets, which adversely affects profit and revenue. Recently, Machine Learning (ML) is being used to analyze and predict customer churns. In this research the problem of churn prediction is studied with special focus on feature selection and unbalanced data sets. Also, churn analysis has mainly dealt with prediction and not methods to retain customers. In our study, we use a customer dataset from a US telecom company. We compare several classifiers for churn prediction including logistic regression, decision trees, SVM, random forest, k-NN and XgBoost. Besides, methods to retain the customers are discussed. The importance of feature selection is highlighted in this paper and a detailed experimental study of model performance on balanced and unbalanced datasets are explored. After comparing the F1 scores, AUC scores and precision-recall curve, it is seen that XgBoost outperformed all the other algorithms. On the other hand, retaining customers requires the careful study of their behavioral patterns. Customer segmentation is an effective way used by the marketing teams to identify the different groups of customers. In this paper, k-means, agglomerative clustering, gaussian mixture (GM) and Density-Based Spatial Clustering of Applications with Noise (DBSCAN) are used for clustering the customers into segments. We evaluate the clustering results using silhouette analysis.
Original languageEnglish
Title of host publication2023 International Conference on Modeling, Simulation & Intelligent Computing (MoSICom)
EditorsJagadish Nayak, Vilas H Gaidhane, Nilesh Goel
PublisherIEEE
ISBN (Electronic)9798350393415
ISBN (Print)9798350393422
DOIs
Publication statusPublished - 19 Mar 2024
EventIEEE International Conference on Modelling, Simulation and Intelligent Computing 2023 - Dubai, United Arab Emirates
Duration: 7 Dec 20239 Dec 2023
https://www.bits-pilani.ac.in/news/ieee-international-conference-mosicom-2023/

Conference

ConferenceIEEE International Conference on Modelling, Simulation and Intelligent Computing 2023
Abbreviated titleMoSICom 2023
Country/TerritoryUnited Arab Emirates
CityDubai
Period7/12/239/12/23
Internet address

Keywords

  • Classification
  • Clustering
  • Customer Churn
  • Decision Trees
  • k Nearest Neighbor
  • Machine Learning
  • Random Forest
  • Silhouette Analysis
  • Support Vector Machine

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Hardware and Architecture
  • Energy Engineering and Power Technology
  • Electrical and Electronic Engineering
  • Modelling and Simulation

Fingerprint

Dive into the research topics of 'Comparative Study of Different Machine Learning Models for Customer Churn Analysis Using SMOTE and Feature Variation Along with Customer Segmentation'. Together they form a unique fingerprint.

Cite this