Klasifikasi Customer Churn Menggunakan XGBoost dengan Optimasi GridSearchCV Berbasis Shapley Additive Explanations

Authors

  • I Gusti Ayu Riyana Astarani Universitas Udayana Author
  • Luh Arida Ayu Rahning Putri Universitas Udayana Author

DOI:

https://doi.org/10.24843/JNATIA.2025.v04.i01.p01

Keywords:

Customer churn, XGBoost, GridSearchCV, SHAP

Abstract

Customer churn is a significant challenge in the banking sector, often leading to revenue loss and requiring predictive strategies to enhance customer retention. This study implements the Extreme Gradient Boosting (XGBoost) algorithm for churn classification, with hyperparameter optimization using the GridSearchCV technique to improve model performance. The dataset comprises 10,000 banking customers with 9 features and 1 target label. To address class imbalance, the Synthetic Minority Over-sampling Technique (SMOTE) was applied. Prior to tuning, the XGBoost model achieved an accuracy of 80.8%. After applying optimal parameters, the model's performance improved to 81.5%, along with higher precision and recall values, indicating improved robustness and consistency. For model interpretability, Shapley Additive Explanations (SHAP) were used and visualized through a beeswarm Plot. The analysis identified age, customer activity status, and number of products owned as the most influential features in predicting churn. Based on these findings, this study proposes business recommendations including age-based customer segmentation, enhancing active customer engagement, and optimizing product offerings as strategies to reduce churn.

Downloads

Published

2025-11-01

How to Cite

[1]
I Gusti Ayu Riyana Astarani and Luh Arida Ayu Rahning Putri, “Klasifikasi Customer Churn Menggunakan XGBoost dengan Optimasi GridSearchCV Berbasis Shapley Additive Explanations”, Jnatia, vol. 4, no. 1, pp. 1–10, Nov. 2025, doi: 10.24843/JNATIA.2025.v04.i01.p01.