Telecom Churn Prediction

Business Analysis and Product Research

What?

Developed a predictive model to identify customers at risk of churning for an Iranian telecom provider entering a competitive market. The aim was to inform data-driven retention strategies by analyzing usage behavior, service interactions, and customer value indicators.

Why?

In highly competitive sectors like telecommunications, customer churn poses a significant risk to profitability. This project provided:

  • Predictive insight to support targeted customer retention efforts

  • Business-aligned interpretations to guide decision-makers

  • Deeper understanding of behavioral and service-related churn drivers

How?

📈 Data Analysis & Feature Engineering

  • Conducted a correlation matrix analysis to explore relationships between variables such as usage frequency, complaints, and customer status

  • Applied Variance Inflation Factor (VIF) analysis to detect multicollinearity (i.e., highly correlated predictors), and refined the dataset by removing or transforming problematic variables to enhance model interpretability

🤖 Predictive Modeling & Optimization

Tested multiple classification algorithms including:

  • Logistic Regression

  • K-Nearest Neighbors (KNN)

  • Quadratic Discriminant Analysis (QDA)

  • Naive Bayes

Refined the logistic regression model by selecting significant predictors (e.g., call failures, complaints, subscription length), improving model accuracy from 87.9% to 89.9%

🔎 Model Evaluation & Threshold Tuning

Evaluated models using standard metrics:

  • Accuracy

  • Confusion Matrix (True Positives, True Negatives, False Positives, False Negatives)

  • Area Under the Receiver Operating Characteristic Curve (AUC-ROC) — achieved a high AUC score of 0.93, indicating strong sensitivity and model performance

  • Optimized the classification threshold from the default 0.5 to 0.51, reducing error rate and improving predictive precision

🔁 Validation Strategy

  • Employed 10-Fold Cross-Validation, a model evaluation technique that partitions the dataset into 10 parts to ensure robustness and minimize overfitting

Outcomes

Outcomes

Outcomes

  • Led evaluation of model performance and validation strategy

  • Contributed to statistical testing, threshold tuning, and variable refinement

  • Co-authored stakeholder-focused presentations linking findings to business value

Links.

© 2025 • Snehasini M Antonious

unsplash.com/@reddfrancisco

Create a free website with Framer, the website builder loved by startups, designers and agencies.