Post

Project: Kalbe X Rakamin Data Scientist

Project: Kalbe X Rakamin Data Scientist

Problem Statement

Inventory Team:

Predicting the sales quantity (quantity) for the overall Kalbe products.

  • Knowing the estimated quantity of products sold, the inventory team can create sufficient daily stock supplies.
  • The prediction should be done on a daily basis.

    Marketing Team:

    Creating clusters/segments of customers based on several criteria.

  • Creating customer segments.
  • The marketing team will use these customer segments to provide personalized promotions and sales treatment.

Tools:

  • Python
  • Jupyter Notebook
  • Tableau
  • Dbeaver
  • PostgreSQL

Exploration with PostgreSQL and Dbeaver

Average Customer Age based on Marital Status:

  • Married = 43
  • Single = 29

Average Customer Age based on Gender:

  • Woman = 40
  • Man = 39

Sotre with Highest Total Quantity: Lingga

Top Product by Total Amount: Cheese Stick

Tableau Dashboard

For interactive use, please visit: Tableau Tableau

Machine Learning

Time Series Model

MAE is found to be 0.24, meaning the model’s forecasts differ from the actual values by approximately 0.24 units. MAPE is 6.53%, meaning the model’s forecasts deviate from the actual values by about 6.53%. MSE is 0.08, meaning the model’s forecasts have, on average, squared differences of 0.08 with the actual values. RMSE is 0.29, meaning the model’s forecasts have a root mean squared difference of approximately 0.29 with the actual values.

Clustering Model

From the 2-cluster division, it is found that in cluster 1, customers have a higher preference for Thai Tea, Ginger Candy, and Oat. Meanwhile, in cluster 0, customers have a higher preference for Cheese Stick, Choco bar, Crackers, and Cashew.

Code source: Github

This post is licensed under CC BY 4.0 by the author.