Business Intelligence for Sales Directors
Clustering is an unsupervised machine learning technique that groups similar data points together based on their characteristics. In e-commerce, clustering can help identify natural groupings of products, customers, or orders, enabling more targeted strategies for inventory management, marketing, and customer service.
We applied K-means clustering to product dimensions (weight, length, height, width) to identify natural groupings of products based on their physical characteristics.
Figure 1: Elbow method for determining optimal number of clusters
The elbow method suggests that 4 clusters is an optimal choice for grouping products based on their dimensions, as it represents a good balance between cluster cohesion and model complexity.
Figure 2: Visualization of product clusters in 2D space (using PCA for dimensionality reduction)
| Cluster | Count | Weight (g) | Length (cm) | Height (cm) | Width (cm) | Description |
|---|---|---|---|---|---|---|
| Cluster 1 | 21,342 | 621 | 21.6 | 11.0 | 16.6 | Small items |
| Cluster 2 | 1,245 | 16,432 | 53.9 | 45.1 | 42.8 | Very large items |
| Cluster 3 | 7,854 | 2,634 | 30.3 | 32.4 | 26.6 | Medium items |
| Cluster 4 | 2,510 | 2,712 | 52.2 | 12.7 | 34.4 | Long items |
Business Insight: The clustering analysis has identified four distinct product groups based on their physical dimensions:
These clusters can inform inventory management strategies, warehouse organization, packaging solutions, and shipping cost optimization. For example, different storage solutions can be designed for each cluster, and shipping carriers can be selected based on their suitability for handling items in each cluster.
We applied K-means clustering to order items based on price, freight value, and product dimensions to identify patterns in customer purchases.
Figure 3: Visualization of order item clusters in 2D space (using PCA for dimensionality reduction)
| Cluster | Count | Price (BRL) | Freight Value (BRL) | Weight (g) | Description |
|---|---|---|---|---|---|
| Cluster 1 | 78,452 | 96.32 | 18.75 | 734 | Low-cost small items |
| Cluster 2 | 18,764 | 114.28 | 24.63 | 2,687 | Medium-cost medium items |
| Cluster 3 | 2,225 | 401.56 | 42.87 | 13,024 | High-cost large items |
Business Insight: The clustering analysis has identified three distinct order item groups:
These clusters reveal distinct purchasing patterns that can inform marketing strategies, pricing models, and customer segmentation. For example:
The clustering results provide several practical applications for e-commerce business operations:
Our clustering analysis has revealed natural groupings in our e-commerce data:
These insights provide a foundation for more targeted business strategies across inventory management, logistics, marketing, and customer service, ultimately leading to improved operational efficiency and customer satisfaction.