Challenge: Analyzing Clusters

Apply what you have learned about the H2OKMeansEstimator algorithm to analyze the clusters.

We'll cover the following

Problem statement

This challenge centers on a dataset profiling drivers with key features like Avg_Distance_per_Day, Over_speeding_percentage, and Avg_Speed_per_Day. We want to apply the H2OKMeansEstimator algorithm to perform K-Means clustering on this dataset and generate box plots for the respective clusters. These plots will help us discern each segment’s unique traits and behaviors within the dataset, enhancing our understanding of the key differentiators.

Driver segments

Click the “Run” button below to check out some dataset samples and the list of input features.

Get hands-on with 1200+ tech skills courses.