XGBoost, which stands for Extreme Gradient Boosting, is a machine learning library that employs gradient boosting: it iteratively produces a boosted model by adding newly trained models to an ensemble. "Gradient" here refers to the use of gradient descent on the loss function, which ultimately determines the parameters of the new model to be added to the ensemble. At the end of the process, a better-performing model is produced.
From the diagram above:

1. First, we start off with a single naive model in the ensemble (usually a simple model with a modest accuracy or metric score) and make predictions.
2. The result of these predictions is then used to measure the loss obtained by the model. Here, metrics like mean squared error (MSE) and R-squared can be used, depending on the problem at hand.
3. The loss obtained in step 2 is then used to train a new model.
4. The newly trained model is then added to the ensemble.
5. This process continues until a model with a sufficiently low loss is obtained. At the end of the day, we can say that the model has been boosted! A minimal code sketch of this loop follows.
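To make the loop concrete, here is a minimal sketch of the gradient boosting idea for a squared-error loss, using scikit-learn decision trees as base learners. The toy data and the `n_rounds` and `learning_rate` values are illustrative assumptions, not XGBoost's internals:

```python
# a minimal sketch of gradient boosting for squared error;
# each new tree is fit to the residuals (negative gradients) of the ensemble
import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.RandomState(0)
X = rng.uniform(0, 10, size=(200, 1))            # toy data (illustrative)
y = np.sin(X).ravel() + rng.normal(0, 0.1, 200)

n_rounds, learning_rate = 50, 0.1                # illustrative hyperparameters
prediction = np.full(y.shape, y.mean())          # step 1: naive initial model
trees = []
for _ in range(n_rounds):
    residuals = y - prediction                   # step 2: negative gradient of squared loss
    tree = DecisionTreeRegressor(max_depth=3)    # step 3: train a new model on the residuals
    tree.fit(X, residuals)
    trees.append(tree)                           # step 4: add it to the ensemble
    prediction += learning_rate * tree.predict(X)  # step 5: boosted prediction
```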
XGBoost often outperforms conventional machine learning algorithms, as it has repeatedly proven to produce models with better accuracy or metric scores.
Because it provides interfaces for the Python and R programming languages, it has become popular among data science professionals. Additionally, it runs on Windows, Linux, macOS, and other operating systems.
To illustrate how XGBoost is used for making predictions, we will use the Boston housing dataset. Note that `load_boston` was deprecated in scikit-learn 1.0 and removed in version 1.2, so the loading code below requires an older scikit-learn version.
```python
# importing necessary libraries and modules
import pandas as pd
from sklearn.datasets import load_boston

boston = load_boston()  # loading the dataset
data = pd.DataFrame(boston.data)  # converting to a pandas DataFrame
data.columns = boston.feature_names  # obtaining column names

print(f"Columns of our dataset: {data.columns}")
print("\n", f"Shape of data: {data.shape}")
print(data.head(4))
```
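If you are on a newer scikit-learn version where `load_boston` has been removed, a commonly circulated workaround is to fetch the raw data directly. This is a sketch that assumes network access to the original data URL:

```python
# a sketch for scikit-learn >= 1.2, where load_boston has been removed;
# fetches the raw Boston housing data directly (network access assumed)
import numpy as np
import pandas as pd

url = "http://lib.stat.cmu.edu/datasets/boston"
raw = pd.read_csv(url, sep=r"\s+", skiprows=22, header=None)

# each observation spans two physical rows in the raw file
features = np.hstack([raw.values[::2, :], raw.values[1::2, :2]])
columns = ["CRIM", "ZN", "INDUS", "CHAS", "NOX", "RM", "AGE",
           "DIS", "RAD", "TAX", "PTRATIO", "B", "LSTAT"]
data = pd.DataFrame(features, columns=columns)
```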
We will not be doing much exploratory data analysis (EDA), but we still need to understand our data by using the code below:
```python
# getting a statistical description of our data
print(data.describe())

# getting information about our data
data.info()
```
Here, the `LSTAT` column will serve as our target variable (i.e., what our model will be predicting), while the rest of our columns will serve as our features.
```python
# selecting the features
X = data.drop("LSTAT", axis=1)

# selecting the target variable
y = data.LSTAT.to_frame()

print(X.head())
print(y.head())
```
In the code below, we will use the `train_test_split()` function to split our data into training and validation sets. We will use 80% of our data for training, while the rest will be used for validation.
```python
from sklearn.model_selection import train_test_split

X_train, X_valid, y_train, y_valid = train_test_split(X, y, train_size=0.8)

print("X_train: ", X_train.shape)
print("X_valid: ", X_valid.shape)
print("y_train: ", y_train.shape)
print("y_valid: ", y_valid.shape)
```
Standardizing our data improves its quality for the model to learn from. We will make use of the `StandardScaler()` class to standardize our training and validation features.
```python
from sklearn.preprocessing import StandardScaler

scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_valid = scaler.transform(X_valid)  # reusing the scaler fitted on the training data

# taking a look at our scaled training data
print(X_train)
```
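Note that the scaler is fitted on the training features only and then reused to transform the validation features; fitting a separate scaler on the validation set would let information from the validation data leak into the preprocessing step.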
Now we will make use of the `XGBRegressor` model from `xgboost`, which is a wrapper interface for `xgboost` following the scikit-learn API. This is because we are dealing with a regression problem. If this were a classification problem, we would use the `XGBClassifier` model.
```python
from xgboost import XGBRegressor
model = XGBRegressor()
model.fit(X_train, y_train)
prediction = model.predict(X_valid)
print(prediction)
```
- Line 1: We import the `XGBRegressor` model from `xgboost`.
- Line 2: We declare an instance of the `XGBRegressor` model and assign it to a variable, `model`.
- Line 3: The model is then trained on the training sets of our features and target variable.
- Line 4: We make predictions on the validation data using the `.predict()` method. The result is assigned to a variable, `prediction`.
- Line 5: We print the output of the prediction.
We will be using the root mean squared error (RMSE) metric to evaluate our regression model.
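As a reminder, RMSE is the square root of the average squared difference between the predictions and the true values, i.e., RMSE = √((1/n) Σᵢ (ŷᵢ − yᵢ)²); lower values indicate a better fit.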
```python
import numpy as np
from sklearn.metrics import mean_squared_error

score = np.sqrt(mean_squared_error(y_valid, prediction))
print(f"RMSE: {score}")
```
It is worth noting that, so far, we have only used the default parameter values of the `XGBRegressor` model. Now, let's take a closer look at the model, explore its parameters, and ultimately choose the best parameter values (hyperparameters) for a better-performing model; this process is called hyperparameter tuning. We will make use of `GridSearchCV` from the `sklearn.model_selection` module for the tuning process.
Below are the most commonly tuned hyperparameters of the `XGBRegressor` algorithm (a short sketch of setting them follows the list):

- `learning_rate` (`float`): Typical values range between 0.01–0.2. This specifies how quickly the model fits the residual errors by using additional base learners.
- `max_depth` (`int`): Typical values range between 1–10. This specifies how deep the nodes of each decision tree can go. It cannot take a negative number.
- `gamma` (`float`): Typical values range between 0–0.5. This is the minimum loss reduction required to make a further partition on a leaf node of the tree.
- `subsample` (`float`): Typical values range between 0.5–0.9. This represents the fraction of the training data used to train each tree.
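As an illustration, these hyperparameters map directly onto the `XGBRegressor` constructor. The values below are arbitrary examples within the typical ranges, not recommended settings:

```python
from xgboost import XGBRegressor

# arbitrary example values within the typical ranges shown above
example_model = XGBRegressor(
    learning_rate=0.1,   # how quickly the model fits the residual errors
    max_depth=5,         # maximum depth of each decision tree
    gamma=0.25,          # minimum loss reduction to split a leaf node
    subsample=0.8        # fraction of the training data used per tree
)
```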
A parameter grid is a dictionary that contains candidate values for each hyperparameter needed in the tuning process, so that after tuning, the best values are chosen automatically.
We will use only a few parameters for tuning, as trying to use all of them can be very time-consuming.
```python
# creating a parameter grid
param_grid = {
    "learning_rate": [0.1, 0.01, 0.05],
    "max_depth": [3, 4, 5, 7],
    "gamma": [0, 0.25, 1]
}
```
In the code below, we will import `GridSearchCV`, which will be used for the tuning process.
```python
from sklearn.model_selection import GridSearchCV
# initializing the XGBRegressor
xgb_model = XGBRegressor()
# performing the tuning operation
grid_cv = GridSearchCV(xgb_model, param_grid, n_jobs=-1,
                       cv=3, scoring="neg_root_mean_squared_error")
# Training our model
new_model = grid_cv.fit(X_train, y_train)
```
- Line 1: We import `GridSearchCV`.
- Line 3: We declare an instance of the `XGBRegressor` model.
- Line 5: We set up the tuning process using `GridSearchCV`.
- Line 8: We fit the training datasets to the new model, `new_model`.
In the code below, we will make a prediction using the new model we just obtained from the tuning process. We will also evaluate it using the RMSE metric, as we did for our first model in the previous section.
```python
# making a prediction
prediction2 = new_model.predict(X_valid)

# model evaluation
rmse = np.sqrt(mean_squared_error(y_valid, prediction2))
print("\n", f"RMSE: {rmse}")
```
Bravo! We can now see that the root mean squared error of the new model, `new_model`, is lower than that of the initial model, `model`.
After the tuning process, a model is created using the best parameter values and automatically trained with these hyperparameters on the dataset. Now, how do we obtain the best hyperparameter values that this model used?
To obtain the best parameter values used by the new model, we simply use the `.best_params_` attribute. We will illustrate this in the code below:
```python
print(new_model.best_params_)
```
The output above shows exactly the parameter values our new model used to achieve its better performance.
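Beyond `best_params_`, `GridSearchCV` also exposes the refit best model and its cross-validated score, which can be handy here:

```python
# the best model refit on the training data, and its mean cross-validated score
print(new_model.best_estimator_)
print(new_model.best_score_)  # negative RMSE, since we used "neg_root_mean_squared_error"
```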