The Standard ML Pipeline

Learn about the ML pipeline and operations like data preparation, validation, model training, and more.

We'll cover the following...

Data preparation
Data validation
Model training
Model validation
Industry standards

We’ll dive into each of these steps in this lesson and cover the operations typically performed in each one. In the next lesson, we’ll discuss how these operations can become sources of disasters that cause irreversible damage to the pipeline, and therefore to the team and the company.

Data preparation

Once a dataset is acquired, the raw data is converted into a form that a model can consume. This typically involves feature engineering (i.e., deciding how to split or combine columns into more meaningful variables), data cleaning, dimensionality reduction (e.g., principal component analysis, or PCA), and more.
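
To make these operations concrete, here is a minimal sketch of a data preparation step using NumPy. It is an illustration under stated assumptions, not a prescribed implementation: the function name `prepare`, the toy dataset, and the column meanings (height, weight, age) are all hypothetical, and mean imputation and SVD-based PCA are just one possible choice for cleaning and dimensionality reduction.

```python
import numpy as np


def prepare(raw: np.ndarray, n_components: int = 2) -> np.ndarray:
    """Clean, engineer, and reduce a small numeric dataset (illustrative only)."""
    X = raw.astype(float).copy()

    # Data cleaning: replace missing values (NaN) with the column mean.
    col_means = np.nanmean(X, axis=0)
    rows, cols = np.where(np.isnan(X))
    X[rows, cols] = col_means[cols]

    # Feature engineering: combine two columns into a more meaningful one.
    # Hypothetical columns: 0 = height in m, 1 = weight in kg, 2 = age in years.
    bmi = X[:, 1] / X[:, 0] ** 2
    X = np.column_stack([X, bmi])

    # Standardize so that no column dominates PCA because of its units.
    X = (X - X.mean(axis=0)) / X.std(axis=0)

    # Dimensionality reduction: PCA via singular value decomposition.
    # Rows of vt are the principal directions, ordered by explained variance.
    _, _, vt = np.linalg.svd(X, full_matrices=False)
    return X @ vt[:n_components].T


# Toy raw data with one missing weight value.
raw = np.array([
    [1.70, 65.0, 30.0],
    [1.80, np.nan, 45.0],
    [1.60, 55.0, 25.0],
    [1.75, 90.0, 52.0],
    [1.65, 70.0, 38.0],
])

features = prepare(raw)
print(features.shape)  # (5, 2)
```

The order matters in this sketch: missing values are imputed before the new column is derived, and standardization happens before PCA so that the projection reflects structure in the data rather than differences in units.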

This is one ...
