Building a Machine Learning Pipeline from Scratch/

...

System Architecture

Examine the architecture of the ML pipeline.

We'll cover the following...

Components of the pipeline
Encapsulation: Datasets and models
How to follow along

To recap our course goals, this entire course is a single project. We start by designing the ML pipeline, and over the duration of the course, we add various components to it. As a concrete example of how we can use the pipeline, we then create an ML classification project.

In this chapter, we start with the architecture, or design, of our software. The first step in the development of any software is design. Typically, this means determining the scope of the project, identifying the various components of the system, and drawing a block diagram that shows how the various parts fit together. In addition, it includes designing what goes inside each block and including the interfaces where the blocks connect. What are the logically distinct functionalities in training a model?

Components of the pipeline

A pipeline contains the following components:

Loading data
Preprocessing data
Feature engineering data
Merging data
Training the model
Evaluating the model
Generating the ...

Introduction

Getting Started

Structuring the ML Pipeline

Directed Acyclic Graphs (DAGs)

The ML Library

Create Your First Data Pipeline with a Dashboard

The Pipeline Core

Extending the Pipeline

Build a News ETL Data Pipeline Using Python and SQLite

Testing

Deployment

Other Considerations

Wrapping Up

Appendix

Final Assessment

System Architecture

Components of the pipeline