Sklearn Workflow
Building model pipelines using sklearn
Batch model pipeline workflow
A common workflow for batch model pipelines is to extract data from a data lake or data warehouse, train a model on historical user behavior, predict future behavior for more recent data, and then save the results to a data warehouse or application database.
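A minimal sketch of this workflow is shown below, using sklearn for the train and predict steps. The file paths, table columns, and feature names are hypothetical; in practice the extract and load steps would typically read from and write to a data warehouse rather than local files.

```python
import pandas as pd
from sklearn.linear_model import LogisticRegression

# Extract: historical data with known labels, and recent data to score.
historical = pd.read_csv("warehouse_export/historical_user_behavior.csv")
recent = pd.read_csv("warehouse_export/recent_user_behavior.csv")

# Hypothetical feature and label columns.
feature_cols = ["sessions", "total_spend", "days_active"]

# Train: fit a model on historical user behavior.
model = LogisticRegression()
model.fit(historical[feature_cols], historical["purchased"])

# Predict: score the more recent data with purchase propensities.
recent["propensity"] = model.predict_proba(recent[feature_cols])[:, 1]

# Load: save the results for a data warehouse or application database.
recent[["user_id", "propensity"]].to_csv(
    "scores/user_propensity.csv", index=False
)
```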
In the gaming industry, I’ve seen this workflow used for building likelihood-to-purchase and churn models, where game servers use the predictions to provide different treatments to users. Usually, libraries like sklearn are used to develop models, and frameworks such as PySpark are used to scale up to the full player base.
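One way to do this scale-up, sketched below, is a scalar pandas UDF that applies the fitted sklearn model to batches of rows across a Spark cluster. This assumes the `model` and `feature_cols` objects from the previous sketch, and the `player_features` and `player_scores` table names are hypothetical.

```python
import pandas as pd
from pyspark.sql import SparkSession
from pyspark.sql.functions import pandas_udf

spark = SparkSession.builder.appName("score_players").getOrCreate()
players = spark.table("player_features")  # hypothetical full player base

@pandas_udf("double")
def score(sessions: pd.Series, total_spend: pd.Series,
          days_active: pd.Series) -> pd.Series:
    # Spark ships the fitted sklearn model to each executor and applies
    # it to one batch of rows at a time.
    features = pd.concat([sessions, total_spend, days_active], axis=1)
    features.columns = feature_cols
    return pd.Series(model.predict_proba(features)[:, 1])

scored = players.withColumn(
    "propensity", score("sessions", "total_spend", "days_active")
)
scored.select("user_id", "propensity") \
    .write.mode("overwrite").saveAsTable("player_scores")
```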
Model pipelines
It is typical for model pipelines to require other ETLs to run in a data platform before the pipeline can run on the most recent data. For example, there may be an upstream step in the data platform that translates JSON strings into schematized events that are used as input for a model. In this situation, it might be necessary to rerun the model pipeline for any day on which the JSON transformation process had issues.
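One way to support this is to parameterize the pipeline by date, so that a single day's partition can be reprocessed after the upstream step is fixed. The sketch below assumes the `model` and `feature_cols` objects from the first example, and the partitioned table paths are hypothetical.

```python
from datetime import date, timedelta

import pandas as pd

def run_pipeline(run_date: date) -> None:
    # Read only the partition of schematized events for run_date, so a
    # problem day can be rerun independently of the daily schedule.
    events = pd.read_parquet(f"warehouse/schematized_events/dt={run_date}")
    events["propensity"] = model.predict_proba(events[feature_cols])[:, 1]
    events[["user_id", "propensity"]].to_parquet(
        f"warehouse/model_scores/dt={run_date}.parquet", index=False
    )

# Normal daily run scores yesterday's partition.
run_pipeline(date.today() - timedelta(days=1))

# Backfill: rerun for a day when the JSON transformation had issues.
run_pipeline(date(2024, 3, 14))
```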