Save For Inference
Save a condensed model to be used for inference on real-time data.
We'll cover the following...
Chapter Goals:
- Learn how to save a model for inference
A. Deploying a model
As mentioned in chapter 4, a saved model checkpoint consists of three files: .data
, .index
, and .meta
. Since the .meta
file contains the entire computation graph structure, which includes all the data in the training dataset, it can get quite large. The large file size becomes an issue when deploying an inference model.
An inference model is a fully trained and evaluated model used to make predictions on real-time data. When we deploy an inference model for production, we don’t usually deploy the code used to build the model, either for proprietary reasons or because there are too many auxiliary code files. When we don’t have the code that ...
Access this course and 1400+ top-rated courses and projects.