...

/

Inference Using the TF Lite Model

Inference Using the TF Lite Model

Learn to make inferences using the TF Lite Interpreter in Android apps.

We'll cover the following...

The process of on-device inference involves running a TF Lite model to make predictions based on unknown input data. TF inference APIs support common mobile and embedded platforms such as Android. TF Lite models run through an interpreter to infer from the input data. The interpreter is optimized for resource-constrained devices. It uses a custom memory allocator that results in low initialization and execution latency. Let’s explore the use of the TF Lite interpreter to perform inference in an Android app.

Main steps

The following figure explains the main steps to perform inference using a TF Lite model interpreter.

Steps to perform inference using a TF Lite Interpreter
Steps to perform inference using a TF Lite Interpreter
  1. Load: First we load a TF Lite model (.tflite extension), which contains the model’s execution graph, to memory.

  2. Transform: We then transform data to a format acceptable by the TF Lite model. For instance, we have to resize and rescale input data according to the format the model expects. Moreover, we might need to change the data type of the input as per the model requirements. ...