Prediction Using Pre-trained EfficientNet

Explore the fundamental concepts of the EfficientNet model architecture.

This lesson will provide a step-by-step guide to building inference scripts using EfficientNet. Moreover, this lesson has multiple interactive playgrounds for you to practice with.

Overview of EfficientNet

EfficientNet is another convolutional neural network architecture that performs well on Imagenet and other image classification tasks. During its initial release in 2019, it was considered a state-of-the-art model.

It’s an efficiency-oriented model that utilizes a scaling method that uniformly scales all dimensions (depth, width, and resolution) using a compound coefficient. Hence, it tends to outperform other architectures without extensive grid search of hyperparameters.

Variant (B0 - B7)

EfficientNet comes with eight different variants, and each one contains different parameters:

  • resolution: This has different resolutions for each variant.
  • depth and width: The channel size is a multiple of 8.
  • resource limit: This is the increasing depth or width. Retaining the resolution might improve performance.

The table below highlights the desired resolution of the input image for all the variants.

Base Model Resolution
EfficientNetB0 224
EfficientNetB1 240
EfficientNetB2 260
EfficientNetB3 300
EfficientNetB4 380
EfficientNetB5 456
EfficientNetB6 528
EfficientNetB7 600

PyTorch Image Model

Like ResNet, the PyTorch Image Model framework has a pre-trained model for EfficientNet. Currently, it provides the pre-trained weights for B0, B1, B2, B3, and B4.

Import

We can use the create_model function provided by timm and easily load the pre-trained model by using the following code snippet:

Get hands-on with 1300+ tech skills courses.