EfficientNet (2019)

Learn the fundamentals of the EfficientNet image classification architecture with compound scaling.

General structure

EfficientNet is a convolutional neural network architecture and scaling method that uniformly scales all the dimensions of a neural network rather than using different scales for different sizes. They claim that a well-prepared architecture should conserve its proportions for each dimension, and we should not break its rankings by changing each of them manually. For this purpose, they present a coefficient called the compound scale factor and use this to decide the final scale factors for each dimension.

The figure below shows the different scales of a convolutional neural network. Widening a network means increasing the number of channels in convolutional layers. Deepening a network means increasing the number of layers in a neural network. Finally, resolution is the size of the input image.

Get hands-on with 1400+ tech skills courses.