2D convolution

The 2D convolution operation slides a 2D window (the kernel) across a 2D input matrix. At each position, the window and the input patch beneath it are multiplied element-wise, and the products are summed to produce one output value.

The following example shows how a 3x3 window traverses a 5x5 2D matrix:
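A minimal sketch of that traversal in plain Python. It assumes no padding and a stride of 1, neither of which is stated above, and the helper name `window_positions` is hypothetical:

```python
def window_positions(height, width, k):
    """Yield the (row, col) of the top-left corner of every k x k window
    that fits inside a height x width matrix (no padding, stride 1)."""
    for r in range(height - k + 1):
        for c in range(width - k + 1):
            yield r, c


# A 5x5 matrix filled with 0..24 so each patch is easy to recognise.
matrix = [[r * 5 + c for c in range(5)] for r in range(5)]

positions = list(window_positions(5, 5, 3))

# Extract the 3x3 patch under the window at its first position.
r, c = positions[0]
first_patch = [row[c:c + 3] for row in matrix[r:r + 3]]

print(len(positions))  # number of window positions
print(first_patch)     # patch covered by the first window
```

Under these assumptions the window fits in 3 positions per axis, so the output of a 3x3 window on a 5x5 matrix is 3x3.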

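Correlation vs. convolution

In the literature, the terms correlation and convolution are often used interchangeably, mostly because built-in functions hide the kernel flip and the two operations are otherwise identical. Nevertheless, it's important to know the main difference: convolution flips the kernel horizontally and vertically before sliding it, whereas correlation applies the kernel as-is.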
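The flip can be made visible with a small sketch in plain Python. The function names `correlate2d` and `convolve2d` are hypothetical helpers written for this illustration (no padding, stride 1), not a particular library's API:

```python
def correlate2d(image, kernel):
    """Slide the kernel over the image as-is (no padding, stride 1)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[r + i][c + j] * kernel[i][j]
                for i in range(kh) for j in range(kw))
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


def convolve2d(image, kernel):
    """Flip the kernel vertically and horizontally, then correlate."""
    flipped = [row[::-1] for row in kernel[::-1]]
    return correlate2d(image, flipped)


# A 5x5 image that is zero everywhere except a single 1 in the centre.
image = [[0] * 5 for _ in range(5)]
image[2][2] = 1

# An asymmetric kernel, so the flip changes the result.
kernel = [
    [1, 2, 3],
    [4, 5, 6],
    [7, 8, 9],
]

print(correlate2d(image, kernel))  # the kernel appears flipped
print(convolve2d(image, kernel))   # the kernel appears unchanged
```

With a symmetric kernel the two functions return the same result, which is one reason the distinction is easy to overlook.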

When to use convolution?

Depending on the kernel type and its values, the effect of convolution varies. We can use convolution to blur or sharpen an image, or to detect edges. For tasks like these, we usually choose the kernel values by hand to suit the goal. In convolutional layers, however, convolution is used to extract local features, and the kernel values are not set manually; they are learned during training.
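To illustrate how hand-picked kernel values change the effect, here is a small sketch in plain Python. The three kernels are common textbook choices (a box blur, a sharpening kernel, and a Laplacian edge detector) rather than ones taken from this article, and `apply_kernel` is a hypothetical helper that assumes no padding and a stride of 1. All three kernels are symmetric, so flipping them would not change the result:

```python
def apply_kernel(image, kernel):
    """Slide the kernel over the image (no padding, stride 1)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[r + i][c + j] * kernel[i][j]
                for i in range(kh) for j in range(kw))
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


# Box blur: every output value is the mean of its 3x3 neighbourhood.
blur_kernel = [[1 / 9] * 3 for _ in range(3)]

# Sharpen: boost the centre pixel and subtract its four neighbours.
sharpen_kernel = [
    [0, -1, 0],
    [-1, 5, -1],
    [0, -1, 0],
]

# Laplacian: responds where intensity changes, zero in flat regions.
edge_kernel = [
    [0, 1, 0],
    [1, -4, 1],
    [0, 1, 0],
]

# A 5x5 image with a vertical edge: dark on the left, bright on the right.
image = [[0, 0, 10, 10, 10] for _ in range(5)]

blurred = apply_kernel(image, blur_kernel)
sharpened = apply_kernel(image, sharpen_kernel)
edges = apply_kernel(image, edge_kernel)

for name, result in [("blur", blurred), ("sharpen", sharpened), ("edge", edges)]:
    print(name, result[0])  # every output row is identical for this image
```

The blur smooths the step into a ramp, the sharpening kernel exaggerates the contrast on both sides of the edge, and the Laplacian is nonzero only near the edge.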
