Automated Inspection with Computer Vision/

...

Annotation Formats

Learn to convert two annotation formats exported by CVAT into PyTorch tensors suitable for training a semantic segmentation CNN.

We'll cover the following...

CVAT for images 1.1
Segmentation mask 1.1

We learned to save the semantic segmentation annotation data in two formats: CVAT for images 1.1 and Segmentation mask 1.1.

Training a semantic segmentation dataset with PyTorch requires having target tensors with the type torch.int64. Each pixel in the target tensor must hold a long integer indicating the category index of the corresponding pixel in the original image. So, we need to run some code to convert the data exported by CVAT into suitable target tensors.

CVAT for images 1.1

The CVAT for images 1.1 format exports the semantic segmentation data into a single XML file. At the highest level, there is a <version> element and a <meta> element, where data about the annotation task is stored.

Press + to interact

Introduction

Getting Started with Images

Image I/O and Annotations

Color Spaces and Thresholding

Convert Color Spaces, Threshold

Smoothing and Masking

Detection of Features

Image Registration

3D Vision

Getting Started with Neural Networks

Convolutional Neural Networks

Project: Create and Train a CNN for Classification

Object Detection and Semantic Segmentation

Cats vs Dogs Classification with Convolutional Neural Networks

Dataset Annotation

Final Remarks

Recognize Handwritten Digits Using a Deep Neural Network

Annotation Formats

CVAT for images 1.1