Text Classification

Learn about text classification and how to do it using Python.

We'll cover the following

Introduction

When we build machine-learning models, we use the text classification technique as a first step to overcome the lack of labels, especially when we have two datasets: one dataset with labels and the other without.

In the context of text classification, a label is a categorical variable that we want to predict.

This process involves creating a new label for the second dataset using an existing text classification model. Therefore, we define text classification as a technique that classifies text content into predefined groups or categories. Here’s a table of a few commonly used classifiers for text classification and a brief description of when to use each:

Get hands-on with 1200+ tech skills courses.