...

Training the spaCy Text Classifier

Let's learn about the details of spaCy's text classifier component.

We'll cover the following...

Getting to know the TextCategorizer class
Formatting training data for the TextCategorizer
Defining the training loop
Testing the new component
Training TextCategorizer for multilabel classification

In this section, we will learn about the details of spaCy's text classifier component TextCategorizer. Previously, we saw that the spaCy NLP pipeline consists of components. We also learned about the essential components of the spaCy NLP pipeline, which are the sentence tokenizer, POS tagger, dependency parser, and named entity recognition (NER).

TextCategorizer is an optional and trainable pipeline component. In order to train it, we need to provide examples and their class labels. We first add TextCategorizer to the NLP pipeline and then do the training procedure. The illustration below shows where exactly the TextCategorizer component lies in the NLP pipeline; this component comes after the essential components. In the following diagram, textcat refers to the TextCategorizer component.

Press + to interact

A neural network architecture lies behind spaCy's TextCategorizer. TextCategorizer provides us with user-friendly and end-to-end approaches to train the classifier, so we don't have to deal directly with the neural network architecture. We'll design our own neural network architecture in the upcoming chapters. After looking at the architecture, we’re ready to dive into TextCategorizer code. Let’s get to know the TextCategorizer class first.

Getting to know the `TextCategorizer` class

Now let's get to know the TextCategorizer class in detail. First of all, we import TextCategorizer from the pipeline components:

Getting Started

Core Operations with spaCy

Linguistic Features

Rule-Based Matchmaking

Working with Word Vectors and Semantic Similarity

Putting Everything Together: Semantic Parsing with spaCy

Assessment: spaCy Features

Auto-Tagging System for Content Categorization

Customizing spaCy Models

Text Classification with spaCy

spaCy and Transformers

Putting Everything Together: Designing a Chatbot with spaCy

Appendix

Conclusion

Assessment - Machine Learning with spaCy

Training the spaCy Text Classifier

Getting to know the `TextCategorizer` class

Assessment: spaCy Features

Auto-Tagging System for Content Categorization

Assessment - Machine Learning with spaCy

Training the spaCy Text Classifier

Getting to know the TextCategorizer class

Getting to know the `TextCategorizer` class