Sentiment Analysis with spaCy

Let's look at a real-world dataset and train a sentiment analysis model.

In this lesson, we'll work on a real-world dataset and train spaCy's TextCategorizer on this dataset. We'll be working on the Amazon Fine Food Reviews dataset from Kaggle in this chapter. The original dataset is huge, with 100,000 rows. We sampled 4,000 rows. This dataset contains customer reviews about fine food sold on Amazon. Reviews include user and product information, user rating, and text.

We can load the dataset through the following method:

Get hands-on with 1400+ tech skills courses.