Installing spaCy's Statistical Models

Explore how to install spaCy's statistical language models needed for NLP tasks such as part-of-speech tagging and named entity recognition. Understand different model sizes, naming conventions, and various installation methods including pip and spaCy's download commands to effectively integrate language models into your projects.

We'll cover the following...

Overview
Installing language models

Overview

The spaCy installation doesn't come with the statistical language models needed for the spaCy pipeline tasks. spaCy language models contain knowledge about a specific language collected from a set of resources. Language models let us perform a variety of NLP tasks, including POS tagging and named-entity recognition (NER).

Different languages have different models and are language specific. There are also different models available for the same language. We'll see the differences between those models in detail in the Pro tip at the end of this section, but basically, the training data is different. The underlying statistical algorithm is the same. Some of the currently supported languages are as follows:

LANGUAGE	CODE	LANGUAGE DATA	MODELS
Chinese	zh	lang/zh </>	3 models
Danish	da	lang/da </>	3 models
Dutch	nl	lang/nl </>	3 models
English	en	lang/en </>	3 models
French	fr	lang/fr </>	3 models
German	de	lang/de </>	3 models
Greek	el	lang/el </>	3 models
Italian	it	lang/it </>	3 models
Japanese	ja	lang/ja </>	3 models
Lithuanian	lt	lang/lt </>	3 models
Multi-Language	xx	lang/xx </>	3 models
Norwegian Bokmal	nb	lang/nb </>	3 models
Polish	pl	lang/pl </>	3 models
Portugese	pt	lang/pt </>	3 models
Romanian	ro	lang/ro </>	3 models
Spanish	es	lang/es </>	3 models

1.Getting Started

2.Core Operations with spaCy

3.Linguistic Features

4.Rule-Based Matchmaking

5.Working with Word Vectors and Semantic Similarity

6.Putting Everything Together: Semantic Parsing with spaCy

Assessment

Project

7.Customizing spaCy Models

8.Text Classification with spaCy

9.spaCy and Transformers

10.Putting Everything Together: Designing a Chatbot with spaCy

11.Appendix

12.Conclusion

Assessment

Installing spaCy's Statistical Models

Overview