Document-Term Matrix

Learn about how a document-term matrix is a commonly accepted data structure for natural language processing.

We'll cover the following...

A document-term matrix is fairly simple to understand. It is a matrix with rows and columns.

  • Each row represents a document. In our case, there will be one row for Frankenstein and a second row for The Last Man.

  • Each column represents a term. In this case, terms are words, although they can be sentences, lines, paragraphs, or n-grams (more on these in a later lesson). ...

Create a free account to view this lesson.

By signing up, you agree to Educative's Terms of Service and Privacy Policy