Alternative Approaches

Learn the significance of alternative approaches for extracting information from documents.

We'll cover the following...

Bag of words

Tokenizing—or breaking a document into units—is simple to understand when tokens are just words from the document. This is often called a “bag of words.” However, this method has problems, such as a lack of context. It’s a simple way of looking at a document, but there are other, more sophisticated strategies.

More sophisticated approaches

...