Alternative Approaches
Learn the significance of alternative approaches for extracting information from documents.
Bag of words
Tokenizing, or breaking a document into units, is simple to understand when the tokens are just the words of the document. Treating a document as an unordered collection of its words is often called a “bag of words.” However, this method has problems: because word order is discarded, the context in which each word appears is lost. It’s a simple way of looking at a document, but there are other, more sophisticated strategies.
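To make the loss of context concrete, here is a minimal Python sketch of a bag of words. The function name `bag_of_words`, the regular expression used to split words, and the two sample sentences are assumptions chosen for illustration; real tokenizers handle punctuation, casing, and languages in more careful ways.

```python
import re
from collections import Counter


def bag_of_words(text):
    """Return a count of each word token in the text, ignoring order.

    Assumption for this sketch: a token is any run of lowercase letters,
    digits, or apostrophes after the text has been lowercased.
    """
    tokens = re.findall(r"[a-z0-9']+", text.lower())
    return Counter(tokens)


# Two hypothetical documents with opposite meanings.
doc_a = "The dog bites the man."
doc_b = "The man bites the dog."

bag_a = bag_of_words(doc_a)
bag_b = bag_of_words(doc_b)

# Each bag records how often a word occurs, but not where it occurs.
print(bag_a["the"], bag_a["dog"])

# The two bags compare as equal even though the sentences differ in meaning.
print(bag_a == bag_b)
```

Both sentences use the same words the same number of times, so their bags are identical. Any method that looks only at the bag cannot tell who bit whom, which is the kind of missing context that motivates the alternative strategies.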