Understanding Metadata in Text Analysis

Learn how metadata is used by the tm package during natural language processing.

Metadata

Metadata is information about the corpus and its content. This includes information like the author, timestamp, and so on. Metadata is data about data!

A corpus contains two types of metadata: corpus metadata and document-level metadata. Here’s how to list the document metadata:

Get hands-on with 1400+ tech skills courses.