
Project 3: PDF scraping in Python + REGEX

Explore how to scrape data from PDFs with Python, using regex and bounding-box techniques to identify and list the product models of manufacturers whose names contain "Tandem". This lesson guides you through extracting real-world information efficiently with accessible Python tools.

We'll cover the following...

PDF scraping example:

In this project, we will use a PDF file (see the screenshot below) from the diabetes.org website. Our goal is to list all the equipment models made by manufacturers whose names contain the word "tandem" (case-insensitive).
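Before touching the PDF itself, it may help to see the matching step in isolation. The snippet below is a minimal sketch, assuming the PDF text has already been extracted into `(manufacturer, model)` pairs. The `rows` list and the `models_by_tandem` helper are hypothetical names with placeholder values, not data from the actual file.

```python
import re

# Hypothetical rows standing in for text already extracted from the PDF.
# The manufacturer and model names are placeholders, not the real contents.
rows = [
    ("Tandem Diabetes Care", "Model A"),
    ("TANDEM diabetes", "Model B"),
    ("Other Maker", "Model C"),
]

# re.IGNORECASE makes the pattern match "Tandem", "TANDEM", "tandem", and so on.
TANDEM = re.compile(r"tandem", re.IGNORECASE)


def models_by_tandem(rows):
    """Return the models whose manufacturer name contains 'tandem'."""
    return [model for manufacturer, model in rows if TANDEM.search(manufacturer)]


print(models_by_tandem(rows))
```

Using `search` rather than `match` means the word can appear anywhere in the manufacturer name, which fits the "contains" requirement.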

Find all the product models by the manufacturer called `Tandem`
...