Building Robust Object-Oriented Python Applications and Libraries/

...

Parsing Information With Regular Expressions

Learn about the functionalities of regular expressions using the re module.

We'll cover the following...

Methods to get the matching groups
- Other features of the re module
  - Using the findall and search functions
Making regular expressions efficient

Let’s now focus on the Python side of things. The regular expression syntax is the furthest thing from object-oriented programming. However, Python’s re module provides an object-oriented interface to enter the regular expression engine.

We’ve been checking whether the re.match() function returns a valid object or not. If a pattern does not match, that function returns None. If it does match, however, it returns a useful object that we can inspect for information about the pattern.

So far, our regular expressions have answered questions such as does this string match this pattern? Matching patterns is useful, but in many cases, a more interesting question is if this string matches this pattern, what is the value of a relevant substring? If we use groups to identify parts of the pattern that we want to reference later, we can get them out of the match return value, as illustrated in the next example:

Press + to interact

The full specification describing all valid email addresses is extremely complicated, and the regular expression that accurately matches all possibilities is obscenely long. So, we cheated and made a smaller regular expression that matches many common email addresses; the point is that we want to access the domain name (after the @ sign) so we can connect to that address. This is done easily by wrapping that part of the pattern in parentheses and calling the group() method on the object returned by match().

We’ve used an additional ...

Object-Oriented Design

Objects in Python

When Objects Are Alike

Expecting the Unexpected

When to Use Object-Oriented Programming

Abstract Base Classes and Operator Overloading

Python Data Structures

Object-Oriented and Functional Programming Intersection

Strings, Serialization, and File Paths

The Iterator Pattern

Common Design Patterns

Advanced Design Patterns

Testing Object-Oriented Programs

Concurrency

Conclusion

Build a Python Airline Reservation System

Parsing Information With Regular Expressions