How to perform extractive summarization of text in Python

If you are building an application that has the capability to perform natural language processing on text data, it may be because you are generating a summary for the text dataThis is really helpful if you have a long piece of text and want to fetch only the important sentences to understand that text..

What is extractive summarization?

Extractive summarization is a type of summarization in which the articles are summarized by selecting a subset of words from the original article that retain the most important points. With this approach, we would not be generating a summary that contains words other than those present in the original article.

We will use a package named summarizer to help you generate summarized content in just one line of code!

Let’s first install the package by running:

pip install summarizer

As a dependency of this package, you also need to install nltk, which is one of the most widely used libraries, to perform Natural Language Processing.

Install this by running:

pip install nltk

We will be using the summarize() function from this package. Let’s take a look at the details of this function.

Parameters

The summarize() function accepts the following parameters:

title: This is the title of your text article. It will be used to determine what the article is about and the (potential) most important words.
text: The complete text data of your article.
count: This is an optional parameter with the default value of $5$ . It denotes the number of sentences that you want to return in the summary.

Return value

The summarize() function returns a list of the most important sentences. This can be treated as the summarized content for your text data.

Code

Now, since we know all the details, let’s move on to the code.

Free Resources

License: Creative Commons-Attribution-ShareAlike 4.0 (CC-BY-SA 4.0)

Learn in-demand tech skills in half the time

PRODUCTS

Mock Interview

New

Courses

Skill Paths

Projects

Assessments

TRENDING TOPICS

Learn to Code

Tech Interview Prep

Generative AI

Data Science

Machine Learning

GitHub Students Scholarship

Early Access Courses

Blind 75

Layoffs

Pricing

For Individuals

Try for Free

Gift a Subscription

CONTRIBUTE

Become an Author

Become an Affiliate

Earn Referral Credits

RESOURCES

Blog

Cheatsheets

Webinars

Answers

ABOUT US

Our Team

Careers

Hiring

Frequently Asked Questions

Press

LEGAL

Cookie Policy

Business Terms of Service

Data Processing Agreement

INTERVIEW PREP COURSES

Grokking the Modern System Design Interview

Grokking the Product Architecture Design Interview

Grokking the Coding Interview Patterns

Machine Learning System Design