The `merge()` function

The pandas library provides full-featured, high-performance, in-memory JOIN operations highly similar to relational databases like SQL. The merge() function serves as the entry point for performing the standard database JOIN operations.

So what is the difference between merge and JOIN? JOIN refers to the SQL-like operations that combine rows from two or more DataFrames based on a related column. On the other hand, although conveying a similar meaning to JOIN, merge is essentially the name of the pandas function for performing various types of joins (i.e., merge()).

To reduce confusion, we’ll use JOIN (instead of merge) to describe the various methods of combining DataFrames and reserve the use of merge to the merge() function.

Note: The merge() function can be applied to both DataFrames and named Series objects. However, we’ll focus primarily on DataFrames because named Series objects can easily be structured as single-column DataFrames.

As an example, a basic line of code for an INNER JOIN of two DataFrames, df1 and df2, on a common key (the 'id' column) looks like this:

Press + to interact

Using merge() With Corresponding SQL JOIN Types

`merge()` method (how)	SQL JOIN type	Details
`how='left'`	LEFT (OUTER) JOIN	Returns all rows from the left table and matching rows from the right table
`how='right'`	RIGHT (OUTER) JOIN	Returns all rows from the right table and matching rows from the left table
`how='outer'`	(FULL) OUTER JOIN	Returns a set of rows that includes rows from both left and right tables
`how='inner'`	INNER JOIN	Uses the intersection of keys from both tables
`how='cross'`	CROSS JOIN	Create a cartesian product of rows of both tables

Before We Begin

Reading Data into pandas

Combining Data

Reshaping and Manipulating Data

Encoding Data Types

Handling Numerical Data

Handling Categorical Data

Handling Text Data

Handling Time Series Data

Handling Sparse Data Structures

Handling Missing Data

Data Analysis and Visualization with sidetable and Bokeh

Leveraging Further Features of pandas

Utilizing Extended Libraries

Wrap Up

Appendix

Time Series Analysis and Visualization Using Python and Plotly

Merge 2—Implementation

The `merge()` function

Using merge() With Corresponding SQL JOIN Types

Data Analysis and Visualization with sidetable and Bokeh

Time Series Analysis and Visualization Using Python and Plotly

Merge 2—Implementation

The merge() function

Using merge() With Corresponding SQL JOIN Types

The `merge()` function