Handling Outliers
Learn how to handle outliers using Python.
We'll cover the following...
How to handle outliers
There are many methods of handling outliers in a dataset. Here are a few of them:
Ignoring outliers: We can investigate outlier records to determine whether they are genuine. If they are, we can leave them for further data analysis.
Removing outliers: We can remove the records that are considered outliers from a dataset. We perform this operation when removing them doesn't significantly impact further data analysis.
Imputing outliers: We can replace outlier values with a particular value, such as the mean or median of the dataset. Generally, we perform this operation when the goal is to maintain the size and representativeness of the dataset.
Applying log ...