Missing Data
This lesson explains what missing data is and covers some techniques for handling it in a dataset.
What is missing data?
So, you have read in your data only to discover that some values are missing! What do you do?
First, you should try to understand why your data points are missing. Are they missing at random or not? If values are missing at random and your dataset is large enough, you can often simply remove the affected rows. If your data is missing for a reason, ask what the gap means: does missing mean zero? Or is it a strong signal of a sensor malfunction? These non-random missing values should be fixed or leveraged rather than dropped. For example, set them to zero if they should be zero.
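The options described above can be sketched in pandas. This is a minimal illustration, not a prescribed workflow: the DataFrame, its column names (`sensor_id`, `temperature`, `rainfall_mm`), and the decisions about which column is missing at random and which one means zero are all assumptions made up for the example.

```python
import numpy as np
import pandas as pd

# Hypothetical sensor readings; NaN marks a missing value.
df = pd.DataFrame({
    "sensor_id": [1, 2, 3, 4, 5],
    "temperature": [21.5, np.nan, 19.8, 22.1, np.nan],
    "rainfall_mm": [0.4, 1.2, np.nan, 0.0, 3.1],
})

# Count missing values per column to see where the gaps are.
missing_per_column = df.isna().sum()
print(missing_per_column)

# Assumption: temperature gaps are random, so drop the affected rows.
dropped = df.dropna(subset=["temperature"])

# Assumption: a missing rainfall reading means "no rain", so fill with zero.
filled = df.fillna({"rainfall_mm": 0.0})

# Alternative: treat missingness as a signal (e.g. a sensor malfunction)
# by recording it in a boolean flag column instead of discarding it.
flagged = df.assign(temperature_missing=df["temperature"].isna())
```

Which of the three results you keep depends on why the values are missing; in practice you would decide per column rather than apply one rule to the whole dataset.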
Second, you should consider how much data you have relative to how many rows have missing ...