Data Cleaning
Learn how to perform data cleaning in Altair, with a focus on handling missing values, managing duplicates, and manipulating data.
Data cleaning is the process of identifying and correcting inaccuracies and inconsistencies in data, which makes the data more reliable and easier to work with.
Data cleaning involves the following main aspects:
Handling missing values
Managing duplicates
Manipulating data (formatting, normalization, and standardization)
Altair provides some functions for data cleaning. However, in most cases it is better to clean the data before passing it to Altair, and to use Altair only to render the visualization.
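As a rough illustration of cleaning the data before it reaches Altair, the sketch below uses pandas to cover the three aspects listed above. The sample data, the column names (city, temperature, temperature_norm), and the helper function clean are all hypothetical and chosen only for this example. Dropping rows with missing values and min-max normalization are just one possible choice for each step.

```python
import pandas as pd

# Hypothetical sample data: one duplicated row and one missing value.
raw = pd.DataFrame({
    "city": ["Rome", "Rome", "Paris", "Berlin"],
    "temperature": [21.0, 21.0, None, 18.0],
})


def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Return a cleaned copy of df (illustrative helper, not part of Altair)."""
    # Managing duplicates: keep only the first occurrence of each row.
    out = df.drop_duplicates()
    # Handling missing values: here we simply drop rows without a temperature.
    out = out.dropna(subset=["temperature"])
    # Manipulating data: min-max normalize temperature into the range [0, 1].
    t = out["temperature"]
    out = out.assign(temperature_norm=(t - t.min()) / (t.max() - t.min()))
    return out.reset_index(drop=True)


cleaned = clean(raw)
print(cleaned)
```

The resulting DataFrame can then be passed to Altair, for example with alt.Chart(cleaned), so that Altair is only responsible for rendering.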
Handling missing values
A missing value is simply a value that is not present in the data. There are many reasons why values might be missing from the data, such as errors in data collection, preprocessing, or intentional omission of ...