Python Data Analysis and Visualization/

...

Solution Review: Cleaning Auto MPG Dataset

This lesson provides the solution to the previous challenge.

We'll cover the following...

- Cleaning the dataset

Press + to interact

Python 3.5

import pandas as pd
def read_csv():
    # Define the column names as a list
    names = ["mpg", "cylinders", "displacement", "horsepower", "weight", "acceleration", "model_year", "origin", "car_name"]
    # Read in the CSV file from the webpage using the defined column names
    df = pd.read_csv("auto-mpg.data", header=None, names=names, delim_whitespace=True)
    return df
# Remving outliers from the data
def outlier_detection(df):
    df = df.quantile([.90, .10])
    return df
print(outlier_detection(read_csv()))

What is Analytics

Python Basics for Analytics

Reading Data

Describing Data

Cleaning Data

Visualizing Data

Solution Review: Cleaning Auto MPG Dataset

Cleaning the dataset #