Solution Review: Cleaning Auto MPG Dataset
This lesson provides the solution to the previous challenge.
We'll cover the following...
Cleaning the dataset
Python 3.5
import pandas as pddef read_csv():# Define the column names as a listnames = ["mpg", "cylinders", "displacement", "horsepower", "weight", "acceleration", "model_year", "origin", "car_name"]# Read in the CSV file from the webpage using the defined column namesdf = pd.read_csv("auto-mpg.data", header=None, names=names, delim_whitespace=True)return df# Remving outliers from the datadef outlier_detection(df):df = df.quantile([.90, .10])return dfprint(outlier_detection(read_csv()))
According to the problem ...