Ad

Saturday, March 24, 2018

Data Preprocessing using Pandas - Exploratory Data Analysis (EDA) cheatsheet

Helpful pandas functions for data quality:

  • DataFrame.dtypes()
  • DataFrame.info()
  • DataFrame.describe()
  • x.value_counts()
  • x.isnull()
  • x.mean()
  • x.var()

No comments:

Post a Comment

Machine Learning Workflow

Data cleaning Missing data Outlier Others: duplicates, typos, special characters Strategy for missing data: imputation, mean, median...