Create iterator
iter(object)
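As a small illustration of the snippet above, this sketch builds an iterator from a list and advances it with next(). The list name letters and its values are made up for the example.

```python
# Hypothetical sample data, not from the original reference.
letters = ["a", "b", "c"]

it = iter(letters)        # create an iterator over the list
first = next(it)          # first item
second = next(it)         # second item
rest = list(it)           # consume whatever is left
done = next(it, None)     # iterator is exhausted, so the default is returned
```

Passing a default to next() avoids the StopIteration that an exhausted iterator would otherwise raise.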
Plot scatterplot matrix
pd.plotting.scatter_matrix(df, figsize=(15,15));
Verify that two DataFrames’ column dtypes match (assumes both have the same column labels in the same order)
(df1.dtypes == df2.dtypes).all()
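A minimal sketch of the dtype check, using three invented DataFrames (df1, df2, df3 and their contents are assumptions for illustration). It also shows a separate label check, since the dtype comparison only works when the column labels line up.

```python
import pandas as pd

# Hypothetical frames: df2 matches df1's dtypes, df3 does not.
df1 = pd.DataFrame({"a": [1, 2], "b": [1.0, 2.0]})
df2 = pd.DataFrame({"a": [3, 4], "b": [5.0, 6.0]})
df3 = pd.DataFrame({"a": [3, 4], "b": ["x", "y"]})

# Column labels are identical and in the same order.
same_labels = df1.columns.equals(df2.columns)

# Element-wise dtype comparison, reduced to a single boolean.
same_dtypes = bool((df1.dtypes == df2.dtypes).all())
diff_dtypes = bool((df1.dtypes == df3.dtypes).all())  # column b holds strings, not floats
```

Checking the labels first is a cautious choice: comparing two dtype Series whose labels differ raises a ValueError instead of returning False.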
Convert a column’s datatype
df['column'] = df['column'].astype(float) # or bool, int, str
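For illustration, the conversion can be tried on a column of numeric strings. The DataFrame below is invented sample data; only the astype call comes from the snippet above.

```python
import pandas as pd

# Hypothetical data: numbers stored as strings.
df = pd.DataFrame({"column": ["1.5", "2", "3.25"]})

df["column"] = df["column"].astype(float)  # parse each string as a float
total = df["column"].sum()                 # arithmetic now works on the column
is_float = pd.api.types.is_float_dtype(df["column"])
```

If a value cannot be parsed, astype(float) raises a ValueError; pd.to_numeric(df["column"], errors="coerce") is one alternative that turns such values into NaN instead.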
Count duplicate rows
df.duplicated().sum()
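A short sketch of what the count includes, on an invented DataFrame. By default duplicated() marks only the repeats, not the first occurrence of each row.

```python
import pandas as pd

# Hypothetical data: the row (1, "a") appears three times.
df = pd.DataFrame({"x": [1, 1, 2, 1], "y": ["a", "a", "b", "a"]})

n_repeats = int(df.duplicated().sum())            # repeats only, first occurrence excluded
n_involved = int(df.duplicated(keep=False).sum()) # every row that has a twin
n_after_drop = len(df.drop_duplicates())          # rows left after removing repeats
```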
Count rows with any null values
df.isnull().any(axis=1).sum()
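To show how the row count differs from a cell count, here is a sketch on invented data; the column names and values are assumptions for the example.

```python
import pandas as pd

# Hypothetical data: the last row has nulls in both columns.
df = pd.DataFrame({
    "x": [1.0, float("nan"), 3.0, float("nan")],
    "y": ["a", "b", None, None],
})

rows_with_null = int(df.isnull().any(axis=1).sum())  # rows containing at least one null
total_nulls = int(df.isnull().sum().sum())           # individual null cells
```

A row with several nulls is counted once by the first expression and once per null cell by the second.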