Pandas Dataframe resampling

Resampling pandas dataframe to calculate mean/median for a selected window size. import pandas as pd millisecond = "ms" idx = pd.date_range('1/1/2022', periods=100, freq=millisecond) series = pd.Series(list(range(100, 200)), index=idx) df = pd.DataFrame({'s': series}) window_size = 10 df.resample(f"{window_size}{millisecond}").mean() df.resample(f"{window_size}{millisecond}").median() df.resample(f"{window_size}{millisecond}").last()

July 17, 2022 · 1 min

Pandas Multi-Index

Pandas DataFrame create multiindex using existing columns import pandas as pd df = pd.read_csv("./data/dataset.csv") df = df.set_index(["INSTANCES", "TIMEPOINTS"], inplace=False)

July 14, 2022 · 1 min

Pandas cheat sheets

Pandas cheat sheets Find duplicates by column value: df[df.duplicated(['col_name'])] Find duplicates by row: df[df.duplicated()] Select rows from a DataFrame based on column values df.loc[df['column_name'] == some_value] Rename df column df.rename(columns={'gdp':'log(gdp)'}, inplace=True) Datatype of the columns df.dtypes

November 20, 2020 · 1 min