bodo

Parallelize apply after pandas groupby

Parallelize apply after pandas groupby Question: I have used rosetta.parallel.pandas_easy to parallelize apply after groupby, for example: from rosetta.parallel.pandas_easy import groupby_to_series_to_frame df = pd.DataFrame({‘a’: [6, 2, 2], ‘b’: [4, 5, 6]},index= [‘g1’, ‘g1’, ‘g2’]) groupby_to_series_to_frame(df, np.mean, n_jobs=8, use_apply=True, by=df.index) However, has anyone figured out how to parallelize a function that returns a DataFrame? This code …

Total answers: 8