Kedro
━━━━━

My Notes about using kedro

Date: November 2, 2019

See all of my kedro related posts in [[ tag/kedro ]].

#kedrotips <https://twitter.com/search?q=%23kedrotips&f=live>

─────────────────────────────────────────────────────────────

I am tweeting out most of these snippets as I add them; you can find them all here #kedrotips <https://twitter.com/search?q=%23kedrotips>.

🗣 Heads up
──────────

Below are some quick snippets/notes for using kedro to build data pipelines. So far I am just compiling snippets; eventually I will turn them into several posts on kedro. These are mostly things that I use every day with kedro, though some are a bit more esoteric. Some are helpful when writing production code, others are more useful for exploration.

📚 Catalog
─────────

Image: catalogs (Photo by jesse orrico on Unsplash)

### CSVLocalDataSet

python [code]
import pandas as pd
from kedro.io import CSVLocalDataSet

iris = pd.read_csv('https://raw.githubusercontent.com/kedro-org/kedro/d3218bd89ce8d1148b1f79dfe589065f47037be6/kedro/template/%7B%7B%20cookiecutter.repo_name%20%7D%7D/data/01_raw/iris.csv')

iris_data_set = CSVLocalDataSet(filepath="test.csv",
                                load_args=None,
                                save_args={"index": False})

iris_data_set.save(iris)
reloaded_iris = iris_data_set.load()

yaml [code]
test_data:
  type: CSVLocalDataSet
  filepath: test.csv
  load_args: null
  save_args:
    index: False

CSVHTTPDataSet
──────────────

[code]
from kedro.io import CSVHTTPDataSet

iris_data_set = CSVHTTPDataSet(
    fileurl="https://raw.githubusercontent.com/kedro-org/kedro/d3218bd89ce8d1148b1f79dfe589065f47037be6/kedro/template/%7B%7B%20cookiecutter.repo_name%20%7D%7D/data/01_raw/iris.csv",
    auth=None,
    load_args=None)

iris = iris_data_set.load()

[code]
cities:
  type: CSVHTTPDataSet
  fileurl: https://people.sc.fsu.edu/~jburkardt/data/csv/cities.csv
  auth: null
  load_args: null

HDFLocalDataSet
───────────────

[code]
import pandas as pd
from kedro.io import HDFLocalDataSet

iris = pd.read_csv('https://raw.githubusercontent.com/kedro-org/kedro/d3218bd89ce8d1148b1f79dfe589065f47037be6/kedro/template/%7B%7B%20cookiecutter.repo_name%20%7D%7D/data/01_raw/iris.csv')

iris_data_set = HDFLocalDataSet(filepath="iris.hdf",
                                key="test_hdf_key",
                                load_args=None,
                                save_args=None)

iris_data_set.save(iris)
reloaded_iris = iris_data_set.load()

[code]
cars:
  type: HDFLocalDataSet
  filepath: test.hdf
  key: test_hdf_key

HDFS3DataSet
────────────

[code]
import pandas as pd
from kedro.io import HDFS3DataSet

iris = pd.read_csv('https://raw.githubusercontent.com/kedro-org/kedro/d3218bd89ce8d1148b1f79dfe589065f47037be6/kedro/template/%7B%7B%20cookiecutter.repo_name%20%7D%7D/data/01_raw/iris.csv')

iris_data_set = HDFS3DataSet(filepath="iris.hdf",
                             bucket_name="bucket-us-west-1",
                             key="test_hdf_key",
                             load_args=None,
                             save_args=None)

iris_data_set.save(iris)
reloaded_iris = iris_data_set.load()

[code]
cars:
  type: HDFS3DataSet
  filepath: cars.hdf
  bucket_name: bucket-us-west-1
  key: test_hdf_key

JSONLocalDataSet
────────────────

[code]
import pandas as pd
from kedro.io import JSONLocalDataSet

iris = pd.read_csv('https://raw.githubusercontent.com/kedro-org/kedro/d3218bd89ce8d1148b1f79dfe589065f47037be6/kedro/template/%7B%7B%20cookiecutter.repo_name%20%7D%7D/data/01_raw/iris.csv')

iris_data_set = JSONLocalDataSet(filepath="iris.json",
                                 load_args=None,
                                 save_args=None)

iris_data_set.save(iris)
reloaded_iris = iris_data_set.load()

[code]
cars:
  type: JSONLocalDataSet
  filepath: iris.json

ParquetLocalDataSet
───────────────────

[code]
import pandas as pd
from kedro.io import ParquetLocalDataSet

iris = pd.read_csv('https://raw.githubusercontent.com/kedro-org/kedro/d3218bd89ce8d1148b1f79dfe589065f47037be6/kedro/template/%7B%7B%20cookiecutter.repo_name%20%7D%7D/data/01_raw/iris.csv')

iris_data_set = ParquetLocalDataSet('iris',
                                    engine='auto',
                                    load_args=None,
                                    save_args=None,
                                    version=None)

iris_data_set.save(iris)
reloaded_iris = iris_data_set.load()

[code]
cars:
  type: ParquetLocalDataSet
  filepath: cars

There are several more data set types that follow the same pattern (a quick sketch of one is below):

PickleS3DataSet
SQLTableDataSet
SQLQueryDataSet
TextLocalDataSet
ExcelLocalDataSet
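I have not written these last few up yet. As a rough sketch of the shared pattern, TextLocalDataSet works the same way; the filepath argument, the `notes` entry name, and the string save/load behaviour here are my assumptions based on the data sets above, not checked against the docs:

[code]
from kedro.io import TextLocalDataSet

# assumption: filepath is the only required argument, and the data set
# round-trips a plain python string, mirroring the local data sets above
notes_data_set = TextLocalDataSet(filepath="notes.txt")

notes_data_set.save("kedro tips go here")
reloaded_notes = notes_data_set.load()

[code]
notes:
  type: TextLocalDataSet
  filepath: notes.txt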
⏳ Loading Data
──────────────

Image: loading data (Photo by Battlecreek Coffee Roasters on Unsplash)

### Simple Loading

[code]
df = catalog.load('cars')

### List all datasets

[code]
catalog.list()

### Saving Data

[code]
catalog.save('cars', cars)

### 🔍 Finding data

simple keyword search

[code]
query = 'raw'
[data for data in catalog.list() if query in data]

see on #kedrotips <https://twitter.com/_WaylonWalker/status/1197130980659732480?s=20>

multi keyword search

[code]
query = 'raw sales'
data_sets = catalog.list()
for word in query.split():
    data_sets = [data for data in data_sets if word in data]

see on #kedrotips <https://twitter.com/_WaylonWalker/status/1197528461587419139?s=20>

🐒 monkey patch it

[code]
def query(*search_terms):
    data_sets = catalog.list()
    for search in search_terms:
        data_sets = [data for data in data_sets if search in data]
    return data_sets

catalog.query = query

see on #kedrotips <https://twitter.com/_WaylonWalker/status/1197855759507300352?s=20>

### 🤙 YOLO (You Only Load Once)

simple

[code]
data = [catalog.load(d)
        for d in catalog.query('c_pri', 'cars')]

more refined

[code]
data = {
    d: catalog.load(d)
    for d in catalog.query('c_pri', 'cars')
}

🍷 refined like a fine wine

[code]
from types import SimpleNamespace

data = SimpleNamespace(**{
    d: catalog.load(d)
    for d in catalog.query('c_pri', 'cars')
})

🧀 Make it a function

getting funcy

[code]
from types import SimpleNamespace

def yolo(*search_terms):
    """you only load once

    uses the query method from the previous tip"""
    data = SimpleNamespace(**{
        d: catalog.load(d)
        for d in catalog.query(*search_terms)
    })
    return data

all_pri = yolo('c_pri')

🐒 monkey patch it

[code]
catalog.yolo = yolo
catalog.yolo.__doc__ = "you only load once"

all_pri = catalog.yolo('c_pri')

### adding catalogs together

[code]
from kedro.io import DataCatalog

DataCatalog({**cat1.__dict__['_data_sets'], **cat2.__dict__['_data_sets']})

🛢 Building pipelines
────────────────────

Image: building pipelines (Photo by roman pentin on Unsplash)

### 📍 Creating Nodes

[code]
from kedro.pipeline import node

dropna_node = node(lambda x: x.dropna(), inputs='raw_cars', outputs='int_cars')

[code]
from kedro.pipeline import node

def drop_columns(df, *columns):
    for column in columns:
        df = df.drop(columns=column)
    return df

drop_columns_node = node(
    lambda x: drop_columns(x, 'vs', 'am', 'gear', 'carb'),
    inputs='int_cars',
    outputs='pri_cars'
)

### 🛢 Creating a pipeline

Nodes on their own do not do much; they get collected into a Pipeline object (see the sketch after the next example).

### Don’t be so verbose

Create similar nodes dynamically

[code]
from typing import List

import numpy as np
import pandas as pd
from kedro.pipeline import node

def halve_dataframe(data: pd.DataFrame) -> List[pd.DataFrame]:
    """splits a dataframe in half"""
    return np.array_split(data, 2)

nodes = []
datasets = [
    'cars', 'trucks', 'boats', 'motorcycles', 'planes',
    'ships', 'busses', 'trains', 'subways'
]

# creates a pipeline node for every dataset in the datasets list
for dataset in datasets:
    nodes.append(
        node(halve_dataframe,
             f'e_modin_{dataset}',
             [f'train_{dataset}', f'test_{dataset}'])
    )
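To round out the "Creating a pipeline" heading above, here is a minimal sketch of wrapping the dynamically created nodes into a pipeline and running it. It assumes the kedro 0.15-style Pipeline and SequentialRunner APIs, the `nodes` list from the loop above, and the project `catalog`; `split_pipeline` is just a name I made up:

[code]
from kedro.pipeline import Pipeline
from kedro.runner import SequentialRunner

# wrap the dynamically created nodes into a single pipeline object
split_pipeline = Pipeline(nodes)

# sanity check the edges before running (see Pipeline IO below)
print(split_pipeline.all_inputs())
print(split_pipeline.all_outputs())

# run it against the project catalog
SequentialRunner().run(split_pipeline, catalog)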
🏃‍♂️ Running Pipelines
──────────────────────

Image: running pipelines (Photo by Rodion Kutsaev on Unsplash)

🔖 filter by tags

[code]
nodes = pipeline.only_nodes_with_tags('cars')

see on #kedrotips <https://twitter.com/_WaylonWalker/status/1195319044808888321?s=20>

filter by node

[code]
nodes = pipeline.only_nodes('b_int_cars')

see on #kedrotips <https://twitter.com/_WaylonWalker/status/1196406204479737856?s=20>

filter nodes like

[code]
query_string = 'cars'
nodes = [node.name for node in pipeline.nodes if query_string in node.name]
pipeline.only_nodes(*nodes)

see on #kedrotips <https://twitter.com/_WaylonWalker/status/1196813895228428288?s=20>

only nodes with tags (or)

[code]
nodes = pipeline.only_nodes_with_tags('cars', 'trains')

only nodes with tags (and)

[code]
raw_nodes = pipeline.only_nodes_with_tags('raw')
car_nodes = pipeline.only_nodes_with_tags('cars')
raw_car_nodes = raw_nodes & car_nodes

[code]
raw_car_nodes = (
    pipeline
    .only_nodes_with_tags('raw')
    .only_nodes_with_tags('cars')
)

add pipelines

[code]
car_nodes = pipeline.only_nodes_with_tags('cars')
train_nodes = pipeline.only_nodes_with_tags('trains')
transportation_nodes = car_nodes + train_nodes

ensure nodes are attached

[code]
cars_attached = len(
    pipeline
    .only_nodes_with_tags('cars')
    .grouped_nodes
) == 1

### 🎂 Pipeline Decorators

example - log_time <https://kedro.readthedocs.io/en/latest/_modules/kedro/pipeline/decorators.html#log_time>

[code]
from kedro.pipeline.decorators import log_time, mem_profile

pipeline = pipeline.decorate(log_time)

Pipeline IO
───────────

pipeline.all_inputs() and pipeline.all_outputs() return sets of pipeline inputs and outputs, and you can do set operations on them. This is particularly useful for finding the upper and lower edges of your pipeline, or of a subset of it. The pipeline object here is any kedro pipeline, including a filtered subset.

### Find all raw data

[code]
pipeline.all_inputs() - pipeline.all_outputs()

### Find all final data

[code]
pipeline.all_outputs() - pipeline.all_inputs()

### Find all nodes that consume raw data

This one is probably pushing the limits of what I would put in a list comprehension in prod, or even in a text editor, but I commonly use ipython for my ad hoc work and keeping it all in one line is very handy. Complex list comprehensions start to become like regex: really easy to write and really hard to read. I don’t think this one quite hits that point, but it’s getting close. I find it super useful for moving data between environments or avoiding unnecessary database calls.

[code]
raw_inputs = pipeline.all_inputs() - pipeline.all_outputs()
raw_nodes = [node for node in pipeline.nodes
             if [i for i in raw_inputs if i in set(node.inputs)] != []]
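If that comprehension does cross the readability line for you, the same selection can be written as a plain loop; this is just a sketch using the same `pipeline` object and `raw_inputs` set as above, with a set intersection on `node.inputs` replacing the nested comprehension:

[code]
raw_inputs = pipeline.all_inputs() - pipeline.all_outputs()

# keep any node that takes at least one raw (never-produced) data set as input
raw_nodes = []
for node in pipeline.nodes:
    if set(node.inputs) & raw_inputs:
        raw_nodes.append(node)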