Hydra-Sklearn Preprocessing Pipelines

This repository accompanying the blog post:

Creating Configurable Data Pre-Processing Pipelines by Combining Hydra and Sklearn - by Eli Simhayev & Benjamin Bodner

Update 4.1.23

When I wrote this blog-post, the stable version of Hydra was 1.1. Now, the stable version is 1.3, so note that this code work with Hydra 1.1 :)

Running Different Pipelines

Run:

python main.py preprocessing_pipeline=decision_tree

to execute the decision_tree preprocessing pipeline. You might also run other pipelines (from configs/preprocessing_pipeline) by just changing:

python main.py preprocessing_pipeline=<your-pipeline>

Hydra also supports Tab completion to complete config.

Adding New Pipelines

Adding new pipelines can be easily done using a yaml configuration in configs/preprocessing_pipeline. You might add another configurations: which model to use, which visualizations, etc. - learn more here: Hydra — A fresh look at configuration for machine learning projects

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
configs		configs
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
blog_code.ipynb		blog_code.ipynb
hydra_sklearn_pipeline.py		hydra_sklearn_pipeline.py
main.py		main.py
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

configs

configs

.gitignore

.gitignore

.pre-commit-config.yaml

.pre-commit-config.yaml

README.md

README.md

blog_code.ipynb

blog_code.ipynb

hydra_sklearn_pipeline.py

hydra_sklearn_pipeline.py

main.py

main.py

requirements.txt

requirements.txt

utils.py

utils.py

Repository files navigation

Hydra-Sklearn Preprocessing Pipelines

Update 4.1.23

Running Different Pipelines

Adding New Pipelines

We hope this will help you to better organize your data preprocessing pipelines 🙂

About

Languages

elisim/hydra-sklearn-pipelines

Folders and files

Latest commit

History

Repository files navigation

Hydra-Sklearn Preprocessing Pipelines

Update 4.1.23

Running Different Pipelines

Adding New Pipelines

We hope this will help you to better organize your data preprocessing pipelines 🙂

About

Topics

Resources

Stars

Watchers

Forks

Languages