Jeff Nirschl b824d7c2d5
Fix bug in replace_nan that removed imputation:method from params.yaml. Re-run DVC repro
3 years ago
afecec1ddb
initial commit using cookiecutter data science
3 years ago
ad08b38ba1
Parameter tuning with hyperopt for RF model with feature engineering.
3 years ago
b824d7c2d5
Fix bug in replace_nan that removed imputation:method from params.yaml. Re-run DVC repro
3 years ago
219842899f
Update script build_features.py to include 6 hand-crafted features such as family_size, is_orphan, is_single_mother etc. as well as polynomial transform features from sklearn.preprocessing. Re-run DVC repro successfully.
3 years ago
ea4f046e65
Move function create_data_dictionary out of make_dataset.py to reduce code complexity. Create new script data_dictionary.py to manage data dictionary and data summary table. DVC stage 1 working but other stages currently broken.
3 years ago
b824d7c2d5
Fix bug in replace_nan that removed imputation:method from params.yaml. Re-run DVC repro
3 years ago
7ca1feda42
Add script to split training data into train/dev sets using stratified K-fold cross validation. Save indices for train/dev splits as CSV.
3 years ago
