Commit History
Message Author SHA1 Date
Update README.md   jnirschl 3 years ago
Update README.md to include code for getting started. Add setup.py   Jeff Nirschl 3 years ago
Update dvc files   Jeff Nirschl 3 years ago
Re-do DVC stage predict_output. I had forgotten to add models/estimator.pkl as a dependency for the stage. Running successfully now.   Jeff Nirschl 3 years ago
Add bash example code for stage train_model to README.md. Add indent to other sections of bash example code.   Jeff Nirschl 3 years ago
Correct DVC DAG for stage normalize_data. The script normalize.py accepts the featurized data, not the nan-imputed data. Successfully re-ran stages normalize_data and split_train_dev.   Jeff Nirschl 3 years ago
Add placeholder script build_features.py to allow feature engineering (currently just saves a copy of the input dataframe as "_featurized.csv"). Add DVC stage build_features (feature engineering) prior to feature normalization. Run DVC stages build_features, normalize_data, and split_train_dev with all stages working. Update README.md to include feature engineering stage.   Jeff Nirschl 3 years ago
Add description of TDSP stages 1 and 2 to README.md   Jeff Nirschl 3 years ago
Move function create_data_dictionary out of make_dataset.py to reduce code complexity. Create new script data_dictionary.py to manage data dictionary and data summary table. DVC stage 1 working but other stages currently broken.   Jeff Nirschl 3 years ago
Remove duplicate code and use function load_data from src/data/__init__.py. Fix typo in README.md   Jeff Nirschl 3 years ago
Update README.md to include step-by-step overview of using DVC for the TDSP data science life cycle.   Jeff Nirschl 3 years ago
initial commit using cookiecutter data science   Jeff Nirschl 3 years ago