DVC example mirror for DATA 607
File | Last modified |
---|---|
.dvc | 10 months ago |
run | 10 months ago |
.gitignore | 10 months ago |
README.md | 10 months ago |
check_coeffs.py | 10 months ago |
clean_data.py | 10 months ago |
cleaned_data.csv | |
coefs.json | 10 months ago |
data.csv | |
data.csv.dvc | 10 months ago |
gen_data.py | 10 months ago |
linear_regressor.joblib | |
mse.json | 10 months ago |
n_data.json | 10 months ago |
train.py | 10 months ago |
writing_scripts.ipynb | 10 months ago |
Legend |
---|
DVC Managed File |
Git Managed File |
Metric |
Stage File |
External File |
DVC example for Data Science in Context DATA 607
To generate a dataset:\
python gen_data.py -d <n_datapoints> -c <c1,c2,c3,c4>
(c1 through c4 are the coefficients that the linear regressor should recover)
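
The contents of gen_data.py are not shown here, so the following is only a minimal sketch of what a generator with this command-line interface might look like. The `-d` and `-c` flags come from the command above; everything else is an assumption made for illustration: the function names, the four uniform features, the absence of an intercept, the Gaussian noise, and the column names.

```python
import argparse
import csv
import random


def parse_coeffs(text):
    """Parse a string such as '1,2,3,4' into a list of four floats."""
    try:
        coeffs = [float(part) for part in text.split(",")]
    except ValueError:
        raise argparse.ArgumentTypeError(f"not a comma-separated list of numbers: {text!r}")
    if len(coeffs) != 4:
        raise argparse.ArgumentTypeError("expected exactly four coefficients")
    return coeffs


def parse_args(argv=None):
    """Mirror the -d / -c interface shown in the README command."""
    parser = argparse.ArgumentParser(description="Generate synthetic regression data.")
    parser.add_argument("-d", dest="n_datapoints", type=int, required=True,
                        help="number of rows to generate")
    parser.add_argument("-c", dest="coeffs", type=parse_coeffs, required=True,
                        help="true coefficients as c1,c2,c3,c4")
    return parser.parse_args(argv)


def generate_rows(n_datapoints, coeffs, noise_sd=0.1, seed=0):
    """Return rows [x1, x2, x3, x4, y] with y = sum(c_i * x_i) + noise.

    Assumption: features are uniform on [0, 1] and the model has no intercept.
    """
    rng = random.Random(seed)
    rows = []
    for _ in range(n_datapoints):
        xs = [rng.uniform(0.0, 1.0) for _ in coeffs]
        y = sum(c * x for c, x in zip(coeffs, xs)) + rng.gauss(0.0, noise_sd)
        rows.append(xs + [y])
    return rows


def write_csv(rows, path):
    """Write the rows to a CSV file with a header line (hypothetical column names)."""
    with open(path, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["x1", "x2", "x3", "x4", "y"])
        writer.writerows(rows)
```

Under these assumptions, `generate_rows(100, [1.0, 2.0, 3.0, 4.0])` yields 100 rows of five numbers each, and a regressor trained on them should recover coefficients close to 1, 2, 3 and 4.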
To switch to a different version of the data:\
git checkout <branch> data.csv.dvc
(checks out the DVC pointer file from a specific branch or point in time)\
dvc checkout
(DVC sees that the pointer has changed and restores the matching version of data.csv from the DVC cache)\
dvc repro run/check_coeffs.dvc
(DVC reproduces the pipeline up to the specified stage)
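
To make the pointer-and-cache idea concrete, here is a simplified illustration of a content-addressed cache. It is not DVC's actual code. The function names are hypothetical, and the layout (a subdirectory named after the first two characters of the MD5 hash) is an assumption that matches some DVC versions but not all of them.

```python
import hashlib
import shutil
import tempfile
from pathlib import Path


def file_md5(path):
    """Return the hex MD5 digest of a file's bytes."""
    return hashlib.md5(Path(path).read_bytes()).hexdigest()


def cache_path(cache_dir, md5):
    """Map a hash to a cache location (assumed layout: <first 2 chars>/<rest>)."""
    return Path(cache_dir) / md5[:2] / md5[2:]


def add_to_cache(data_file, cache_dir):
    """Copy data_file into the cache and return the hash a pointer file would record."""
    md5 = file_md5(data_file)
    target = cache_path(cache_dir, md5)
    target.parent.mkdir(parents=True, exist_ok=True)
    if not target.exists():
        shutil.copyfile(data_file, target)
    return md5


def checkout(pointer_md5, data_file, cache_dir):
    """Make data_file match the pointer, restoring it from the cache if needed.

    Returns True if the workspace file was replaced, False if it was already current.
    """
    data_file = Path(data_file)
    if data_file.exists() and file_md5(data_file) == pointer_md5:
        return False
    shutil.copyfile(cache_path(cache_dir, pointer_md5), data_file)
    return True


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        data = Path(tmp) / "data.csv"
        cache = Path(tmp) / "cache"
        data.write_text("x,y\n1,2\n")
        old_version = add_to_cache(data, cache)   # hash stored in the old pointer
        data.write_text("x,y\n3,4\n")
        add_to_cache(data, cache)                 # a newer version is cached too
        # Switching back to the old pointer restores the old file contents.
        print(checkout(old_version, data, cache), data.read_text())
```

In this picture, Git versions only the small pointer (the hash), while the data itself lives in the cache; checking out an older pointer and then running the checkout step is enough to bring back the older data file.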
To modify a stage, edit the relevant run/<stage-definition-file>.sh file, then re-run it:\
bash run/<stage-definition-file>.sh