Gianfranco Demarco e653ad193b
Added GDRIVE and updated train loop
1 year ago

README.md

Commands

To run the main entry point, run:

python ./run.py 

To run a specific entry point (e.g. download_data), pass its name with -e:

python ./run.py -e download_data 
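
The README does not show how run.py dispatches on -e. As a rough sketch only, one common way to do it is an argparse option that selects a function from a registry. The registry, the function bodies, the "main" default name, and the --entry-point long option below are all assumptions made for illustration, not the project's actual code.

```python
import argparse


# Hypothetical entry points: the real project defines its own functions.
def main_entry():
    return "main"


def download_data():
    return "download_data"


# Hypothetical registry mapping the name given to -e onto a callable.
ENTRY_POINTS = {
    "main": main_entry,
    "download_data": download_data,
}


def build_parser():
    parser = argparse.ArgumentParser(description="Run a project entry point.")
    parser.add_argument(
        "-e",
        "--entry-point",  # long form is an assumption; the README only shows -e
        default="main",
        choices=sorted(ENTRY_POINTS),
        help="name of the entry point to run",
    )
    return parser


def run(argv=None):
    # argv=None makes argparse read the process command line.
    args = build_parser().parse_args(argv)
    return ENTRY_POINTS[args.entry_point]()
```

With this layout, calling run() with no arguments falls back to the default entry point, and an unknown name is rejected by argparse before any work starts.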

Pipeline


1. Make dataset

Downloads the card database in SQLite format from MTGJSON.
Then, for each card, downloads its image.
This step supports partial downloads: an interrupted run can be resumed without fetching the resources it already has.
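
The resume behaviour described above could be approximated as follows. This is a minimal sketch under stated assumptions, not the repository's implementation: the function name download_missing, the (file name, URL) item format, and the injected fetch callable are hypothetical. It skips files that already exist, writes each download to a temporary ".part" file before renaming it, and uses a thread pool because the downloads are I/O bound.

```python
import os
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def download_missing(items, out_dir, fetch, max_workers=8):
    """Download every item whose target file does not exist yet.

    items: iterable of (file_name, url) pairs (hypothetical format).
    fetch: callable taking a URL and returning the file content as bytes.
    Returns a dict mapping file_name to "skipped" or "downloaded".
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    def download_one(item):
        name, url = item
        target = out_dir / name
        if target.exists():
            # Already fetched by a previous run: this is what makes resuming cheap.
            return name, "skipped"
        # Write to a temporary file first so an interrupted download
        # never leaves a half-written file under the final name.
        tmp = target.with_name(target.name + ".part")
        tmp.write_bytes(fetch(url))
        os.replace(tmp, target)
        return name, "downloaded"

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return dict(pool.map(download_one, items))
```

Passing fetch in as a parameter keeps the sketch independent of any particular HTTP client; in practice it would wrap whatever library performs the request.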

About

A Python ML boilerplate based on Cookiecutter Data Science, providing support for data versioning (DVC), experiment tracking, model and dataset cards, etc.
