Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
Peter Bull f24a8f77c7
Initial docs commit
8 years ago
f24a8f77c7
Initial docs commit
8 years ago
3fdd064e9f
Fix missing imports
8 years ago
ba99332b2d
Start to flesh out READMEs
8 years ago
f24a8f77c7
Initial docs commit
8 years ago
a199fa53f2
Add cookiecutter dependency version
8 years ago
ff0439f2cb
Stupidest part of the JSON spec.
8 years ago
f24a8f77c7
Initial docs commit
8 years ago
Storage Buckets

README.md

You have to be logged in to leave a comment. Sign In

cookiecutter-data-science

An opinionated, but not-afraid-to-be-wrong project template for data science projects. Pull requests welcome. Debate encouraged.

Requirements to create project:

To start a new project:

cookiecutter https://github.com/drivendata/cookiecutter-data-science

asciicast

Data

** By default, the data folder is included in the .gitignore file.** If you have a small amount of data that rarely changes, you may want to include the data in the repository. Github currently warns if files are over 50MB and rejects files over 100MB. Some other options for storing large data include AWS S3 with a syncing tool (e.g., s3cmd), Git Large File Storage, Git Annex, and dat.

The prefered workflow if data is not in the repository is to have a make command make data that will download or create the relevant datasets.

Tip!

Press p or to see the previous file or, n or to see the next file

About

No description

Collaborators 1

Comments

Loading...