Too many stages in your DVC graph.

It seems your pipeline has many nodes!
Consider adding directories instead of adding many single files.

dvc add path/to/dir

No Description

danigrim 217422538c
add files from google drive
217422538c
add files from google drive
5 months ago
635e57a765
add_files
5 months ago
env
217422538c
add files from google drive
5 months ago
src
635e57a765
add_files
5 months ago
217422538c
add files from google drive
5 months ago
1af76f4f43
stop tracking data
5 months ago
ff7fb0f107
add data to dvc
5 months ago
635e57a765
add_files
5 months ago
ff7fb0f107
add data to dvc
5 months ago
635e57a765
add_files
5 months ago
Data Pipeline

We couldn't calculate your pipeline. It seems it has too many nodes!

README.md

First Repo Project

This project is a simple 'Ham or Spam' classifier for emails using the Enron data set. It contains two python code files, 5 data files, and one constants file.

  • code directory - holds the data-preprocessing and modeling files:
    • data-preprocessing.py - processing the raw data (enron.csv), splits it to train and test sets, and saves it to the data directory.
    • modeling.py - simple Random Forest Regressor.
  • data directory - contains the raw and processed data.
  • src - contains the constants file.
  • requirements.txt - python dependencies that are required to run the python files.
  • README.md - Read me file.