
summarization

T5 Summarisation Using PyTorch Lightning

Instructions

  1. Clone the repo.
  2. Edit `params.yml` to change the parameters used to train the model.
  3. Run `make dirs` to create the missing parts of the directory structure described below.
  4. Optional: Run `make virtualenv` to create a Python virtual environment. Skip this step if you use conda or another environment manager.
    1. Run `source env/bin/activate` to activate the virtualenv.
  5. Run `make requirements` to install the required Python packages.
  6. Process your data, then train and evaluate your model, using `make run`.
  7. When you're happy with the result, commit the files (including `.dvc` files) to git.
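The Makefile itself is not shown here, so what `make dirs` in step 3 does exactly is not confirmed. As a rough illustration only, the following sketch creates the directories listed under Project Organization below. The `PROJECT_DIRS` list and the `make_dirs` helper are hypothetical names chosen for this example, not part of the repository.

```python
from pathlib import Path
from typing import List

# Assumed directory list, read off the Project Organization tree.
# The real `make dirs` target may create a different set.
PROJECT_DIRS = [
    "data/raw",
    "data/processed",
    "models",
    "notebooks",
    "references",
    "reports",
]


def make_dirs(root: str = ".") -> List[Path]:
    """Ensure each project directory exists under root and return the paths."""
    paths = []
    for rel in PROJECT_DIRS:
        path = Path(root) / rel
        # parents=True creates "data" before "data/raw";
        # exist_ok=True makes repeated runs harmless.
        path.mkdir(parents=True, exist_ok=True)
        paths.append(path)
    return paths


if __name__ == "__main__":
    for p in make_dirs():
        print(p)
```

Because existing directories are left untouched, running the sketch in an already-initialised checkout is safe.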

Project Organization

├── LICENSE
├── Makefile           <- Makefile with commands like `make dirs` or `make clean`
├── README.md          <- The top-level README for developers using this project.
├── data
│   ├── processed      <- The final, canonical data sets for modeling.
│   └── raw            <- The original, immutable data dump.
│
├── models             <- Trained and serialized models, model predictions, or model summaries
│
├── notebooks          <- Jupyter notebooks. Naming convention is a number (for ordering),
│                         the creator's initials, and a short `-` delimited description, e.g.
│                         `1.0-jqp-initial-data-exploration`.
├── references         <- Data dictionaries, manuals, and all other explanatory materials.
│
├── reports            <- Generated analysis as HTML, PDF, LaTeX, etc.
│   ├── metrics.txt    <- Relevant metrics after evaluating the model.
│   └── training_metrics.txt    <- Relevant metrics from training the model.
│
├── requirements.txt   <- The requirements file for reproducing the analysis environment
│
├── setup.py           <- makes project pip installable (pip install -e .) so src can be imported
├── src                <- Source code for use in this project.
│   ├── __init__.py    <- Makes src a Python module
│   │
│   ├── data           <- Scripts to download or generate data
│   │   ├── make_dataset.py
│   │   └── process_data.py
│   │
│   ├── models         <- Scripts to train models 
│   │   ├── predict_model.py
│   │   ├── train_model.py
│   │   ├── evaluate_model.py
│   │   └── model.py
│   │
│   └── visualization  <- Scripts to create exploratory and results oriented visualizations
│       └── visualize.py
│
├── tox.ini            <- tox file with settings for running tox; see tox.testrun.org
└── data.dvc          <- Training a model on the processed data.
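
The notebook naming convention in the tree above (a number for ordering, the creator's initials, and a short `-` delimited description) can be checked mechanically. The sketch below is one possible reading of that convention, written as a regular expression. The pattern and the `is_valid_notebook_name` helper are assumptions made for this example, and the project does not ship such a check as far as this README shows.

```python
import re

# Assumed pattern: "<number>-<initials>-<dash-delimited-description>",
# e.g. "1.0-jqp-initial-data-exploration". Lowercase only is an assumption.
NOTEBOOK_NAME = re.compile(r"\d+(\.\d+)?-[a-z]+-[a-z0-9]+(-[a-z0-9]+)*")


def is_valid_notebook_name(stem: str) -> bool:
    """Return True if a notebook file name (without .ipynb) fits the convention."""
    return NOTEBOOK_NAME.fullmatch(stem) is not None


if __name__ == "__main__":
    for name in ["1.0-jqp-initial-data-exploration", "exploration"]:
        print(name, is_valid_notebook_name(name))
```

Loosen the character classes if your team uses uppercase initials or underscores inside the description.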
