Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
Commit History
Message Author SHA1 Date
Ran data prep dvc stage   Guy 5 years ago
Change logo size   Dean 5 years ago
Calculated BPE using DVC   Guy 5 years ago
Added missing commoncrawl dependency   Tolstoyevsky 5 years ago
Refactor data preparation a bit   Tolstoyevsky 5 years ago
Universal encoder seems ready   Tolstoyevsky 5 years ago
Small things - hyperparam yaml, script to clean corrupted unicode   Tolstoyevsky 5 years ago
Finished preprocessing   Tolstoyevsky 5 years ago
Setup the stub of the preprocessing dvc stage   Tolstoyevsky 5 years ago
Fixed commoncrawl yet again   Tolstoyevsky 5 years ago
Fixed training.dvc   Tolstoyevsky 5 years ago
Reorganized dvc   Tolstoyevsky 5 years ago
Working on the prepare script, tokenization looks like it's working   Tolstoyevsky 5 years ago
Switched to the correct dataset (nc-12 instead of nc-13)   Tolstoyevsky 5 years ago
Unzipped raw data   Tolstoyevsky 5 years ago
Moved data into data folder   Tolstoyevsky 5 years ago
Init DVC + downloaded and tracking data   Tolstoyevsky 5 years ago
stitch preprocessing pipeline   Ruty Rinott 5 years ago
Add CheckpointManager to keep avg checkpoint weights in memory to reduce disk read when averaging + various checkpoint refactoring   Wei Ho 5 years ago
Add standalone binaries   Myle Ott 5 years ago
Support custom Dictionary implementations in 'preprocess.py' (#448)   Davide Caroselli 5 years ago
Do distributed init after data loading   Myle Ott 5 years ago
Add --input option to interactive.py to support reading from file   Myle Ott 5 years ago
Merge internal changes (#483)   Myle Ott 5 years ago
make dictionary class as input for fairseq preprocess functions (#482)   Jingfei Du 5 years ago
Add code for "Pay Less Attention with Lightweight and Dynamic Convolutions" (#473)   Myle Ott 5 years ago
refactor AdversarialTrainer factor out helper functions   Xian Li 5 years ago
Adafactor Optimizer (#472)   Lucio Dery 5 years ago
Only use c10d distributed primitives   Myle Ott 5 years ago
LSTM improvements (fixes #414)   Myle Ott 5 years ago