Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
Commit History
Message Author SHA1 Date
stitch preprocessing pipeline   Ruty Rinott 5 years ago
Add standalone binaries   Myle Ott 5 years ago
Support custom Dictionary implementations in 'preprocess.py' (#448)   Davide Caroselli 5 years ago
make dictionary class as input for fairseq preprocess functions (#482)   Jingfei Du 5 years ago
Enforce UTF-8 when open() text files (#460)   Davide Caroselli 5 years ago
change f"{args}" to "{}".format(args) (#467)   vufg 5 years ago
FIX: '--user-dir' on multi-gpu (#449)   Davide Caroselli 5 years ago
Merge internal changes (#283)   Myle Ott 5 years ago
Fix arg formatting in preprocess.py and add fmt control for black formatting (#399)   Myle Ott 5 years ago
Fix build for docs   Myle Ott 5 years ago
pipeline for LM training   Ruty Rinott 5 years ago
Parallel preprocessing   Sergey Edunov 5 years ago
Add documentation   Myle Ott 5 years ago
Correct the help name of the prefixes arguments (#234)   alvations 5 years ago
Fix `--output-format raw` option to preprocess.py (Fixes #188) (#190)   Myle Ott 5 years ago
Conv lm implementation   alexeib 6 years ago
Fix preprocess.py   Myle Ott 6 years ago
Pad dictionary to be a multiple of 8 in preprocessing   Myle Ott 6 years ago
Allow more flexible pre-processing and generation (#227)   Sergey Edunov 6 years ago
Fix tests and flake8   Myle Ott 6 years ago
fairseq-py goes distributed (#106)   Myle Ott 6 years ago
Remove Python3.6 format string from preprocess.py (fixes #60) (#61)   Myle Ott 6 years ago
Replace unk with original string   Louis Martin 6 years ago
Support custom dictionary in preprocess.py   Myle Ott 6 years ago
Fix flake8 warnings   Louis Martin 6 years ago
Initial commit   Sergey Edunov 6 years ago