Tolstoyevsky fc04b378e3
Universal decoder seems ready (still need to fix minor architecture mismatches in the layers, mainly dropout positions)
5 years ago
56f9ec3c38
Use ATen built-in conv_tbc method (#66)
6 years ago
82a9f9230f
Fix arg formatting in preprocess.py and add fmt control for black formatting (#399)
5 years ago
42be3ebd41
Merge internal changes (#483)
5 years ago
fc04b378e3
Universal decoder seems ready (still need to fix minor architecture mismatches in the layers, mainly dropout positions)
5 years ago
fc04b378e3
Universal decoder seems ready (still need to fix minor architecture mismatches in the layers, mainly dropout positions)
5 years ago
3e67386bbc
Adafactor Optimizer (#472)
5 years ago
bbb4120b00
Support custom Dictionary implementations in 'preprocess.py' (#448)
5 years ago
8eb232ce15
Merge internal changes
5 years ago
42be3ebd41
Merge internal changes (#483)
5 years ago
7e0d222cdd
Only use c10d distributed primitives
5 years ago
0daba38ecb
Save and restore wall time in checkpoints
6 years ago
6641520612
fairseq-py goes distributed (#106)
6 years ago
bbb4120b00
Support custom Dictionary implementations in 'preprocess.py' (#448)
5 years ago
fc312d28d3
ability to checkpoint when reaching certain number of updates
6 years ago
8ce6499dbf
Merge internal changes (#422)
5 years ago
7633129ba8
Merge internal changes (#283)
5 years ago
7633129ba8
Merge internal changes (#283)
5 years ago
38f1dee950
Enforce UTF-8 when open() text files (#460)
5 years ago
c49c292c79
Add CheckpointManager to keep avg checkpoint weights in memory to reduce disk read when averaging + various checkpoint refactoring
5 years ago
c49c292c79
Add CheckpointManager to keep avg checkpoint weights in memory to reduce disk read when averaging + various checkpoint refactoring
5 years ago
