Guy/fairseq

Message	Author	SHA1	Date
Allow larger maxlen (fixes #100) (#101)	Myle Ott	7e86e30cc5	6 years ago
Adjust weight decay by the current learning rate to make it work correctly during annealing	Sergey Edunov	9a95121633	6 years ago
Merge pull request #91 from facebookresearch/prepare_wmt	Sergey Edunov	e4c935aa17	6 years ago
spelling	Sergey Edunov	52b6119a53	6 years ago
Update README with new models	Sergey Edunov	2c18c27365	6 years ago
Merge pull request #95 from bastings/patch-1	Sergey Edunov	fb366144d2	6 years ago
Adding README and more parameters to En2De script	Sergey Edunov	971c2d6363	6 years ago
Ratio should be predlen/reflen not reflen/predlen	Joost Bastings	1ff3efce63	6 years ago
Merge branch 'master' of github.com:facebookresearch/fairseq-py into prepare_wmt	Sergey Edunov	d9f46c5472	6 years ago
Switch to news-commentary-v12	Sergey Edunov	4185d3ed02	6 years ago
Fixed Weight Decay Regularization in Adam	Michael Auli	ee36a6f3e3	6 years ago
Fix tests	Myle Ott	66d9fcf5c8	6 years ago
Output correct perplexity when training with --sentence-avg	Myle Ott	f9362e87bd	6 years ago
Fix max_positions calculation in train.py	Myle Ott	81ace092ef	6 years ago
Better warning message for inputs that are too long	Myle Ott	334694363b	6 years ago
ATen Fix	Michael Auli	66314a60d8	6 years ago
Momentum correction	Michael Auli	173c577b9d	6 years ago
Report log likelihood for label smoothing	Sergey Edunov	dd31fa92d0	6 years ago
Share input/output embed	Sergey Edunov	c5378602d4	6 years ago
Better support for torch.no_grad (since volatile is deprecated)	Myle Ott	907ca927eb	6 years ago
Fix training	Myle Ott	0b84ab197a	6 years ago
Save dictionary in model base classes	Myle Ott	5eddda8b8a	6 years ago
Fix gradient clipping when --clip-norm=0	Myle Ott	08a74a326f	6 years ago
Fix LearnedPositionalEmbedding	Myle Ott	fd28c8806b	6 years ago
Move normalization of model output (e.g., via LSM) into model definition	Myle Ott	4db6579a63	6 years ago
Move positional embeddings into LearnedPositionalEmbedding module	Myle Ott	c21a6e2993	6 years ago
Fix warning about deprecated `volatile` kwarg for Variables	Myle Ott	185a0df599	6 years ago
Add option to SequenceGenerator to retain dropout	Myle Ott	dccf79092e	6 years ago
Add --max-sentences-valid to train.py	Myle Ott	c542884dec	6 years ago
Streamline data formatting utils	Myle Ott	eb005cdb2b	6 years ago

Newer Older

Guy / fairseq

Guy
/
fairseq