
gpt-2

Code and samples from the paper "Language Models are Unsupervised Multitask Learners".

For now, we have only released a smaller (117M parameter) version of GPT-2.

See more details in our blog post: https://openai.com/blog/better-language-models/

Installation

Download the model data:

sh download_model.sh 117M

Install the Python packages:

pip3 install -r requirements.txt

Unconditional sample generation

WARNING: Samples are unfiltered and may contain offensive content.

To generate unconditional samples from the small model:

python3 src/generate_unconditional_samples.py | tee samples

There are various flags for controlling the samples:

python3 src/generate_unconditional_samples.py --top_k 40 --temperature 0.7 | tee samples
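
To make what these two flags do concrete, here is a minimal sketch of temperature and top-k sampling over a vector of logits. This is not the repository's actual sampling code; sample_token and its defaults are illustrative only.

import numpy as np

def sample_token(logits, temperature=0.7, top_k=40):
    # Temperature rescales the logits before the softmax: values
    # below 1 sharpen the distribution, values above 1 flatten it.
    logits = np.asarray(logits, dtype=np.float64) / temperature
    if top_k > 0:
        # Top-k truncation: keep only the k most likely tokens by
        # sending every other logit to -inf (probability zero).
        kth_largest = np.sort(logits)[-min(top_k, len(logits))]
        logits = np.where(logits < kth_largest, -np.inf, logits)
    # Softmax (shifted by the max for numerical stability), then sample.
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    return int(np.random.choice(len(probs), p=probs))

Lower temperatures and smaller top_k values both push generation toward the model's most confident choices, which is why samples with temperature 0.7 or top_k 40 tend to be more coherent than unfiltered ones.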

Conditional sample generation

To give the model custom prompts, you can use:

python3 src/interactive_conditional_samples.py --top_k 40
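
Conceptually, conditional generation seeds the same autoregressive sampling loop with your encoded prompt instead of an empty context. A rough sketch with a stand-in for the network (fake_logits and the token ids below are hypothetical; the real script runs the trained model in src/):

import numpy as np

def fake_logits(context):
    # Hypothetical stand-in for the trained network, which maps the
    # token context so far to logits over the vocabulary.
    rng = np.random.default_rng(len(context))
    return rng.normal(size=50)  # pretend 50-token vocabulary

def generate(prompt_ids, length=10, temperature=0.7):
    # The only difference from unconditional sampling: the context
    # starts with the prompt's token ids rather than empty.
    context = list(prompt_ids)
    for _ in range(length):
        logits = fake_logits(context) / temperature
        probs = np.exp(logits - logits.max())
        probs /= probs.sum()
        context.append(int(np.random.choice(len(probs), p=probs)))
    return context[len(prompt_ids):]

print(generate([12, 7, 42]))  # hypothetical prompt token ids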

GPT-2 samples

While we have not yet released GPT-2 itself, you can see some samples from it in the gpt-2-samples folder. We show unconditional samples with default settings (temperature 1 and no truncation), with temperature 0.7, and with top_k 40 truncation.

Future work

We may release code for evaluating the models on various benchmarks.

We are still considering release of the larger models.

License

Coming soon!

About

This is the DAGsHub mirror of GPT-2, made by OpenAI: code for the paper "Language Models are Unsupervised Multitask Learners".