README.txt
TAU RL DQN Project

By:
Guy Smoilovsky 305007916
Tomer Ben Moshe 303016661
Emma Rapoport 301481958

main.py was not changed much from the original (apart from changing hyperparameters).
dqn_model.py has a couple of new models in addition to the ones provided.
dqn_learn.py was modified a bit from the original (beyond implementing the required bits of code) to allow saving and loading checkpoints.
This was required for training on Google Colab, as it was impossible to get enough consecutive training time due to disconnections.
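The checkpointing idea can be illustrated with a minimal sketch. This is not the code in dqn_learn.py: the function names save_checkpoint and load_checkpoint, the use of pickle, and the contents of the state dictionary are all assumptions made for illustration. The state is written to a temporary file first and then renamed, so a disconnection in the middle of a save does not leave a corrupted checkpoint behind.

```python
import os
import pickle


def save_checkpoint(path, state):
    """Write `state` to `path` (hypothetical helper, not from the project)."""
    tmp_path = path + ".tmp"
    with open(tmp_path, "wb") as f:
        pickle.dump(state, f)
    # os.replace swaps the finished file into place in a single step,
    # so an interrupted save never overwrites the last good checkpoint.
    os.replace(tmp_path, path)


def load_checkpoint(path, default=None):
    """Return the saved state, or `default` if no checkpoint exists yet."""
    if not os.path.exists(path):
        return default
    with open(path, "rb") as f:
        return pickle.load(f)
```

A training loop would call load_checkpoint once at startup to resume from the last saved step, and save_checkpoint every fixed number of steps.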
experiments.py is used as the entry point (main script) for all experiments beyond the scope of questions 1 and 2.
It runs either double_dqn_learn.py or dqn_learn_ex.py depending on command line arguments.
The two new dqn_learn modules are very similar to the original dqn_learn.py, but were changed to allow easier logging and checkpoints, and to allow double DQN.
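For reference, the difference between the classic DQN target and the double DQN target can be sketched for a single transition. This is a plain-Python illustration with hypothetical function names, not the code in double_dqn_learn.py, which presumably works on batched tensors. In classic DQN the target network both picks and evaluates the next action; in double DQN the online network picks the action and the target network evaluates it.

```python
def dqn_target(reward, done, gamma, q_target_next):
    """Classic DQN: take the maximum of the target network's Q-values."""
    if done:
        return reward
    return reward + gamma * max(q_target_next)


def double_dqn_target(reward, done, gamma, q_online_next, q_target_next):
    """Double DQN: online network selects, target network evaluates."""
    if done:
        return reward
    # Index of the action the online network considers best in the next state.
    best_action = max(range(len(q_online_next)), key=q_online_next.__getitem__)
    return reward + gamma * q_target_next[best_action]
```

Here q_online_next and q_target_next are the lists of Q-values that the online and target networks assign to each action in the next state.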
Example of running experiments.py:
    python experiments.py save_path=saved_state_double_dqn_separable_adam_qbert model=DQN_SEPARABLE learning_func=double_dqn optimizer=Adam gym_task_index=4
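The key=value arguments in the command above could be handled along the following lines. This is a sketch under assumptions, not the parsing code in experiments.py: parse_kwargs and select_learner are hypothetical names, and the choice of dqn_learn_ex as the fallback when learning_func is absent is a guess.

```python
def parse_kwargs(argv):
    """Turn ['key=value', ...] into a dict; all values stay strings."""
    kwargs = {}
    for arg in argv:
        key, sep, value = arg.partition("=")
        if not sep or not key:
            raise ValueError(f"expected key=value, got {arg!r}")
        kwargs[key] = value
    return kwargs


def select_learner(kwargs):
    """Pick the learner module name (hypothetical dispatch rule)."""
    if kwargs.get("learning_func") == "double_dqn":
        return "double_dqn_learn"
    # Assumed default: fall back to the extended classic DQN learner.
    return "dqn_learn_ex"
```

With this sketch, numeric options such as gym_task_index arrive as strings and would need an explicit int(...) conversion before use.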