Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

params.yaml 1.8 KB

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
  1. seed: 123
  2. gpu: 1
  3. log_interval: 500
  4. feature: review
  5. label: sentiment
  6. pad_token: <pad>
  7. unk_token: <unk>
  8. sos_token: <sos>
  9. eos_token: <eos>
  10. max_len: 512
  11. basic:
  12. vocab_size: 50000
  13. min_freq: 3
  14. lstm:
  15. embed_dim: 128
  16. use_bag: false
  17. use_eos: true
  18. attention_method: concat
  19. hidden_size: 512
  20. n_layers: 2
  21. dropout: 0.1
  22. max_len: 256
  23. mlp:
  24. embed_dim: 128
  25. use_bag: true
  26. hidden_size: 512
  27. dropout: 0.1
  28. cnn:
  29. embed_dim: 128
  30. use_bag: false
  31. use_eos: true
  32. hidden_size: 512
  33. kernel_size: 3
  34. n_layers: 4
  35. dropout: 0.33
  36. max_len: 512
  37. selected:
  38. embed_size: 50
  39. use_bag: false
  40. attention_method: concat
  41. hidden_size: 512
  42. n_layers: 2
  43. dropout: 0.33
  44. train:
  45. batch_size: 16
  46. shuffle: true
  47. epochs: 6
  48. early_stops: 2
  49. optimizer:
  50. lr: 2e-5
  51. step_lr: 500
  52. gamma: 0.5
  53. clip: 1.0
  54. weight_decay: 1e-5
  55. validate:
  56. batch_size: 32
  57. shuffle: true
  58. epochs: 5
  59. kfold: 10
  60. early_stops: 3
  61. optimizer:
  62. lr: 1e-4
  63. step_lr: 500
  64. gamma: 0.5
  65. clip: 1.0
  66. weight_decay: 0
  67. evaluate:
  68. batch_size: 64
  69. bert:
  70. do_lower_case: true
  71. max_len: 128
  72. eval_max_len: 128
  73. bert_hidden_size: 1024
  74. basic:
  75. dropout: 0.1
  76. cnn:
  77. dropout: 0.1
  78. hidden_size: 1024
  79. kernel_size: 3
  80. lstm:
  81. hidden_size: 768
  82. dropout: 0.1
  83. n_layers: 2
  84. attention_method: concat
  85. xlnet:
  86. do_lower_case: true
  87. max_len: 128
  88. eval_max_len: 128
  89. bert_hidden_size: 1024
  90. basic:
  91. dropout: 0.1
  92. cnn:
  93. dropout: 0.1
  94. hidden_size: 1024
  95. kernel_size: 3
  96. roberta:
  97. do_lower_case: true
  98. max_len: 128
  99. eval_max_len: 128
  100. bert_hidden_size: 1024
  101. basic:
  102. dropout: 0.1
  103. cnn:
  104. dropout: 0.1
  105. hidden_size: 1024
  106. kernel_size: 3
  107. albert:
  108. do_lower_case: true
  109. max_len: 128
  110. eval_max_len: 128
  111. bert_hidden_size: 2048
  112. cnn:
  113. dropout: 0
  114. hidden_size: 2048
  115. kernel_size: 3
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...