Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

dvc.yaml 1.6 KB

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
  1. stages:
  2. split:
  3. cmd: python3 main.py split
  4. deps:
  5. - data/CrossValidated-Questions-Nov-2020.csv
  6. - main.py
  7. params:
  8. - paths.dataset
  9. - paths.test_df
  10. - paths.train_df
  11. - split.seed
  12. outs:
  13. - outputs/test.csv.gz
  14. - outputs/train.csv.gz
  15. featurize:
  16. cmd: python3 main.py featurize
  17. deps:
  18. - main.py
  19. - outputs/test.csv.gz
  20. - outputs/train.csv.gz
  21. params:
  22. - paths.test_df
  23. - paths.test_df_featurized
  24. - paths.train_df
  25. - paths.train_df_featurized
  26. outs:
  27. - outputs/test-featurized.csv.gz
  28. - outputs/train-featurized.csv.gz
  29. tfidf:
  30. cmd: python3 main.py tfidf
  31. deps:
  32. - main.py
  33. - outputs/test-featurized.csv.gz
  34. - outputs/train-featurized.csv.gz
  35. params:
  36. - paths.test_df_featurized
  37. - paths.test_tfidf
  38. - paths.tfidf
  39. - paths.train_df_featurized
  40. - paths.train_tfidf
  41. - tfidf.max_features
  42. outs:
  43. - outputs/test-tfidf.npz
  44. - outputs/tfidf.pkl
  45. - outputs/train-tfidf.npz
  46. train:
  47. cmd: python3 main.py train
  48. deps:
  49. - main.py
  50. - outputs/train-featurized.csv.gz
  51. - outputs/train-tfidf.npz
  52. params:
  53. - paths.model
  54. - paths.train_df_featurized
  55. - paths.train_tfidf
  56. - train.loss
  57. - train.seed
  58. - paths.train_metrics
  59. outs:
  60. - outputs/model.pkl
  61. metrics:
  62. - metrics-train.yaml:
  63. cache: false
  64. test:
  65. cmd: python3 main.py test
  66. deps:
  67. - main.py
  68. - outputs/test-featurized.csv.gz
  69. - outputs/test-tfidf.npz
  70. - outputs/model.pkl
  71. params:
  72. - paths.model
  73. - paths.test_df_featurized
  74. - paths.test_tfidf
  75. - paths.test_metrics
  76. metrics:
  77. - metrics-test.yaml:
  78. cache: false
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...