Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

dvc.lock 2.1 KB

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
  1. schema: '2.0'
  2. stages:
  3. preprocess:
  4. cmd: python scripts/preprocess.py
  5. deps:
  6. - path: data/raw/arxiv_raw.csv
  7. md5: 700b601ba33941b1c43b50da539211b5
  8. size: 7025160
  9. - path: data/raw/biorxiv_raw.csv
  10. md5: e08273e5c158ae8d4f780b6131d74ca7
  11. size: 53702451
  12. - path: data/raw/pubmed_raw.csv
  13. md5: 3adbc5b6eb565fa98a26d67dbee40a14
  14. size: 458192680
  15. - path: data/raw/scopus_raw.csv
  16. md5: b710ece2fff58f13adc49163380f5fc0
  17. size: 1314681357
  18. - path: scripts/preprocess.py
  19. md5: fea1318dfb56b1728390247a0060f26e
  20. size: 3125
  21. outs:
  22. - path: data/prepared/arxiv_covid_19.csv
  23. md5: 47b1dd8524502033efa05330bce3f5ce
  24. size: 7602253
  25. - path: data/prepared/biorxiv_covid_19.csv
  26. md5: 5b04f770213da5e6bb93651b91fea0a1
  27. size: 65911895
  28. - path: data/prepared/pubmed_covid_19.csv
  29. md5: 1539b4fe3320fa5aa3fbfc093d3b0423
  30. size: 441640004
  31. - path: data/prepared/scopus_covid_19.csv
  32. md5: 549a576ac8bb25b16f6c45bd60053249
  33. size: 1135142596
  34. merge:
  35. cmd: python scripts/merge_datasets.py
  36. deps:
  37. - path: data/prepared/arxiv_covid_19.csv
  38. md5: 47b1dd8524502033efa05330bce3f5ce
  39. size: 7602253
  40. - path: data/prepared/biorxiv_covid_19.csv
  41. md5: 5b04f770213da5e6bb93651b91fea0a1
  42. size: 65911895
  43. - path: data/prepared/pubmed_covid_19.csv
  44. md5: 1539b4fe3320fa5aa3fbfc093d3b0423
  45. size: 441640004
  46. - path: data/prepared/scopus_covid_19.csv
  47. md5: 549a576ac8bb25b16f6c45bd60053249
  48. size: 1135142596
  49. - path: scripts/merge_datasets.py
  50. md5: 3d650209936a6d96a2bc5f2f93501e12
  51. size: 16804
  52. outs:
  53. - path: data/raw/final_raw.csv
  54. md5: 60f26d5672a4cb811e0ef8f3f172890f
  55. size: 1362212602
  56. preprocess_final:
  57. cmd: python scripts/preprocess.py final
  58. deps:
  59. - path: data/raw/final_raw.csv
  60. md5: 60f26d5672a4cb811e0ef8f3f172890f
  61. size: 1362212602
  62. - path: scripts/preprocess.py
  63. md5: fea1318dfb56b1728390247a0060f26e
  64. size: 3125
  65. outs:
  66. - path: data/prepared/final_covid_19.csv
  67. md5: 66ce368389ba37f9f1f88568b8224ff9
  68. size: 1401204171
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...