Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

dvc.lock 41 KB

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
1001
1002
1003
1004
1005
1006
1007
1008
1009
1010
1011
1012
1013
1014
1015
1016
1017
1018
1019
1020
1021
1022
1023
1024
1025
1026
1027
1028
1029
1030
1031
1032
1033
1034
1035
1036
1037
1038
1039
1040
1041
1042
1043
1044
1045
1046
1047
1048
1049
1050
1051
1052
1053
1054
1055
1056
1057
1058
1059
1060
1061
1062
1063
1064
1065
1066
1067
1068
1069
1070
1071
1072
1073
1074
1075
1076
1077
1078
1079
1080
1081
1082
1083
1084
1085
1086
1087
1088
1089
1090
1091
1092
1093
1094
1095
1096
1097
1098
1099
1100
1101
1102
1103
1104
1105
1106
1107
1108
1109
1110
1111
1112
1113
1114
1115
1116
1117
1118
1119
1120
1121
1122
1123
1124
1125
1126
1127
1128
1129
1130
1131
1132
1133
1134
1135
1136
1137
1138
1139
1140
1141
1142
1143
1144
  1. schema: '2.0'
  2. stages:
  3. preprocess_1151-commits:
  4. cmd: cp downloaded-data/1151-commits.csv data && echo "data/1151-commits.csv"
  5. >> .gitignore && git add .gitignore
  6. deps:
  7. - path: downloaded-data/1151-commits.csv
  8. md5: dd000fe19ba4aac9efa3a3856e2acc5e
  9. size: 346306
  10. outs:
  11. - path: data/1151-commits.csv
  12. md5: dd000fe19ba4aac9efa3a3856e2acc5e
  13. size: 346306
  14. preprocess_herzig:
  15. cmd: cp downloaded-data/herzig.csv data && echo "data/herzig.csv" >> .gitignore
  16. && git add .gitignore
  17. deps:
  18. - path: downloaded-data/herzig.csv
  19. md5: 69a17c08643aed84b874384a2a57c7ed
  20. size: 1483281
  21. outs:
  22. - path: data/herzig.csv
  23. md5: 69a17c08643aed84b874384a2a57c7ed
  24. size: 1483281
  25. preprocess_smells-test:
  26. cmd: data-preprocessing/smells.sh
  27. deps:
  28. - path: data-preprocessing/smells.sh
  29. md5: 1792bc2011c1aba4d51cdca74beee11e
  30. size: 2148
  31. - path: downloaded-data/smells-madeyski.csv
  32. md5: 3d60d277b9fa1306c05ccfdefe22e9d1
  33. size: 7513770
  34. outs:
  35. - path: data/smells/test.csv
  36. md5: 0200db0eec17554a48a5b3a25719fd03
  37. size: 77607
  38. parse_labels:
  39. cmd: bohr porcelain parse-labels
  40. deps:
  41. - path: labels
  42. md5: f54bde6a2ca21ad1a0ba7d4ff9a5b9a5.dir
  43. size: 619
  44. nfiles: 2
  45. outs:
  46. - path: labels.py
  47. md5: 1404972881fc94fbf1039b625bd4ccc0
  48. size: 1859
  49. smells_apply_heuristics__heuristics_smells__smells-test:
  50. cmd: bohr porcelain apply-heuristics smells --heuristic-group heuristics.smells
  51. --dataset smells-test
  52. deps:
  53. - path: data/smells/test.csv
  54. md5: 0200db0eec17554a48a5b3a25719fd03
  55. size: 77607
  56. - path: heuristics/smells.py
  57. md5: b1a5ed3a14eb9eae8924b8a43e3bc452
  58. size: 712
  59. - path: labels.py
  60. md5: 1404972881fc94fbf1039b625bd4ccc0
  61. size: 1859
  62. params:
  63. bohr.json:
  64. bohr_framework_version: 0.4.10
  65. outs:
  66. - path: generated/smells/heuristics.smells/heuristic_matrix_smells-test.pkl
  67. md5: a8924b3413a7258e4d22510885b4b886
  68. size: 4230
  69. - path: metrics/smells/heuristics.smells/heuristic_metrics_smells-test.json
  70. md5: 2a7a29682c91259100e8d087b3accb4c
  71. size: 72
  72. preprocess_smells-train:
  73. cmd: data-preprocessing/smells.sh
  74. deps:
  75. - path: data-preprocessing/smells.sh
  76. md5: 1792bc2011c1aba4d51cdca74beee11e
  77. size: 2148
  78. - path: downloaded-data/smells-madeyski.csv
  79. md5: 3d60d277b9fa1306c05ccfdefe22e9d1
  80. size: 7513770
  81. outs:
  82. - path: data/smells/train.csv
  83. md5: 7fc9a7617e6f201523fba311317ba48f
  84. size: 296970
  85. smells_apply_heuristics__heuristics_smells__smells-train:
  86. cmd: bohr porcelain apply-heuristics smells --heuristic-group heuristics.smells
  87. --dataset smells-train
  88. deps:
  89. - path: data/smells/train.csv
  90. md5: 7fc9a7617e6f201523fba311317ba48f
  91. size: 296970
  92. - path: heuristics/smells.py
  93. md5: b1a5ed3a14eb9eae8924b8a43e3bc452
  94. size: 712
  95. - path: labels.py
  96. md5: 1404972881fc94fbf1039b625bd4ccc0
  97. size: 1859
  98. params:
  99. bohr.json:
  100. bohr_framework_version: 0.4.10
  101. outs:
  102. - path: generated/smells/heuristics.smells/heuristic_matrix_smells-train.pkl
  103. md5: aa1818fe86e1d8d999ddb4a0f7c20ad9
  104. size: 14312
  105. - path: metrics/smells/heuristics.smells/heuristic_metrics_smells-train.json
  106. md5: 576a36563e43ce643bdd861930558217
  107. size: 32
  108. smells_combine_heuristics:
  109. cmd: bohr porcelain apply-heuristics smells
  110. deps:
  111. - path: generated/smells/heuristics.smells/heuristic_matrix_smells-test.pkl
  112. md5: a8924b3413a7258e4d22510885b4b886
  113. size: 4230
  114. - path: generated/smells/heuristics.smells/heuristic_matrix_smells-train.pkl
  115. md5: aa1818fe86e1d8d999ddb4a0f7c20ad9
  116. size: 14312
  117. params:
  118. bohr.json:
  119. bohr_framework_version: 0.4.10
  120. outs:
  121. - path: generated/smells/analysis_smells-test.csv
  122. md5: b3e54618879091f6014dbb90abffb50d
  123. size: 336
  124. - path: generated/smells/analysis_smells-train.csv
  125. md5: 276a5dd60ab0bfc2da69c0c8c1bf843a
  126. size: 250
  127. - path: generated/smells/heuristic_matrix_smells-test.pkl
  128. md5: b1fd4c196e2f967e9fca6b8f9c572272
  129. size: 4230
  130. - path: generated/smells/heuristic_matrix_smells-train.pkl
  131. md5: 2aa34f53b90128efb74215acdf7d995b
  132. size: 14312
  133. - path: metrics/smells/analysis_smells-test.json
  134. md5: 23e9b96402687da1a63c6765c925377e
  135. size: 1118
  136. - path: metrics/smells/analysis_smells-train.json
  137. md5: 88c0d6e20b8d101f5309d2f951cb6ee7
  138. size: 696
  139. - path: metrics/smells/heuristic_metrics_smells-test.json
  140. md5: 2a7a29682c91259100e8d087b3accb4c
  141. size: 72
  142. - path: metrics/smells/heuristic_metrics_smells-train.json
  143. md5: 576a36563e43ce643bdd861930558217
  144. size: 32
  145. bugginess_apply_heuristics__heuristics_bugginess__1151-commits:
  146. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.bugginess
  147. --dataset 1151-commits
  148. deps:
  149. - path: data/1151-commits.csv
  150. md5: dd000fe19ba4aac9efa3a3856e2acc5e
  151. size: 346306
  152. - path: heuristics/bugginess.py
  153. md5: a55520b99cbaad572a26886b52e74652
  154. size: 8900
  155. - path: labels.py
  156. md5: 1404972881fc94fbf1039b625bd4ccc0
  157. size: 1859
  158. params:
  159. bohr.json:
  160. bohr_framework_version: 0.4.10
  161. outs:
  162. - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_1151-commits.pkl
  163. md5: 732df7ec50a8165d7e9a1e79415064b5
  164. size: 2792584
  165. - path: metrics/bugginess/heuristics.bugginess/heuristic_metrics_1151-commits.json
  166. md5: 590f784b4669d243ce5bff2a8d09345b
  167. size: 73
  168. preprocess_berger:
  169. cmd: cp downloaded-data/berger.csv data && echo "data/berger.csv" >> .gitignore
  170. && git add .gitignore
  171. deps:
  172. - path: downloaded-data/berger.csv
  173. md5: 126de41c9204a9e807e72406b1f9d631
  174. size: 62247
  175. outs:
  176. - path: data/berger.csv
  177. md5: 126de41c9204a9e807e72406b1f9d631
  178. size: 62247
  179. bugginess_apply_heuristics__heuristics_bugginess__berger:
  180. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.bugginess
  181. --dataset berger
  182. deps:
  183. - path: data/berger.csv
  184. md5: 126de41c9204a9e807e72406b1f9d631
  185. size: 62247
  186. - path: heuristics/bugginess.py
  187. md5: a55520b99cbaad572a26886b52e74652
  188. size: 8900
  189. - path: labels.py
  190. md5: 1404972881fc94fbf1039b625bd4ccc0
  191. size: 1859
  192. params:
  193. bohr.json:
  194. bohr_framework_version: 0.4.10
  195. outs:
  196. - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_berger.pkl
  197. md5: e93deda462057f4a205960cf0d24d2ec
  198. size: 917768
  199. - path: metrics/bugginess/heuristics.bugginess/heuristic_metrics_berger.json
  200. md5: feb313c11f1c1afb0fc58ef5ad73ab6a
  201. size: 73
  202. preprocess_bugginess-train:
  203. cmd: 7z x downloaded-data/bugginess_train.7z -odata/bugginess_train && echo "data/bugginess_train"
  204. >> .gitignore && git add .gitignore
  205. deps:
  206. - path: downloaded-data/bugginess_train.7z
  207. md5: d4dc26c2b0f0704b1559f2c0ce6320d7
  208. size: 255969433
  209. outs:
  210. - path: data/bugginess_train
  211. md5: f7cbfc7a91dfeca3aff7b7d3b6d7ea72.dir
  212. size: 2489726547
  213. nfiles: 3
  214. bugginess_apply_heuristics__heuristics_bugginess__bugginess-train:
  215. cmd: bohr apply-heuristics bugginess --heuristic-group heuristics.bugginess --dataset
  216. bugginess-train
  217. deps:
  218. - path: data/bugginess_train
  219. md5: f7cbfc7a91dfeca3aff7b7d3b6d7ea72.dir
  220. size: 2489726547
  221. nfiles: 3
  222. - path: heuristics/bugginess.py
  223. md5: 9f9ea19cd5c53bbbd41f94cf7b8f3d14
  224. size: 2873
  225. - path: heuristics/keywords
  226. md5: b4e7587c1b8e4e1461685a305d48bd66.dir
  227. size: 1382
  228. nfiles: 5
  229. - path: labels.py
  230. md5: 4ad220b4c289b2d8597bd6431c6565a6
  231. size: 1707
  232. params:
  233. bohr.json:
  234. bohr_framework_version: 0.4.2
  235. outs:
  236. - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_bugginess-train.pkl
  237. md5: 5d1c71dcd36417356cabe2e340ca959d
  238. size: 500879984
  239. - path: metrics/bugginess/heuristics.bugginess/heuristic_metrics_bugginess-train.json
  240. md5: 9c903723760f0000193679b361437e41
  241. size: 32
  242. bugginess_apply_heuristics__heuristics_bugginess__herzig:
  243. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.bugginess
  244. --dataset herzig
  245. deps:
  246. - path: data/herzig.csv
  247. md5: 69a17c08643aed84b874384a2a57c7ed
  248. size: 1483281
  249. - path: heuristics/bugginess.py
  250. md5: a55520b99cbaad572a26886b52e74652
  251. size: 8900
  252. - path: labels.py
  253. md5: 1404972881fc94fbf1039b625bd4ccc0
  254. size: 1859
  255. params:
  256. bohr.json:
  257. bohr_framework_version: 0.4.10
  258. outs:
  259. - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_herzig.pkl
  260. md5: 15bf08d69bf793c3978a2ffe1458c76d
  261. size: 12608792
  262. - path: metrics/bugginess/heuristics.bugginess/heuristic_metrics_herzig.json
  263. md5: 525291ad999e1bee97789045c1ae8333
  264. size: 72
  265. bugginess_combine_heuristics:
  266. cmd: bohr porcelain apply-heuristics bugginess
  267. deps:
  268. - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_1151-commits.pkl
  269. md5: 732df7ec50a8165d7e9a1e79415064b5
  270. size: 2792584
  271. - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_200k-commits.pkl
  272. md5: fa93c70bb58f111b85db389bd0c74ad1
  273. size: 498669340
  274. - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_berger.pkl
  275. md5: e93deda462057f4a205960cf0d24d2ec
  276. size: 917768
  277. - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_developer-labeled-commits.pkl
  278. md5: fbbed1142254d6cd9b922290565a6313
  279. size: 2348040
  280. - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_fine-grained-refactorings.pkl
  281. md5: b9c7d542ac5cc3e3aa9a2114d794a0f8
  282. size: 3708248
  283. - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_herzig.pkl
  284. md5: 15bf08d69bf793c3978a2ffe1458c76d
  285. size: 12608792
  286. - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_1151-commits.pkl
  287. md5: 6586d8bf9010aff3be65327facde2edc
  288. size: 9975
  289. - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_200k-commits.pkl
  290. md5: 7febb1ec7e27a0b380e83a4c732946ab
  291. size: 1651964
  292. - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_berger.pkl
  293. md5: e93c0bbc779f1cf928251c4613ef5cc6
  294. size: 3767
  295. - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_developer-labeled-commits.pkl
  296. md5: ff48bc7a29abb4802a76e4d4882edd86
  297. size: 8503
  298. - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_fine-grained-refactorings.pkl
  299. md5: e1466fb17a3ce93547649ebd5d6e2210
  300. size: 13007
  301. - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_herzig.pkl
  302. md5: 8e22aefcc1b79b3bc8c00ebd4503dca3
  303. size: 42479
  304. params:
  305. bohr.json:
  306. bohr_framework_version: 0.4.10
  307. outs:
  308. - path: generated/bugginess/analysis_1151-commits.csv
  309. md5: b9a8293c90963f2ce36669ded72d2e1a
  310. size: 24263
  311. - path: generated/bugginess/analysis_200k-commits.csv
  312. md5: c0fffd05e85647c9ebc9f005e279b798
  313. size: 30404
  314. - path: generated/bugginess/analysis_berger.csv
  315. md5: 4994f12a769112913fba6a9a7dbb0eb4
  316. size: 21371
  317. - path: generated/bugginess/analysis_developer-labeled-commits.csv
  318. md5: b6c2c87d6a2ce080928f504848f2d4ae
  319. size: 22425
  320. - path: generated/bugginess/analysis_fine-grained-refactorings.csv
  321. md5: 9367a5bed1da8d9f21967129870f31e8
  322. size: 21050
  323. - path: generated/bugginess/analysis_herzig.csv
  324. md5: ea0eb8e17dbdb9b9ebcf5f7cedbc979a
  325. size: 25808
  326. - path: generated/bugginess/heuristic_matrix_1151-commits.pkl
  327. md5: a0a1a7fbd7512b6cb31eb06d61c3bf9c
  328. size: 2801973
  329. - path: generated/bugginess/heuristic_matrix_200k-commits.pkl
  330. md5: e4dd0ab575a03ed8b0a142176ac9ecf7
  331. size: 500320716
  332. - path: generated/bugginess/heuristic_matrix_berger.pkl
  333. md5: 1b5504acb50064f2971a00799380d780
  334. size: 920949
  335. - path: generated/bugginess/heuristic_matrix_developer-labeled-commits.pkl
  336. md5: 6ee04d7e28587b9134b30f6a3d9ef3d9
  337. size: 2355957
  338. - path: generated/bugginess/heuristic_matrix_fine-grained-refactorings.pkl
  339. md5: e8c17b404564787dc5bdffdff4b75b18
  340. size: 3720669
  341. - path: generated/bugginess/heuristic_matrix_herzig.pkl
  342. md5: c604a8886af18d4982a1f3626fa57619
  343. size: 12650685
  344. - path: metrics/bugginess/analysis_1151-commits.json
  345. md5: cfd93443f948403db776e31f346eecb4
  346. size: 108603
  347. - path: metrics/bugginess/analysis_200k-commits.json
  348. md5: 2aa1f13d807bd4ad4821128353c1747e
  349. size: 78819
  350. - path: metrics/bugginess/analysis_berger.json
  351. md5: 3d99febdbd632bb38a06cdb7c2bf4e62
  352. size: 105095
  353. - path: metrics/bugginess/analysis_developer-labeled-commits.json
  354. md5: c8df22c135bd5813bcf9682a0e1e375a
  355. size: 106171
  356. - path: metrics/bugginess/analysis_fine-grained-refactorings.json
  357. md5: 4dcc67f60b3dafe18542bc800be719d1
  358. size: 65913
  359. - path: metrics/bugginess/analysis_herzig.json
  360. md5: 30b7f8dc4e151d6d5691ed1965b09aff
  361. size: 110390
  362. - path: metrics/bugginess/heuristic_metrics_1151-commits.json
  363. md5: 590f784b4669d243ce5bff2a8d09345b
  364. size: 73
  365. - path: metrics/bugginess/heuristic_metrics_200k-commits.json
  366. md5: 4c5581c69bb80690b5edffe31c2948cf
  367. size: 31
  368. - path: metrics/bugginess/heuristic_metrics_berger.json
  369. md5: feb313c11f1c1afb0fc58ef5ad73ab6a
  370. size: 73
  371. - path: metrics/bugginess/heuristic_metrics_developer-labeled-commits.json
  372. md5: 936006aa024d80022004b24caf729c4d
  373. size: 73
  374. - path: metrics/bugginess/heuristic_metrics_fine-grained-refactorings.json
  375. md5: 164d888c48bfb076fa97782a2aa703a8
  376. size: 31
  377. - path: metrics/bugginess/heuristic_metrics_herzig.json
  378. md5: 525291ad999e1bee97789045c1ae8333
  379. size: 72
  380. bugginess_train_label_model:
  381. cmd: bohr porcelain train-label-model bugginess 200k-commits
  382. deps:
  383. - path: data/1151-commits.csv
  384. md5: dd000fe19ba4aac9efa3a3856e2acc5e
  385. size: 346306
  386. - path: data/berger.csv
  387. md5: 126de41c9204a9e807e72406b1f9d631
  388. size: 62247
  389. - path: data/developer-labeled.csv
  390. md5: db835bc072a7fbfb2fa947c1d5dbb1aa
  391. size: 121817
  392. - path: data/herzig.csv
  393. md5: 69a17c08643aed84b874384a2a57c7ed
  394. size: 1483281
  395. - path: generated/bugginess/heuristic_matrix_1151-commits.pkl
  396. md5: a0a1a7fbd7512b6cb31eb06d61c3bf9c
  397. size: 2801973
  398. - path: generated/bugginess/heuristic_matrix_200k-commits.pkl
  399. md5: e4dd0ab575a03ed8b0a142176ac9ecf7
  400. size: 500320716
  401. - path: generated/bugginess/heuristic_matrix_berger.pkl
  402. md5: 1b5504acb50064f2971a00799380d780
  403. size: 920949
  404. - path: generated/bugginess/heuristic_matrix_developer-labeled-commits.pkl
  405. md5: 6ee04d7e28587b9134b30f6a3d9ef3d9
  406. size: 2355957
  407. - path: generated/bugginess/heuristic_matrix_herzig.pkl
  408. md5: c604a8886af18d4982a1f3626fa57619
  409. size: 12650685
  410. params:
  411. bohr.json:
  412. bohr_framework_version: 0.4.10
  413. outs:
  414. - path: generated/bugginess/label_model.pkl
  415. md5: 27c72792ac98db20c3c5cbbc3768de40
  416. size: 1875432
  417. - path: generated/bugginess/label_model_weights.csv
  418. md5: 69b5d4a9e3903ac9d392386c61b1c9b2
  419. size: 20318
  420. - path: metrics/bugginess/label_model_metrics.json
  421. md5: d3589a6129da95be758f794d340fc0af
  422. size: 642
  423. bugginess_label_dataset_herzig:
  424. cmd: bohr porcelain label-dataset bugginess herzig
  425. deps:
  426. - path: data/herzig.csv
  427. md5: 69a17c08643aed84b874384a2a57c7ed
  428. size: 1483281
  429. - path: generated/bugginess/heuristic_matrix_herzig.pkl
  430. md5: c604a8886af18d4982a1f3626fa57619
  431. size: 12650685
  432. - path: generated/bugginess/label_model.pkl
  433. md5: 27c72792ac98db20c3c5cbbc3768de40
  434. size: 1875432
  435. params:
  436. bohr.json:
  437. bohr_framework_version: 0.4.10
  438. outs:
  439. - path: labeled-datasets/bugginess/herzig.labeled.csv
  440. md5: 8c839c298f7c3e137b7f94859bc9571d
  441. size: 1539779
  442. smells_train_label_model:
  443. cmd: bohr porcelain train-label-model smells smells-train
  444. deps:
  445. - path: data/smells/test.csv
  446. md5: 0200db0eec17554a48a5b3a25719fd03
  447. size: 77607
  448. - path: generated/smells/heuristic_matrix_smells-test.pkl
  449. md5: b1fd4c196e2f967e9fca6b8f9c572272
  450. size: 4230
  451. - path: generated/smells/heuristic_matrix_smells-train.pkl
  452. md5: 2aa34f53b90128efb74215acdf7d995b
  453. size: 14312
  454. params:
  455. bohr.json:
  456. bohr_framework_version: 0.4.10
  457. outs:
  458. - path: generated/smells/label_model.pkl
  459. md5: 930d6534b7c29c17d248df2eeedf140f
  460. size: 4874
  461. - path: generated/smells/label_model_weights.csv
  462. md5: c1f7fbb15c07cadcf0a4e050f51fa89e
  463. size: 179
  464. - path: metrics/smells/label_model_metrics.json
  465. md5: 2040bf8b04f316ce72cfff68a4423b35
  466. size: 156
  467. smells_label_dataset_smells-train:
  468. cmd: bohr porcelain label-dataset smells smells-train
  469. deps:
  470. - path: data/smells/train.csv
  471. md5: 7fc9a7617e6f201523fba311317ba48f
  472. size: 296970
  473. - path: generated/smells/heuristic_matrix_smells-train.pkl
  474. md5: 2aa34f53b90128efb74215acdf7d995b
  475. size: 14312
  476. - path: generated/smells/label_model.pkl
  477. md5: 930d6534b7c29c17d248df2eeedf140f
  478. size: 4874
  479. params:
  480. bohr.json:
  481. bohr_framework_version: 0.4.10
  482. outs:
  483. - path: labeled-datasets/smells/smells-train.labeled.csv
  484. md5: b3433438369f2ca2276c22cff309631e
  485. size: 296121
  486. smells_label_dataset_smells-test:
  487. cmd: bohr porcelain label-dataset smells smells-test
  488. deps:
  489. - path: data/smells/test.csv
  490. md5: 0200db0eec17554a48a5b3a25719fd03
  491. size: 77607
  492. - path: generated/smells/heuristic_matrix_smells-test.pkl
  493. md5: b1fd4c196e2f967e9fca6b8f9c572272
  494. size: 4230
  495. - path: generated/smells/label_model.pkl
  496. md5: 930d6534b7c29c17d248df2eeedf140f
  497. size: 4874
  498. params:
  499. bohr.json:
  500. bohr_framework_version: 0.4.10
  501. outs:
  502. - path: labeled-datasets/smells/smells-test.labeled.csv
  503. md5: fe4a97ad13be96db8f076fda178bf984
  504. size: 77279
  505. bugginess_label_dataset_bugginess-train:
  506. cmd: bohr label-dataset bugginess bugginess-train
  507. deps:
  508. - path: data/bugginess_train
  509. md5: f7cbfc7a91dfeca3aff7b7d3b6d7ea72.dir
  510. size: 2489726547
  511. nfiles: 3
  512. - path: generated/bugginess/heuristic_matrix_bugginess-train.pkl
  513. md5: d9141b7bf8b3eb25cf3e90490acbb812
  514. size: 500879984
  515. - path: generated/bugginess/label_model.pkl
  516. md5: ce78684652e122b347fe0c7fc32ba035
  517. size: 1863238
  518. params:
  519. bohr.json:
  520. bohr_framework_version: 0.4.2
  521. outs:
  522. - path: labeled-datasets/bugginess-train.labeled.csv
  523. md5: bfe4ac306c08f7188e094acc20e1ff03
  524. size: 61623779
  525. bugginess_label_dataset_1151-commits:
  526. cmd: bohr porcelain label-dataset bugginess 1151-commits
  527. deps:
  528. - path: data/1151-commits.csv
  529. md5: dd000fe19ba4aac9efa3a3856e2acc5e
  530. size: 346306
  531. - path: generated/bugginess/heuristic_matrix_1151-commits.pkl
  532. md5: a0a1a7fbd7512b6cb31eb06d61c3bf9c
  533. size: 2801973
  534. - path: generated/bugginess/label_model.pkl
  535. md5: 27c72792ac98db20c3c5cbbc3768de40
  536. size: 1875432
  537. params:
  538. bohr.json:
  539. bohr_framework_version: 0.4.10
  540. outs:
  541. - path: labeled-datasets/bugginess/1151-commits.labeled.csv
  542. md5: c108a362f144ea7898c448510dd0f794
  543. size: 359783
  544. bugginess_label_dataset_berger:
  545. cmd: bohr porcelain label-dataset bugginess berger
  546. deps:
  547. - path: data/berger.csv
  548. md5: 126de41c9204a9e807e72406b1f9d631
  549. size: 62247
  550. - path: generated/bugginess/heuristic_matrix_berger.pkl
  551. md5: 1b5504acb50064f2971a00799380d780
  552. size: 920949
  553. - path: generated/bugginess/label_model.pkl
  554. md5: 27c72792ac98db20c3c5cbbc3768de40
  555. size: 1875432
  556. params:
  557. bohr.json:
  558. bohr_framework_version: 0.4.10
  559. outs:
  560. - path: labeled-datasets/bugginess/berger.labeled.csv
  561. md5: 83a5bae043e167792a2e721cc4f383ee
  562. size: 66776
  563. bugginess_transformer_train:
  564. cmd: bash classifiers/bugginess-transformer/train.sh labeled-data/bugginess.csv
  565. deps:
  566. - path: classifiers/bugginess-transformer/run.py
  567. md5: faf5ebb8f0348b28aa1205e2c56cd41c
  568. size: 12023
  569. - path: classifiers/bugginess-transformer/train.sh
  570. md5: 3a19e011c049042bbec7e8315e883c38
  571. size: 557
  572. - path: labeled-datasets/bugginess-train.labeled.csv
  573. md5: bfe4ac306c08f7188e094acc20e1ff03
  574. size: 61623779
  575. - path: requirements.txt
  576. md5: 29b4c5d66c523cec0712dbcdcced42bb
  577. size: 21
  578. outs:
  579. - path: models/config.json
  580. md5: 3effd3229ade2ed52eeb90d252790bf5
  581. size: 716
  582. - path: models/merges.txt
  583. md5: fb9c1e34b6999f3a062df6ed4a604957
  584. size: 458459
  585. - path: models/pytorch_model.bin
  586. md5: 40379a0207d19e2a24e116e941d7d675
  587. size: 333858922
  588. - path: models/special_tokens_map.json
  589. md5: 17bb9e090d1d3a775683aba3ba610591
  590. size: 239
  591. - path: models/tokenizer_config.json
  592. md5: e1a3e947aa301aadc524ee29f0dbcc39
  593. size: 1257
  594. - path: models/training_args.bin
  595. md5: 408c3f12467908cabb77ded5ce3490ed
  596. size: 2159
  597. - path: models/vocab.json
  598. md5: ca70df26ed267d27a9edde9c5341f17b
  599. size: 813062
  600. bugginess_transformer_test_herzig:
  601. cmd: bash classifiers/bugginess-transformer/test.sh data/bugginess/herzig.csv
  602. metrics/bugginess/transformer/herzig
  603. deps:
  604. - path: classifiers/bugginess-transformer/run.py
  605. md5: faf5ebb8f0348b28aa1205e2c56cd41c
  606. size: 12023
  607. - path: classifiers/bugginess-transformer/test.sh
  608. md5: 92c4020b4c026f9b85fd38ddee1bd528
  609. size: 313
  610. - path: data/herzig.csv
  611. md5: 279936268f488e1e613f81a537f29055
  612. size: 1458311
  613. - path: models/config.json
  614. md5: 3effd3229ade2ed52eeb90d252790bf5
  615. size: 716
  616. - path: models/merges.txt
  617. md5: fb9c1e34b6999f3a062df6ed4a604957
  618. size: 458459
  619. - path: models/pytorch_model.bin
  620. md5: 40379a0207d19e2a24e116e941d7d675
  621. size: 333858922
  622. - path: models/special_tokens_map.json
  623. md5: 17bb9e090d1d3a775683aba3ba610591
  624. size: 239
  625. - path: models/tokenizer_config.json
  626. md5: e1a3e947aa301aadc524ee29f0dbcc39
  627. size: 1257
  628. - path: models/training_args.bin
  629. md5: 408c3f12467908cabb77ded5ce3490ed
  630. size: 2159
  631. - path: models/vocab.json
  632. md5: ca70df26ed267d27a9edde9c5341f17b
  633. size: 813062
  634. - path: requirements.txt
  635. md5: 29b4c5d66c523cec0712dbcdcced42bb
  636. size: 21
  637. outs:
  638. - path: metrics/bugginess/transformer/herzig/eval_results.txt
  639. md5: bd726240700dac6e926d5532eb76c5c4
  640. size: 145
  641. bugginess_transformer_test_1151-commits:
  642. cmd: bash classifiers/bugginess-transformer/test.sh data/bugginess/1151-commits.csv
  643. metrics/bugginess/transformer/1151-commits
  644. deps:
  645. - path: classifiers/bugginess-transformer/run.py
  646. md5: faf5ebb8f0348b28aa1205e2c56cd41c
  647. size: 12023
  648. - path: classifiers/bugginess-transformer/test.sh
  649. md5: 92c4020b4c026f9b85fd38ddee1bd528
  650. size: 313
  651. - path: data/1151-commits.csv
  652. md5: 7b32f404edf5982eb4c5f51b956663c4
  653. size: 341651
  654. - path: models/config.json
  655. md5: 3effd3229ade2ed52eeb90d252790bf5
  656. size: 716
  657. - path: models/merges.txt
  658. md5: fb9c1e34b6999f3a062df6ed4a604957
  659. size: 458459
  660. - path: models/pytorch_model.bin
  661. md5: 40379a0207d19e2a24e116e941d7d675
  662. size: 333858922
  663. - path: models/special_tokens_map.json
  664. md5: 17bb9e090d1d3a775683aba3ba610591
  665. size: 239
  666. - path: models/tokenizer_config.json
  667. md5: e1a3e947aa301aadc524ee29f0dbcc39
  668. size: 1257
  669. - path: models/training_args.bin
  670. md5: 408c3f12467908cabb77ded5ce3490ed
  671. size: 2159
  672. - path: models/vocab.json
  673. md5: ca70df26ed267d27a9edde9c5341f17b
  674. size: 813062
  675. - path: requirements.txt
  676. md5: 29b4c5d66c523cec0712dbcdcced42bb
  677. size: 21
  678. outs:
  679. - path: metrics/bugginess/transformer/1151-commits/eval_results.txt
  680. md5: 52b1b36d2896195e78ca5b7d42de4839
  681. size: 146
  682. bugginess_transformer_label_1151-commits:
  683. cmd: bash classifiers/bugginess-transformer/label.sh data/bugginess/1151-commits.csv
  684. metrics/bugginess/transformer/1151-commits
  685. deps:
  686. - path: classifiers/bugginess-transformer/label.sh
  687. md5: ce2646a4233e68991b57bbf2c7404ace
  688. size: 320
  689. - path: classifiers/bugginess-transformer/run.py
  690. md5: faf5ebb8f0348b28aa1205e2c56cd41c
  691. size: 12023
  692. - path: data/1151-commits.csv
  693. md5: 7b32f404edf5982eb4c5f51b956663c4
  694. size: 341651
  695. - path: models/config.json
  696. md5: 3effd3229ade2ed52eeb90d252790bf5
  697. size: 716
  698. - path: models/merges.txt
  699. md5: fb9c1e34b6999f3a062df6ed4a604957
  700. size: 458459
  701. - path: models/pytorch_model.bin
  702. md5: 40379a0207d19e2a24e116e941d7d675
  703. size: 333858922
  704. - path: models/special_tokens_map.json
  705. md5: 17bb9e090d1d3a775683aba3ba610591
  706. size: 239
  707. - path: models/tokenizer_config.json
  708. md5: e1a3e947aa301aadc524ee29f0dbcc39
  709. size: 1257
  710. - path: models/training_args.bin
  711. md5: 408c3f12467908cabb77ded5ce3490ed
  712. size: 2159
  713. - path: models/vocab.json
  714. md5: ca70df26ed267d27a9edde9c5341f17b
  715. size: 813062
  716. - path: requirements.txt
  717. md5: 29b4c5d66c523cec0712dbcdcced42bb
  718. size: 21
  719. outs:
  720. - path: metrics/bugginess/transformer/1151-commits/assigned_labels.csv
  721. md5: 527025d5fa114a28fa55eed7f4c10801
  722. size: 6964
  723. bugginess_transformer_test_berger:
  724. cmd: bash classifiers/bugginess-transformer/test.sh data/bugginess/berger.csv
  725. metrics/bugginess/transformer/berger
  726. deps:
  727. - path: classifiers/bugginess-transformer/run.py
  728. md5: faf5ebb8f0348b28aa1205e2c56cd41c
  729. size: 12023
  730. - path: classifiers/bugginess-transformer/test.sh
  731. md5: 92c4020b4c026f9b85fd38ddee1bd528
  732. size: 313
  733. - path: data/berger.csv
  734. md5: 71b9738db6cb47e3af599da316e3b570
  735. size: 60847
  736. - path: models/config.json
  737. md5: 3effd3229ade2ed52eeb90d252790bf5
  738. size: 716
  739. - path: models/merges.txt
  740. md5: fb9c1e34b6999f3a062df6ed4a604957
  741. size: 458459
  742. - path: models/pytorch_model.bin
  743. md5: 40379a0207d19e2a24e116e941d7d675
  744. size: 333858922
  745. - path: models/special_tokens_map.json
  746. md5: 17bb9e090d1d3a775683aba3ba610591
  747. size: 239
  748. - path: models/tokenizer_config.json
  749. md5: e1a3e947aa301aadc524ee29f0dbcc39
  750. size: 1257
  751. - path: models/training_args.bin
  752. md5: 408c3f12467908cabb77ded5ce3490ed
  753. size: 2159
  754. - path: models/vocab.json
  755. md5: ca70df26ed267d27a9edde9c5341f17b
  756. size: 813062
  757. - path: requirements.txt
  758. md5: 29b4c5d66c523cec0712dbcdcced42bb
  759. size: 21
  760. outs:
  761. - path: metrics/bugginess/transformer/berger/eval_results.txt
  762. md5: 30da05826f977ee9b867560731091915
  763. size: 144
  764. bugginess_combine_labels_1151-commits:
  765. cmd: python classifiers/bugginess-transformer/combine_labels.py labeled-datasets/1151-commits.labeled.csv
  766. metrics/bugginess/transformer/1151-commits/assigned_labels.csv labeled-datasets/1151-commits.labeled.both.csv
  767. && echo "labeled-datasets/1151-commits.labeled.both.csv" >> .gitignore
  768. deps:
  769. - path: classifiers/bugginess-transformer/combine_labels.py
  770. md5: 85cef7e65682e381b5e746d5a0901ec2
  771. size: 720
  772. - path: labeled-datasets/1151-commits.labeled.csv
  773. md5: 70250f3b3489aed05065c35a0b859c00
  774. size: 359755
  775. - path: metrics/bugginess/transformer/1151-commits/assigned_labels.csv
  776. md5: 527025d5fa114a28fa55eed7f4c10801
  777. size: 6964
  778. outs:
  779. - path: labeled-datasets/1151-commits.labeled.both.csv
  780. md5: f4d459e7b167fb0197dc49483eb2d2af
  781. size: 366721
  782. preprocess_200k-commits:
  783. cmd: cp downloaded-data/200k-commits.csv data && echo "data/200k-commits.csv"
  784. >> .gitignore && git add .gitignore
  785. deps:
  786. - path: downloaded-data/200k-commits.csv
  787. md5: 6ce10284e630c44110ffc483a7bb33df
  788. size: 71402002
  789. outs:
  790. - path: data/200k-commits.csv
  791. md5: 6ce10284e630c44110ffc483a7bb33df
  792. size: 71402002
  793. preprocess_200k-commits-issues:
  794. cmd: cp downloaded-data/200k-commits-issues.csv data && echo "data/200k-commits-issues.csv"
  795. >> .gitignore && git add .gitignore
  796. deps:
  797. - path: downloaded-data/200k-commits-issues.csv
  798. md5: da4b0d654f7ce1469857b9171a9647aa
  799. size: 96908075
  800. outs:
  801. - path: data/200k-commits-issues.csv
  802. md5: da4b0d654f7ce1469857b9171a9647aa
  803. size: 96908075
  804. preprocess_200k-commits-files:
  805. cmd: 7z x downloaded-data/200k-commits-files.csv.7z -odata && echo "data/200k-commits-files.csv"
  806. >> .gitignore && git add .gitignore
  807. deps:
  808. - path: downloaded-data/200k-commits-files.csv.7z
  809. md5: 56697c21cfd7bba5d0f68dcd0fbd86f0
  810. size: 240190210
  811. outs:
  812. - path: data/200k-commits-files.csv
  813. md5: bc989c140c305bed62a5a8b161883d3b
  814. size: 2284439219
  815. bugginess_apply_heuristics__heuristics_bugginess__200k-commits:
  816. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.bugginess
  817. --dataset 200k-commits
  818. deps:
  819. - path: data/200k-commits-files.csv
  820. md5: bc989c140c305bed62a5a8b161883d3b
  821. size: 2284439219
  822. - path: data/200k-commits-issues.csv
  823. md5: da4b0d654f7ce1469857b9171a9647aa
  824. size: 96908075
  825. - path: data/200k-commits-manual-labels.csv
  826. md5: 447bf23d38df7f7e3007dc35f70cab91
  827. size: 1187
  828. - path: data/200k-commits.csv
  829. md5: 6ce10284e630c44110ffc483a7bb33df
  830. size: 71402002
  831. - path: heuristics/bugginess.py
  832. md5: a55520b99cbaad572a26886b52e74652
  833. size: 8900
  834. - path: labels.py
  835. md5: 1404972881fc94fbf1039b625bd4ccc0
  836. size: 1859
  837. params:
  838. bohr.json:
  839. bohr_framework_version: 0.4.10
  840. outs:
  841. - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_200k-commits.pkl
  842. md5: fa93c70bb58f111b85db389bd0c74ad1
  843. size: 498669340
  844. - path: metrics/bugginess/heuristics.bugginess/heuristic_metrics_200k-commits.json
  845. md5: 958e51f4d2c35451cf7575eaea15e7a6
  846. size: 32
  847. bugginess_label_dataset_200k-commits:
  848. cmd: bohr porcelain label-dataset bugginess 200k-commits
  849. deps:
  850. - path: data/200k-commits.csv
  851. md5: 6ce10284e630c44110ffc483a7bb33df
  852. size: 71402002
  853. - path: generated/bugginess/heuristic_matrix_200k-commits.pkl
  854. md5: e4dd0ab575a03ed8b0a142176ac9ecf7
  855. size: 500320716
  856. - path: generated/bugginess/label_model.pkl
  857. md5: 27c72792ac98db20c3c5cbbc3768de40
  858. size: 1875432
  859. params:
  860. bohr.json:
  861. bohr_framework_version: 0.4.10
  862. outs:
  863. - path: labeled-datasets/bugginess/200k-commits.labeled.csv
  864. md5: f014c7f09d9e2cb712d6fb232758ee9c
  865. size: 73212497
  866. preprocess_200k-commits-link-issues:
  867. cmd: cp downloaded-data/200k-commits-link-issues.csv data && echo "data/200k-commits-link-issues.csv"
  868. >> .gitignore && git add .gitignore
  869. deps:
  870. - path: downloaded-data/200k-commits-link-issues.csv
  871. md5: f75c8b5c7747abc8c2bd1b3b847dac18
  872. size: 3005661
  873. outs:
  874. - path: data/200k-commits-link-issues.csv
  875. md5: f75c8b5c7747abc8c2bd1b3b847dac18
  876. size: 3005661
  877. preprocess_200k-commits-manual-labels:
  878. cmd: cp downloaded-data/200k-commits-manual-labels.csv data && echo "data/200k-commits-manual-labels.csv"
  879. >> .gitignore && git add .gitignore
  880. deps:
  881. - path: downloaded-data/200k-commits-manual-labels.csv
  882. md5: 447bf23d38df7f7e3007dc35f70cab91
  883. size: 1187
  884. outs:
  885. - path: data/200k-commits-manual-labels.csv
  886. md5: 447bf23d38df7f7e3007dc35f70cab91
  887. size: 1187
  888. bugginess_apply_heuristics__heuristics_manuallabels__herzig:
  889. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.manuallabels
  890. --dataset herzig
  891. deps:
  892. - path: data/herzig.csv
  893. md5: 69a17c08643aed84b874384a2a57c7ed
  894. size: 1483281
  895. - path: heuristics/manuallabels.py
  896. md5: f338b2a285d76da97b3f53e9b167368a
  897. size: 278
  898. - path: labels.py
  899. md5: 1404972881fc94fbf1039b625bd4ccc0
  900. size: 1859
  901. params:
  902. bohr.json:
  903. bohr_framework_version: 0.4.10
  904. outs:
  905. - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_herzig.pkl
  906. md5: 8e22aefcc1b79b3bc8c00ebd4503dca3
  907. size: 42479
  908. - path: metrics/bugginess/heuristics.manuallabels/heuristic_metrics_herzig.json
  909. md5: 6881c30e66d12aec85d162df31e5e04d
  910. size: 58
  911. bugginess_apply_heuristics__heuristics_manuallabels__1151-commits:
  912. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.manuallabels
  913. --dataset 1151-commits
  914. deps:
  915. - path: data/1151-commits.csv
  916. md5: dd000fe19ba4aac9efa3a3856e2acc5e
  917. size: 346306
  918. - path: heuristics/manuallabels.py
  919. md5: f338b2a285d76da97b3f53e9b167368a
  920. size: 278
  921. - path: labels.py
  922. md5: 1404972881fc94fbf1039b625bd4ccc0
  923. size: 1859
  924. params:
  925. bohr.json:
  926. bohr_framework_version: 0.4.10
  927. outs:
  928. - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_1151-commits.pkl
  929. md5: 6586d8bf9010aff3be65327facde2edc
  930. size: 9975
  931. - path: metrics/bugginess/heuristics.manuallabels/heuristic_metrics_1151-commits.json
  932. md5: 452fdb0e2c252999419be5771a3774cc
  933. size: 58
  934. bugginess_apply_heuristics__heuristics_manuallabels__200k-commits:
  935. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.manuallabels
  936. --dataset 200k-commits
  937. deps:
  938. - path: data/200k-commits-files.csv
  939. md5: bc989c140c305bed62a5a8b161883d3b
  940. size: 2284439219
  941. - path: data/200k-commits-issues.csv
  942. md5: da4b0d654f7ce1469857b9171a9647aa
  943. size: 96908075
  944. - path: data/200k-commits-manual-labels.csv
  945. md5: 447bf23d38df7f7e3007dc35f70cab91
  946. size: 1187
  947. - path: data/200k-commits.csv
  948. md5: 6ce10284e630c44110ffc483a7bb33df
  949. size: 71402002
  950. - path: heuristics/manuallabels.py
  951. md5: f338b2a285d76da97b3f53e9b167368a
  952. size: 278
  953. - path: labels.py
  954. md5: 1404972881fc94fbf1039b625bd4ccc0
  955. size: 1859
  956. params:
  957. bohr.json:
  958. bohr_framework_version: 0.4.10
  959. outs:
  960. - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_200k-commits.pkl
  961. md5: 7febb1ec7e27a0b380e83a4c732946ab
  962. size: 1651964
  963. - path: metrics/bugginess/heuristics.manuallabels/heuristic_metrics_200k-commits.json
  964. md5: b550e0fca5c368f3221fc11db0ba8a3e
  965. size: 36
  966. bugginess_apply_heuristics__heuristics_manuallabels__berger:
  967. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.manuallabels
  968. --dataset berger
  969. deps:
  970. - path: data/berger.csv
  971. md5: 126de41c9204a9e807e72406b1f9d631
  972. size: 62247
  973. - path: heuristics/manuallabels.py
  974. md5: f338b2a285d76da97b3f53e9b167368a
  975. size: 278
  976. - path: labels.py
  977. md5: 1404972881fc94fbf1039b625bd4ccc0
  978. size: 1859
  979. params:
  980. bohr.json:
  981. bohr_framework_version: 0.4.10
  982. outs:
  983. - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_berger.pkl
  984. md5: e93c0bbc779f1cf928251c4613ef5cc6
  985. size: 3767
  986. - path: metrics/bugginess/heuristics.manuallabels/heuristic_metrics_berger.json
  987. md5: c2fefb5ddd23aee9e2705356b8d131c1
  988. size: 59
  989. preprocess_developer-labeled-commits:
  990. cmd: data-preprocessing/developer_labeled.py
  991. deps:
  992. - path: data-preprocessing/developer_labeled.py
  993. md5: d4c743a2b7723181d0284ea959fcbb99
  994. size: 1488
  995. - path: downloaded-data/developer-labeled-commits.zip
  996. md5: 691ca4dc06f945d2d3019e20ceb5cd5c
  997. size: 898671
  998. outs:
  999. - path: data/developer-labeled.csv
  1000. md5: db835bc072a7fbfb2fa947c1d5dbb1aa
  1001. size: 121817
  1002. bugginess_apply_heuristics__heuristics_bugginess__developer-labeled-commits:
  1003. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.bugginess
  1004. --dataset developer-labeled-commits
  1005. deps:
  1006. - path: data/developer-labeled.csv
  1007. md5: db835bc072a7fbfb2fa947c1d5dbb1aa
  1008. size: 121817
  1009. - path: heuristics/bugginess.py
  1010. md5: a55520b99cbaad572a26886b52e74652
  1011. size: 8900
  1012. - path: labels.py
  1013. md5: 1404972881fc94fbf1039b625bd4ccc0
  1014. size: 1859
  1015. params:
  1016. bohr.json:
  1017. bohr_framework_version: 0.4.10
  1018. outs:
  1019. - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_developer-labeled-commits.pkl
  1020. md5: fbbed1142254d6cd9b922290565a6313
  1021. size: 2348040
  1022. - path: metrics/bugginess/heuristics.bugginess/heuristic_metrics_developer-labeled-commits.json
  1023. md5: 936006aa024d80022004b24caf729c4d
  1024. size: 73
  1025. preprocess_fine-grained-refactorings:
  1026. cmd: data-preprocessing/fine-grained-refactorings.py
  1027. deps:
  1028. - path: data-preprocessing/fine-grained-refactorings.py
  1029. md5: 4e32da7ede9b4d34c07d3e54d2672f21
  1030. size: 962
  1031. - path: downloaded-data/fine-grained-refactorings.zip
  1032. md5: 15d8fedcdef6f7f75df5c687c78cd791
  1033. size: 119657
  1034. outs:
  1035. - path: data/fine-grained-refactorings.csv
  1036. md5: 4b2fed41042a5ceb2e95738f35650beb
  1037. size: 358328
  1038. bugginess_apply_heuristics__heuristics_bugginess__fine-grained-refactorings:
  1039. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.bugginess
  1040. --dataset fine-grained-refactorings
  1041. deps:
  1042. - path: data/fine-grained-refactorings.csv
  1043. md5: 4b2fed41042a5ceb2e95738f35650beb
  1044. size: 358328
  1045. - path: heuristics/bugginess.py
  1046. md5: a55520b99cbaad572a26886b52e74652
  1047. size: 8900
  1048. - path: labels.py
  1049. md5: 1404972881fc94fbf1039b625bd4ccc0
  1050. size: 1859
  1051. params:
  1052. bohr.json:
  1053. bohr_framework_version: 0.4.10
  1054. outs:
  1055. - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_fine-grained-refactorings.pkl
  1056. md5: b9c7d542ac5cc3e3aa9a2114d794a0f8
  1057. size: 3708248
  1058. - path: metrics/bugginess/heuristics.bugginess/heuristic_metrics_fine-grained-refactorings.json
  1059. md5: 164d888c48bfb076fa97782a2aa703a8
  1060. size: 31
  1061. bugginess_apply_heuristics__heuristics_manuallabels__developer-labeled-commits:
  1062. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.manuallabels
  1063. --dataset developer-labeled-commits
  1064. deps:
  1065. - path: data/developer-labeled.csv
  1066. md5: db835bc072a7fbfb2fa947c1d5dbb1aa
  1067. size: 121817
  1068. - path: heuristics/manuallabels.py
  1069. md5: f338b2a285d76da97b3f53e9b167368a
  1070. size: 278
  1071. - path: labels.py
  1072. md5: 1404972881fc94fbf1039b625bd4ccc0
  1073. size: 1859
  1074. params:
  1075. bohr.json:
  1076. bohr_framework_version: 0.4.10
  1077. outs:
  1078. - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_developer-labeled-commits.pkl
  1079. md5: ff48bc7a29abb4802a76e4d4882edd86
  1080. size: 8503
  1081. - path: metrics/bugginess/heuristics.manuallabels/heuristic_metrics_developer-labeled-commits.json
  1082. md5: 6736afb27f1813d39b8769670a0d29c7
  1083. size: 58
  1084. bugginess_apply_heuristics__heuristics_manuallabels__fine-grained-refactorings:
  1085. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.manuallabels
  1086. --dataset fine-grained-refactorings
  1087. deps:
  1088. - path: data/fine-grained-refactorings.csv
  1089. md5: 4b2fed41042a5ceb2e95738f35650beb
  1090. size: 358328
  1091. - path: heuristics/manuallabels.py
  1092. md5: f338b2a285d76da97b3f53e9b167368a
  1093. size: 278
  1094. - path: labels.py
  1095. md5: 1404972881fc94fbf1039b625bd4ccc0
  1096. size: 1859
  1097. params:
  1098. bohr.json:
  1099. bohr_framework_version: 0.4.10
  1100. outs:
  1101. - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_fine-grained-refactorings.pkl
  1102. md5: e1466fb17a3ce93547649ebd5d6e2210
  1103. size: 13007
  1104. - path: metrics/bugginess/heuristics.manuallabels/heuristic_metrics_fine-grained-refactorings.json
  1105. md5: 90747343662116155b09e2920b157b6c
  1106. size: 17
  1107. bugginess_label_dataset_developer-labeled-commits:
  1108. cmd: bohr porcelain label-dataset bugginess developer-labeled-commits
  1109. deps:
  1110. - path: data/developer-labeled.csv
  1111. md5: db835bc072a7fbfb2fa947c1d5dbb1aa
  1112. size: 121817
  1113. - path: generated/bugginess/heuristic_matrix_developer-labeled-commits.pkl
  1114. md5: 6ee04d7e28587b9134b30f6a3d9ef3d9
  1115. size: 2355957
  1116. - path: generated/bugginess/label_model.pkl
  1117. md5: 27c72792ac98db20c3c5cbbc3768de40
  1118. size: 1875432
  1119. params:
  1120. bohr.json:
  1121. bohr_framework_version: 0.4.10
  1122. outs:
  1123. - path: labeled-datasets/bugginess/developer-labeled-commits.labeled.csv
  1124. md5: 665a9e5156e4d587143863774d122288
  1125. size: 133310
  1126. bugginess_label_dataset_fine-grained-refactorings:
  1127. cmd: bohr porcelain label-dataset bugginess fine-grained-refactorings
  1128. deps:
  1129. - path: data/fine-grained-refactorings.csv
  1130. md5: 4b2fed41042a5ceb2e95738f35650beb
  1131. size: 358328
  1132. - path: generated/bugginess/heuristic_matrix_fine-grained-refactorings.pkl
  1133. md5: e8c17b404564787dc5bdffdff4b75b18
  1134. size: 3720669
  1135. - path: generated/bugginess/label_model.pkl
  1136. md5: 27c72792ac98db20c3c5cbbc3768de40
  1137. size: 1875432
  1138. params:
  1139. bohr.json:
  1140. bohr_framework_version: 0.4.10
  1141. outs:
  1142. - path: labeled-datasets/bugginess/fine-grained-refactorings.labeled.csv
  1143. md5: d2dac7e9a367318b529b69ce29fb2ec3
  1144. size: 375865
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...