dvc.lock 76 KB

schema: '2.0'
stages:
  preprocess_1151-commits:
    cmd: cp downloaded-data/1151-commits.csv data && echo "data/1151-commits.csv"
      >> .gitignore && git add .gitignore
    deps:
    - path: downloaded-data/1151-commits.csv
      md5: dd000fe19ba4aac9efa3a3856e2acc5e
      size: 346306
    outs:
    - path: data/1151-commits.csv
      md5: dd000fe19ba4aac9efa3a3856e2acc5e
      size: 346306
  preprocess_herzig:
    cmd: cp downloaded-data/herzig.csv data && echo "data/herzig.csv" >> .gitignore
      && git add .gitignore
    deps:
    - path: downloaded-data/herzig.csv
      md5: 69a17c08643aed84b874384a2a57c7ed
      size: 1483281
    outs:
    - path: data/herzig.csv
      md5: 69a17c08643aed84b874384a2a57c7ed
      size: 1483281
  preprocess_smells-test:
    cmd: data-preprocessing/smells.sh
    deps:
    - path: data-preprocessing/smells.sh
      md5: 1792bc2011c1aba4d51cdca74beee11e
      size: 2148
    - path: downloaded-data/smells-madeyski.csv
      md5: 3d60d277b9fa1306c05ccfdefe22e9d1
      size: 7513770
    outs:
    - path: data/smells/test.csv
      md5: 0200db0eec17554a48a5b3a25719fd03
      size: 77607
  parse_labels:
    cmd: bohr porcelain parse-labels
    deps:
    - path: labels
      md5: f54bde6a2ca21ad1a0ba7d4ff9a5b9a5.dir
      size: 619
      nfiles: 2
    outs:
    - path: labels.py
      md5: 1404972881fc94fbf1039b625bd4ccc0
      size: 1859
  smells_apply_heuristics__heuristics_smells__smells-test:
    cmd: bohr porcelain apply-heuristics smells --heuristic-group heuristics.smells
      --dataset smells-test
    deps:
    - path: data/smells/test.csv
      md5: 0200db0eec17554a48a5b3a25719fd03
      size: 77607
    - path: heuristics/smells.py
      md5: b1a5ed3a14eb9eae8924b8a43e3bc452
      size: 712
    - path: labels.py
      md5: 1404972881fc94fbf1039b625bd4ccc0
      size: 1859
    params:
      bohr.json:
        bohr_framework_version: 0.4.10
    outs:
    - path: generated/smells/heuristics.smells/heuristic_matrix_smells-test.pkl
      md5: a8924b3413a7258e4d22510885b4b886
      size: 4230
    - path: metrics/smells/heuristics.smells/heuristic_metrics_smells-test.json
      md5: 2a7a29682c91259100e8d087b3accb4c
      size: 72
  preprocess_smells-train:
    cmd: data-preprocessing/smells.sh
    deps:
    - path: data-preprocessing/smells.sh
      md5: 1792bc2011c1aba4d51cdca74beee11e
      size: 2148
    - path: downloaded-data/smells-madeyski.csv
      md5: 3d60d277b9fa1306c05ccfdefe22e9d1
      size: 7513770
    outs:
    - path: data/smells/train.csv
      md5: 7fc9a7617e6f201523fba311317ba48f
      size: 296970
  smells_apply_heuristics__heuristics_smells__smells-train:
    cmd: bohr porcelain apply-heuristics smells --heuristic-group heuristics.smells
      --dataset smells-train
    deps:
    - path: data/smells/train.csv
      md5: 7fc9a7617e6f201523fba311317ba48f
      size: 296970
    - path: heuristics/smells.py
      md5: b1a5ed3a14eb9eae8924b8a43e3bc452
      size: 712
    - path: labels.py
      md5: 1404972881fc94fbf1039b625bd4ccc0
      size: 1859
    params:
      bohr.json:
        bohr_framework_version: 0.4.10
    outs:
    - path: generated/smells/heuristics.smells/heuristic_matrix_smells-train.pkl
      md5: aa1818fe86e1d8d999ddb4a0f7c20ad9
      size: 14312
    - path: metrics/smells/heuristics.smells/heuristic_metrics_smells-train.json
      md5: 576a36563e43ce643bdd861930558217
      size: 32
  smells_combine_heuristics:
    cmd: bohr porcelain apply-heuristics smells
    deps:
    - path: generated/smells/heuristics.smells/heuristic_matrix_smells-test.pkl
      md5: a8924b3413a7258e4d22510885b4b886
      size: 4230
    - path: generated/smells/heuristics.smells/heuristic_matrix_smells-train.pkl
      md5: aa1818fe86e1d8d999ddb4a0f7c20ad9
      size: 14312
    params:
      bohr.json:
        bohr_framework_version: 0.4.10
    outs:
    - path: generated/smells/analysis_smells-test.csv
      md5: b3e54618879091f6014dbb90abffb50d
      size: 336
    - path: generated/smells/analysis_smells-train.csv
      md5: 276a5dd60ab0bfc2da69c0c8c1bf843a
      size: 250
    - path: generated/smells/heuristic_matrix_smells-test.pkl
      md5: b1fd4c196e2f967e9fca6b8f9c572272
      size: 4230
    - path: generated/smells/heuristic_matrix_smells-train.pkl
      md5: 2aa34f53b90128efb74215acdf7d995b
      size: 14312
    - path: metrics/smells/analysis_smells-test.json
      md5: 23e9b96402687da1a63c6765c925377e
      size: 1118
    - path: metrics/smells/analysis_smells-train.json
      md5: 88c0d6e20b8d101f5309d2f951cb6ee7
      size: 696
    - path: metrics/smells/heuristic_metrics_smells-test.json
      md5: 2a7a29682c91259100e8d087b3accb4c
      size: 72
    - path: metrics/smells/heuristic_metrics_smells-train.json
      md5: 576a36563e43ce643bdd861930558217
      size: 32
  bugginess_apply_heuristics__heuristics_bugginess__1151-commits:
    cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.bugginess
      --dataset 1151-commits
    deps:
    - path: data/1151-commits.csv
      md5: dd000fe19ba4aac9efa3a3856e2acc5e
      size: 346306
    - path: heuristics/bugginess.py
      md5: a55520b99cbaad572a26886b52e74652
      size: 8900
    - path: labels.py
      md5: 1404972881fc94fbf1039b625bd4ccc0
      size: 1859
    params:
      bohr.json:
        bohr_framework_version: 0.4.10
    outs:
    - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_1151-commits.pkl
      md5: 732df7ec50a8165d7e9a1e79415064b5
      size: 2792584
    - path: metrics/bugginess/heuristics.bugginess/heuristic_metrics_1151-commits.json
      md5: 590f784b4669d243ce5bff2a8d09345b
      size: 73
  preprocess_berger:
    cmd: cp downloaded-data/berger.csv data && echo "data/berger.csv" >> .gitignore
      && git add .gitignore
    deps:
    - path: downloaded-data/berger.csv
      md5: 126de41c9204a9e807e72406b1f9d631
      size: 62247
    outs:
    - path: data/berger.csv
      md5: 126de41c9204a9e807e72406b1f9d631
      size: 62247
  bugginess_apply_heuristics__heuristics_bugginess__berger:
    cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.bugginess
      --dataset berger
    deps:
    - path: data/berger.csv
      md5: 126de41c9204a9e807e72406b1f9d631
      size: 62247
    - path: heuristics/bugginess.py
      md5: a55520b99cbaad572a26886b52e74652
      size: 8900
    - path: labels.py
      md5: 1404972881fc94fbf1039b625bd4ccc0
      size: 1859
    params:
      bohr.json:
        bohr_framework_version: 0.4.10
    outs:
    - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_berger.pkl
      md5: e93deda462057f4a205960cf0d24d2ec
      size: 917768
    - path: metrics/bugginess/heuristics.bugginess/heuristic_metrics_berger.json
      md5: feb313c11f1c1afb0fc58ef5ad73ab6a
      size: 73
  preprocess_bugginess-train:
    cmd: 7z x downloaded-data/bugginess_train.7z -odata/bugginess_train && echo "data/bugginess_train"
      >> .gitignore && git add .gitignore
    deps:
    - path: downloaded-data/bugginess_train.7z
      md5: d4dc26c2b0f0704b1559f2c0ce6320d7
      size: 255969433
    outs:
    - path: data/bugginess_train
      md5: f7cbfc7a91dfeca3aff7b7d3b6d7ea72.dir
      size: 2489726547
      nfiles: 3
  bugginess_apply_heuristics__heuristics_bugginess__bugginess-train:
    cmd: bohr apply-heuristics bugginess --heuristic-group heuristics.bugginess --dataset
      bugginess-train
    deps:
    - path: data/bugginess_train
      md5: f7cbfc7a91dfeca3aff7b7d3b6d7ea72.dir
      size: 2489726547
      nfiles: 3
    - path: heuristics/bugginess.py
      md5: 9f9ea19cd5c53bbbd41f94cf7b8f3d14
      size: 2873
    - path: heuristics/keywords
      md5: b4e7587c1b8e4e1461685a305d48bd66.dir
      size: 1382
      nfiles: 5
    - path: labels.py
      md5: 4ad220b4c289b2d8597bd6431c6565a6
      size: 1707
    params:
      bohr.json:
        bohr_framework_version: 0.4.2
    outs:
    - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_bugginess-train.pkl
      md5: 5d1c71dcd36417356cabe2e340ca959d
      size: 500879984
    - path: metrics/bugginess/heuristics.bugginess/heuristic_metrics_bugginess-train.json
      md5: 9c903723760f0000193679b361437e41
      size: 32
  bugginess_apply_heuristics__heuristics_bugginess__herzig:
    cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.bugginess
      --dataset herzig
    deps:
    - path: data/herzig.csv
      md5: 69a17c08643aed84b874384a2a57c7ed
      size: 1483281
    - path: heuristics/bugginess.py
      md5: a55520b99cbaad572a26886b52e74652
      size: 8900
    - path: labels.py
      md5: 1404972881fc94fbf1039b625bd4ccc0
      size: 1859
    params:
      bohr.json:
        bohr_framework_version: 0.4.10
    outs:
    - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_herzig.pkl
      md5: 15bf08d69bf793c3978a2ffe1458c76d
      size: 12608792
    - path: metrics/bugginess/heuristics.bugginess/heuristic_metrics_herzig.json
      md5: 525291ad999e1bee97789045c1ae8333
      size: 72
  bugginess_combine_heuristics:
    cmd: bohr porcelain apply-heuristics bugginess
    deps:
    - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_1151-commits.pkl
      md5: 732df7ec50a8165d7e9a1e79415064b5
      size: 2792584
    - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_200k-commits.pkl
      md5: fa93c70bb58f111b85db389bd0c74ad1
      size: 498669340
    - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_berger.pkl
      md5: e93deda462057f4a205960cf0d24d2ec
      size: 917768
    - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_developer-labeled-commits.pkl
      md5: fbbed1142254d6cd9b922290565a6313
      size: 2348040
    - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_fine-grained-refactorings.pkl
      md5: b9c7d542ac5cc3e3aa9a2114d794a0f8
      size: 3708248
    - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_herzig.pkl
      md5: 15bf08d69bf793c3978a2ffe1458c76d
      size: 12608792
    - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_1151-commits.pkl
      md5: 6586d8bf9010aff3be65327facde2edc
      size: 9975
    - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_200k-commits.pkl
      md5: 7febb1ec7e27a0b380e83a4c732946ab
      size: 1651964
    - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_berger.pkl
      md5: e93c0bbc779f1cf928251c4613ef5cc6
      size: 3767
    - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_developer-labeled-commits.pkl
      md5: ff48bc7a29abb4802a76e4d4882edd86
      size: 8503
    - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_fine-grained-refactorings.pkl
      md5: e1466fb17a3ce93547649ebd5d6e2210
      size: 13007
    - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_herzig.pkl
      md5: 8e22aefcc1b79b3bc8c00ebd4503dca3
      size: 42479
    - path: generated/bugginess/heuristics.spacy_bugginess/heuristic_matrix_1151-commits.pkl
      md5: 7ce2e0846705b59c985ee6fdfec54688
      size: 9964
    - path: generated/bugginess/heuristics.spacy_bugginess/heuristic_matrix_200k-commits.pkl
      md5: 6d11a8ffe074e1b1cad1f68282b7a821
      size: 1651953
    - path: generated/bugginess/heuristics.spacy_bugginess/heuristic_matrix_berger.pkl
      md5: e6271ef87fe70277dc4075960da80517
      size: 3756
    - path: generated/bugginess/heuristics.spacy_bugginess/heuristic_matrix_developer-labeled-commits.pkl
      md5: d6cf7bd8b27954b36508d2c7c6f82e77
      size: 8492
    - path: generated/bugginess/heuristics.spacy_bugginess/heuristic_matrix_fine-grained-refactorings.pkl
      md5: b39471a6430f306caac12ef016e44e2f
      size: 12996
    - path: generated/bugginess/heuristics.spacy_bugginess/heuristic_matrix_herzig.pkl
      md5: 6be4da5cf3cc5c31485cd7be9454a502
      size: 42468
    params:
      bohr.json:
        bohr_framework_version: 0.4.10
    outs:
    - path: generated/bugginess/analysis_1151-commits.csv
      md5: 8ee4791afe967f2f28db8e0e67b8b967
      size: 24412
    - path: generated/bugginess/analysis_200k-commits.csv
      md5: b0e3db0d766038c1cb7cace4d8ee05b4
      size: 30511
    - path: generated/bugginess/analysis_berger.csv
      md5: 86d62319ebef9e88e8bfd3821fc5f474
      size: 21556
    - path: generated/bugginess/analysis_developer-labeled-commits.csv
      md5: 94eaa2979edd2f944d49f0eb6fa24955
      size: 22611
    - path: generated/bugginess/analysis_fine-grained-refactorings.csv
      md5: 11da2d246f2b90317a7a4e8a6dfd1fb2
      size: 21134
    - path: generated/bugginess/analysis_herzig.csv
      md5: 1382ca926c2a28dfea7f7784ace51a5f
      size: 25928
    - path: generated/bugginess/heuristic_matrix_1151-commits.pkl
      md5: 621742a07acb5b672e2e800d0f6c98c0
      size: 2811348
    - path: generated/bugginess/heuristic_matrix_200k-commits.pkl
      md5: 3a986cbe681dfc500c2947272a4162fa
      size: 501972078
    - path: generated/bugginess/heuristic_matrix_berger.pkl
      md5: ea85cfe60bdb18ef08697d3c379dad23
      size: 924116
    - path: generated/bugginess/heuristic_matrix_developer-labeled-commits.pkl
      md5: b71eb34d2f0264c9d9cfc373727f1d31
      size: 2363860
    - path: generated/bugginess/heuristic_matrix_fine-grained-refactorings.pkl
      md5: 675213aedbce431d370ce3e4568edb6b
      size: 3733076
    - path: generated/bugginess/heuristic_matrix_herzig.pkl
      md5: 1cad55ab5b748d4cdbd78967b865e5f4
      size: 12692573
    - path: metrics/bugginess/analysis_1151-commits.json
      md5: eb815cb21a83ac06f7bcd4f301631c61
      size: 108882
    - path: metrics/bugginess/analysis_200k-commits.json
      md5: 7f65be3984bdc43ab235ec2f7e27fa16
      size: 79011
    - path: metrics/bugginess/analysis_berger.json
      md5: 0d4220357cdbb454ac7eb476771f0b1f
      size: 105410
    - path: metrics/bugginess/analysis_developer-labeled-commits.json
      md5: bbc26523d0a4b791c72505ccd9211cd7
      size: 106487
    - path: metrics/bugginess/analysis_fine-grained-refactorings.json
      md5: dbb4be5a9d8a800cc4dc444f49960d05
      size: 66082
    - path: metrics/bugginess/analysis_herzig.json
      md5: 9d01e6f21e752973860df5e02feff387
      size: 110640
    - path: metrics/bugginess/heuristic_metrics_1151-commits.json
      md5: 09174a954ac3672c5c1993baaca0225a
      size: 72
    - path: metrics/bugginess/heuristic_metrics_200k-commits.json
      md5: 637a087041dadcb73db461d355a37401
      size: 32
    - path: metrics/bugginess/heuristic_metrics_berger.json
      md5: 5024c5ebf8dfc6d7e43d06453e132f3a
      size: 73
    - path: metrics/bugginess/heuristic_metrics_developer-labeled-commits.json
      md5: 02d126b04812b490e556da550b615860
      size: 72
    - path: metrics/bugginess/heuristic_metrics_fine-grained-refactorings.json
      md5: 668030d87b617448b3d76eefa599ce15
      size: 32
    - path: metrics/bugginess/heuristic_metrics_herzig.json
      md5: 788a616725c217b3c2b2c853453e9ab9
      size: 73
  bugginess_train_label_model:
    cmd: bohr porcelain train-label-model bugginess 200k-commits
    deps:
    - path: data/1151-commits.csv
      md5: dd000fe19ba4aac9efa3a3856e2acc5e
      size: 346306
    - path: data/berger.csv
      md5: 126de41c9204a9e807e72406b1f9d631
      size: 62247
    - path: data/developer-labeled.csv
      md5: db835bc072a7fbfb2fa947c1d5dbb1aa
      size: 121817
    - path: data/herzig.csv
      md5: 69a17c08643aed84b874384a2a57c7ed
      size: 1483281
    - path: generated/bugginess/heuristic_matrix_1151-commits.pkl
      md5: 621742a07acb5b672e2e800d0f6c98c0
      size: 2811348
    - path: generated/bugginess/heuristic_matrix_200k-commits.pkl
      md5: 3a986cbe681dfc500c2947272a4162fa
      size: 501972078
    - path: generated/bugginess/heuristic_matrix_berger.pkl
      md5: ea85cfe60bdb18ef08697d3c379dad23
      size: 924116
    - path: generated/bugginess/heuristic_matrix_developer-labeled-commits.pkl
      md5: b71eb34d2f0264c9d9cfc373727f1d31
      size: 2363860
    - path: generated/bugginess/heuristic_matrix_herzig.pkl
      md5: 1cad55ab5b748d4cdbd78967b865e5f4
      size: 12692573
    params:
      bohr.json:
        bohr_framework_version: 0.4.10
    outs:
    - path: generated/bugginess/label_model.pkl
      md5: d181a602cf14da0796fd6fa99aec6e06
      size: 1887708
    - path: generated/bugginess/label_model_weights.csv
      md5: 759620bb5537c183d19d8cd1841a356b
      size: 20564
    - path: metrics/bugginess/label_model_metrics.json
      md5: dd00d4921a0a16b6d10233bfd6d301d6
      size: 643
  bugginess_label_dataset_herzig:
    cmd: bohr porcelain label-dataset bugginess herzig
    deps:
    - path: data/herzig.csv
      md5: 69a17c08643aed84b874384a2a57c7ed
      size: 1483281
    - path: generated/bugginess/heuristic_matrix_herzig.pkl
      md5: 1cad55ab5b748d4cdbd78967b865e5f4
      size: 12692573
    - path: generated/bugginess/label_model.pkl
      md5: d181a602cf14da0796fd6fa99aec6e06
      size: 1887708
    params:
      bohr.json:
        bohr_framework_version: 0.4.10
    outs:
    - path: labeled-datasets/bugginess/herzig.labeled.csv
      md5: c23b134060d6e6659bfaac22ad691a90
      size: 1540065
  smells_train_label_model:
    cmd: bohr porcelain train-label-model smells smells-train
    deps:
    - path: data/smells/test.csv
      md5: 0200db0eec17554a48a5b3a25719fd03
      size: 77607
    - path: generated/smells/heuristic_matrix_smells-test.pkl
      md5: b1fd4c196e2f967e9fca6b8f9c572272
      size: 4230
    - path: generated/smells/heuristic_matrix_smells-train.pkl
      md5: 2aa34f53b90128efb74215acdf7d995b
      size: 14312
    params:
      bohr.json:
        bohr_framework_version: 0.4.10
    outs:
    - path: generated/smells/label_model.pkl
      md5: 930d6534b7c29c17d248df2eeedf140f
      size: 4874
    - path: generated/smells/label_model_weights.csv
      md5: c1f7fbb15c07cadcf0a4e050f51fa89e
      size: 179
    - path: metrics/smells/label_model_metrics.json
      md5: 2040bf8b04f316ce72cfff68a4423b35
      size: 156
  smells_label_dataset_smells-train:
    cmd: bohr porcelain label-dataset smells smells-train
    deps:
    - path: data/smells/train.csv
      md5: 7fc9a7617e6f201523fba311317ba48f
      size: 296970
    - path: generated/smells/heuristic_matrix_smells-train.pkl
      md5: 2aa34f53b90128efb74215acdf7d995b
      size: 14312
    - path: generated/smells/label_model.pkl
      md5: 930d6534b7c29c17d248df2eeedf140f
      size: 4874
    params:
      bohr.json:
        bohr_framework_version: 0.4.10
    outs:
    - path: labeled-datasets/smells/smells-train.labeled.csv
      md5: b3433438369f2ca2276c22cff309631e
      size: 296121
  smells_label_dataset_smells-test:
    cmd: bohr porcelain label-dataset smells smells-test
    deps:
    - path: data/smells/test.csv
      md5: 0200db0eec17554a48a5b3a25719fd03
      size: 77607
    - path: generated/smells/heuristic_matrix_smells-test.pkl
      md5: b1fd4c196e2f967e9fca6b8f9c572272
      size: 4230
    - path: generated/smells/label_model.pkl
      md5: 930d6534b7c29c17d248df2eeedf140f
      size: 4874
    params:
      bohr.json:
        bohr_framework_version: 0.4.10
    outs:
    - path: labeled-datasets/smells/smells-test.labeled.csv
      md5: fe4a97ad13be96db8f076fda178bf984
      size: 77279
  bugginess_label_dataset_bugginess-train:
    cmd: bohr label-dataset bugginess bugginess-train
    deps:
    - path: data/bugginess_train
      md5: f7cbfc7a91dfeca3aff7b7d3b6d7ea72.dir
      size: 2489726547
      nfiles: 3
    - path: generated/bugginess/heuristic_matrix_bugginess-train.pkl
      md5: d9141b7bf8b3eb25cf3e90490acbb812
      size: 500879984
    - path: generated/bugginess/label_model.pkl
      md5: ce78684652e122b347fe0c7fc32ba035
      size: 1863238
    params:
      bohr.json:
        bohr_framework_version: 0.4.2
    outs:
    - path: labeled-datasets/bugginess-train.labeled.csv
      md5: bfe4ac306c08f7188e094acc20e1ff03
      size: 61623779
  bugginess_label_dataset_1151-commits:
    cmd: bohr porcelain label-dataset bugginess 1151-commits
    deps:
    - path: data/1151-commits.csv
      md5: dd000fe19ba4aac9efa3a3856e2acc5e
      size: 346306
    - path: generated/bugginess/heuristic_matrix_1151-commits.pkl
      md5: 621742a07acb5b672e2e800d0f6c98c0
      size: 2811348
    - path: generated/bugginess/label_model.pkl
      md5: d181a602cf14da0796fd6fa99aec6e06
      size: 1887708
    params:
      bohr.json:
        bohr_framework_version: 0.4.10
    outs:
    - path: labeled-datasets/bugginess/1151-commits.labeled.csv
      md5: 7e0e9835e401bfa84c11ad0d7051d497
      size: 359699
  bugginess_label_dataset_berger:
    cmd: bohr porcelain label-dataset bugginess berger
    deps:
    - path: data/berger.csv
      md5: 126de41c9204a9e807e72406b1f9d631
      size: 62247
    - path: generated/bugginess/heuristic_matrix_berger.pkl
      md5: ea85cfe60bdb18ef08697d3c379dad23
      size: 924116
    - path: generated/bugginess/label_model.pkl
      md5: d181a602cf14da0796fd6fa99aec6e06
      size: 1887708
    params:
      bohr.json:
        bohr_framework_version: 0.4.10
    outs:
    - path: labeled-datasets/bugginess/berger.labeled.csv
      md5: 498ebdd9c39cb01c351c5805f2b448e1
      size: 66772
bugginess_transformer_train:
  cmd: bash classifiers/bugginess-transformer/train.sh labeled-data/bugginess.csv
  deps:
  - path: classifiers/bugginess-transformer/run.py
    md5: faf5ebb8f0348b28aa1205e2c56cd41c
    size: 12023
  - path: classifiers/bugginess-transformer/train.sh
    md5: 3a19e011c049042bbec7e8315e883c38
    size: 557
  - path: labeled-datasets/bugginess-train.labeled.csv
    md5: bfe4ac306c08f7188e094acc20e1ff03
    size: 61623779
  - path: requirements.txt
    md5: 29b4c5d66c523cec0712dbcdcced42bb
    size: 21
  outs:
  - path: models/config.json
    md5: 3effd3229ade2ed52eeb90d252790bf5
    size: 716
  - path: models/merges.txt
    md5: fb9c1e34b6999f3a062df6ed4a604957
    size: 458459
  - path: models/pytorch_model.bin
    md5: 40379a0207d19e2a24e116e941d7d675
    size: 333858922
  - path: models/special_tokens_map.json
    md5: 17bb9e090d1d3a775683aba3ba610591
    size: 239
  - path: models/tokenizer_config.json
    md5: e1a3e947aa301aadc524ee29f0dbcc39
    size: 1257
  - path: models/training_args.bin
    md5: 408c3f12467908cabb77ded5ce3490ed
    size: 2159
  - path: models/vocab.json
    md5: ca70df26ed267d27a9edde9c5341f17b
    size: 813062
bugginess_transformer_test_herzig:
  cmd: bash classifiers/bugginess-transformer/test.sh data/bugginess/herzig.csv
    metrics/bugginess/transformer/herzig
  deps:
  - path: classifiers/bugginess-transformer/run.py
    md5: faf5ebb8f0348b28aa1205e2c56cd41c
    size: 12023
  - path: classifiers/bugginess-transformer/test.sh
    md5: 92c4020b4c026f9b85fd38ddee1bd528
    size: 313
  - path: data/herzig.csv
    md5: 279936268f488e1e613f81a537f29055
    size: 1458311
  - path: models/config.json
    md5: 3effd3229ade2ed52eeb90d252790bf5
    size: 716
  - path: models/merges.txt
    md5: fb9c1e34b6999f3a062df6ed4a604957
    size: 458459
  - path: models/pytorch_model.bin
    md5: 40379a0207d19e2a24e116e941d7d675
    size: 333858922
  - path: models/special_tokens_map.json
    md5: 17bb9e090d1d3a775683aba3ba610591
    size: 239
  - path: models/tokenizer_config.json
    md5: e1a3e947aa301aadc524ee29f0dbcc39
    size: 1257
  - path: models/training_args.bin
    md5: 408c3f12467908cabb77ded5ce3490ed
    size: 2159
  - path: models/vocab.json
    md5: ca70df26ed267d27a9edde9c5341f17b
    size: 813062
  - path: requirements.txt
    md5: 29b4c5d66c523cec0712dbcdcced42bb
    size: 21
  outs:
  - path: metrics/bugginess/transformer/herzig/eval_results.txt
    md5: bd726240700dac6e926d5532eb76c5c4
    size: 145
bugginess_transformer_test_1151-commits:
  cmd: bash classifiers/bugginess-transformer/test.sh data/bugginess/1151-commits.csv
    metrics/bugginess/transformer/1151-commits
  deps:
  - path: classifiers/bugginess-transformer/run.py
    md5: faf5ebb8f0348b28aa1205e2c56cd41c
    size: 12023
  - path: classifiers/bugginess-transformer/test.sh
    md5: 92c4020b4c026f9b85fd38ddee1bd528
    size: 313
  - path: data/1151-commits.csv
    md5: 7b32f404edf5982eb4c5f51b956663c4
    size: 341651
  - path: models/config.json
    md5: 3effd3229ade2ed52eeb90d252790bf5
    size: 716
  - path: models/merges.txt
    md5: fb9c1e34b6999f3a062df6ed4a604957
    size: 458459
  - path: models/pytorch_model.bin
    md5: 40379a0207d19e2a24e116e941d7d675
    size: 333858922
  - path: models/special_tokens_map.json
    md5: 17bb9e090d1d3a775683aba3ba610591
    size: 239
  - path: models/tokenizer_config.json
    md5: e1a3e947aa301aadc524ee29f0dbcc39
    size: 1257
  - path: models/training_args.bin
    md5: 408c3f12467908cabb77ded5ce3490ed
    size: 2159
  - path: models/vocab.json
    md5: ca70df26ed267d27a9edde9c5341f17b
    size: 813062
  - path: requirements.txt
    md5: 29b4c5d66c523cec0712dbcdcced42bb
    size: 21
  outs:
  - path: metrics/bugginess/transformer/1151-commits/eval_results.txt
    md5: 52b1b36d2896195e78ca5b7d42de4839
    size: 146
bugginess_transformer_label_1151-commits:
  cmd: bash classifiers/bugginess-transformer/label.sh data/bugginess/1151-commits.csv
    metrics/bugginess/transformer/1151-commits
  deps:
  - path: classifiers/bugginess-transformer/label.sh
    md5: ce2646a4233e68991b57bbf2c7404ace
    size: 320
  - path: classifiers/bugginess-transformer/run.py
    md5: faf5ebb8f0348b28aa1205e2c56cd41c
    size: 12023
  - path: data/1151-commits.csv
    md5: 7b32f404edf5982eb4c5f51b956663c4
    size: 341651
  - path: models/config.json
    md5: 3effd3229ade2ed52eeb90d252790bf5
    size: 716
  - path: models/merges.txt
    md5: fb9c1e34b6999f3a062df6ed4a604957
    size: 458459
  - path: models/pytorch_model.bin
    md5: 40379a0207d19e2a24e116e941d7d675
    size: 333858922
  - path: models/special_tokens_map.json
    md5: 17bb9e090d1d3a775683aba3ba610591
    size: 239
  - path: models/tokenizer_config.json
    md5: e1a3e947aa301aadc524ee29f0dbcc39
    size: 1257
  - path: models/training_args.bin
    md5: 408c3f12467908cabb77ded5ce3490ed
    size: 2159
  - path: models/vocab.json
    md5: ca70df26ed267d27a9edde9c5341f17b
    size: 813062
  - path: requirements.txt
    md5: 29b4c5d66c523cec0712dbcdcced42bb
    size: 21
  outs:
  - path: metrics/bugginess/transformer/1151-commits/assigned_labels.csv
    md5: 527025d5fa114a28fa55eed7f4c10801
    size: 6964
bugginess_transformer_test_berger:
  cmd: bash classifiers/bugginess-transformer/test.sh data/bugginess/berger.csv
    metrics/bugginess/transformer/berger
  deps:
  - path: classifiers/bugginess-transformer/run.py
    md5: faf5ebb8f0348b28aa1205e2c56cd41c
    size: 12023
  - path: classifiers/bugginess-transformer/test.sh
    md5: 92c4020b4c026f9b85fd38ddee1bd528
    size: 313
  - path: data/berger.csv
    md5: 71b9738db6cb47e3af599da316e3b570
    size: 60847
  - path: models/config.json
    md5: 3effd3229ade2ed52eeb90d252790bf5
    size: 716
  - path: models/merges.txt
    md5: fb9c1e34b6999f3a062df6ed4a604957
    size: 458459
  - path: models/pytorch_model.bin
    md5: 40379a0207d19e2a24e116e941d7d675
    size: 333858922
  - path: models/special_tokens_map.json
    md5: 17bb9e090d1d3a775683aba3ba610591
    size: 239
  - path: models/tokenizer_config.json
    md5: e1a3e947aa301aadc524ee29f0dbcc39
    size: 1257
  - path: models/training_args.bin
    md5: 408c3f12467908cabb77ded5ce3490ed
    size: 2159
  - path: models/vocab.json
    md5: ca70df26ed267d27a9edde9c5341f17b
    size: 813062
  - path: requirements.txt
    md5: 29b4c5d66c523cec0712dbcdcced42bb
    size: 21
  outs:
  - path: metrics/bugginess/transformer/berger/eval_results.txt
    md5: 30da05826f977ee9b867560731091915
    size: 144
bugginess_combine_labels_1151-commits:
  cmd: python classifiers/bugginess-transformer/combine_labels.py labeled-datasets/1151-commits.labeled.csv
    metrics/bugginess/transformer/1151-commits/assigned_labels.csv labeled-datasets/1151-commits.labeled.both.csv
    && echo "labeled-datasets/1151-commits.labeled.both.csv" >> .gitignore
  deps:
  - path: classifiers/bugginess-transformer/combine_labels.py
    md5: 85cef7e65682e381b5e746d5a0901ec2
    size: 720
  - path: labeled-datasets/1151-commits.labeled.csv
    md5: 70250f3b3489aed05065c35a0b859c00
    size: 359755
  - path: metrics/bugginess/transformer/1151-commits/assigned_labels.csv
    md5: 527025d5fa114a28fa55eed7f4c10801
    size: 6964
  outs:
  - path: labeled-datasets/1151-commits.labeled.both.csv
    md5: f4d459e7b167fb0197dc49483eb2d2af
    size: 366721
preprocess_200k-commits:
  cmd: cp downloaded-data/200k-commits.csv data && echo "data/200k-commits.csv"
    >> .gitignore && git add .gitignore
  deps:
  - path: downloaded-data/200k-commits.csv
    md5: 6ce10284e630c44110ffc483a7bb33df
    size: 71402002
  outs:
  - path: data/200k-commits.csv
    md5: 6ce10284e630c44110ffc483a7bb33df
    size: 71402002
preprocess_200k-commits-issues:
  cmd: cp downloaded-data/200k-commits-issues.csv data && echo "data/200k-commits-issues.csv"
    >> .gitignore && git add .gitignore
  deps:
  - path: downloaded-data/200k-commits-issues.csv
    md5: da4b0d654f7ce1469857b9171a9647aa
    size: 96908075
  outs:
  - path: data/200k-commits-issues.csv
    md5: da4b0d654f7ce1469857b9171a9647aa
    size: 96908075
preprocess_200k-commits-files:
  cmd: 7z x downloaded-data/200k-commits-files.csv.7z -odata && echo "data/200k-commits-files.csv"
    >> .gitignore && git add .gitignore
  deps:
  - path: downloaded-data/200k-commits-files.csv.7z
    md5: 56697c21cfd7bba5d0f68dcd0fbd86f0
    size: 240190210
  outs:
  - path: data/200k-commits-files.csv
    md5: bc989c140c305bed62a5a8b161883d3b
    size: 2284439219
bugginess_apply_heuristics__heuristics_bugginess__200k-commits:
  cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.bugginess
    --dataset 200k-commits
  deps:
  - path: data/200k-commits-files.csv
    md5: bc989c140c305bed62a5a8b161883d3b
    size: 2284439219
  - path: data/200k-commits-issues.csv
    md5: da4b0d654f7ce1469857b9171a9647aa
    size: 96908075
  - path: data/200k-commits-manual-labels.csv
    md5: 447bf23d38df7f7e3007dc35f70cab91
    size: 1187
  - path: data/200k-commits.csv
    md5: 6ce10284e630c44110ffc483a7bb33df
    size: 71402002
  - path: heuristics/bugginess.py
    md5: a55520b99cbaad572a26886b52e74652
    size: 8900
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_200k-commits.pkl
    md5: fa93c70bb58f111b85db389bd0c74ad1
    size: 498669340
  - path: metrics/bugginess/heuristics.bugginess/heuristic_metrics_200k-commits.json
    md5: 958e51f4d2c35451cf7575eaea15e7a6
    size: 32
bugginess_label_dataset_200k-commits:
  cmd: bohr porcelain label-dataset bugginess 200k-commits
  deps:
  - path: data/200k-commits.csv
    md5: 6ce10284e630c44110ffc483a7bb33df
    size: 71402002
  - path: generated/bugginess/heuristic_matrix_200k-commits.pkl
    md5: 3a986cbe681dfc500c2947272a4162fa
    size: 501972078
  - path: generated/bugginess/label_model.pkl
    md5: d181a602cf14da0796fd6fa99aec6e06
    size: 1887708
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: labeled-datasets/bugginess/200k-commits.labeled.csv
    md5: afca892c48bd2b677ce4317ffdeabad9
    size: 73227493
preprocess_200k-commits-link-issues:
  cmd: cp downloaded-data/200k-commits-link-issues.csv data && echo "data/200k-commits-link-issues.csv"
    >> .gitignore && git add .gitignore
  deps:
  - path: downloaded-data/200k-commits-link-issues.csv
    md5: f75c8b5c7747abc8c2bd1b3b847dac18
    size: 3005661
  outs:
  - path: data/200k-commits-link-issues.csv
    md5: f75c8b5c7747abc8c2bd1b3b847dac18
    size: 3005661
preprocess_200k-commits-manual-labels:
  cmd: cp downloaded-data/200k-commits-manual-labels.csv data && echo "data/200k-commits-manual-labels.csv"
    >> .gitignore && git add .gitignore
  deps:
  - path: downloaded-data/200k-commits-manual-labels.csv
    md5: 447bf23d38df7f7e3007dc35f70cab91
    size: 1187
  outs:
  - path: data/200k-commits-manual-labels.csv
    md5: 447bf23d38df7f7e3007dc35f70cab91
    size: 1187
bugginess_apply_heuristics__heuristics_manuallabels__herzig:
  cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.manuallabels
    --dataset herzig
  deps:
  - path: data/herzig.csv
    md5: 69a17c08643aed84b874384a2a57c7ed
    size: 1483281
  - path: heuristics/manuallabels.py
    md5: f338b2a285d76da97b3f53e9b167368a
    size: 278
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_herzig.pkl
    md5: 8e22aefcc1b79b3bc8c00ebd4503dca3
    size: 42479
  - path: metrics/bugginess/heuristics.manuallabels/heuristic_metrics_herzig.json
    md5: 6881c30e66d12aec85d162df31e5e04d
    size: 58
bugginess_apply_heuristics__heuristics_manuallabels__1151-commits:
  cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.manuallabels
    --dataset 1151-commits
  deps:
  - path: data/1151-commits.csv
    md5: dd000fe19ba4aac9efa3a3856e2acc5e
    size: 346306
  - path: heuristics/manuallabels.py
    md5: f338b2a285d76da97b3f53e9b167368a
    size: 278
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_1151-commits.pkl
    md5: 6586d8bf9010aff3be65327facde2edc
    size: 9975
  - path: metrics/bugginess/heuristics.manuallabels/heuristic_metrics_1151-commits.json
    md5: 452fdb0e2c252999419be5771a3774cc
    size: 58
bugginess_apply_heuristics__heuristics_manuallabels__200k-commits:
  cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.manuallabels
    --dataset 200k-commits
  deps:
  - path: data/200k-commits-files.csv
    md5: bc989c140c305bed62a5a8b161883d3b
    size: 2284439219
  - path: data/200k-commits-issues.csv
    md5: da4b0d654f7ce1469857b9171a9647aa
    size: 96908075
  - path: data/200k-commits-manual-labels.csv
    md5: 447bf23d38df7f7e3007dc35f70cab91
    size: 1187
  - path: data/200k-commits.csv
    md5: 6ce10284e630c44110ffc483a7bb33df
    size: 71402002
  - path: heuristics/manuallabels.py
    md5: f338b2a285d76da97b3f53e9b167368a
    size: 278
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_200k-commits.pkl
    md5: 7febb1ec7e27a0b380e83a4c732946ab
    size: 1651964
  - path: metrics/bugginess/heuristics.manuallabels/heuristic_metrics_200k-commits.json
    md5: b550e0fca5c368f3221fc11db0ba8a3e
    size: 36
bugginess_apply_heuristics__heuristics_manuallabels__berger:
  cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.manuallabels
    --dataset berger
  deps:
  - path: data/berger.csv
    md5: 126de41c9204a9e807e72406b1f9d631
    size: 62247
  - path: heuristics/manuallabels.py
    md5: f338b2a285d76da97b3f53e9b167368a
    size: 278
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_berger.pkl
    md5: e93c0bbc779f1cf928251c4613ef5cc6
    size: 3767
  - path: metrics/bugginess/heuristics.manuallabels/heuristic_metrics_berger.json
    md5: c2fefb5ddd23aee9e2705356b8d131c1
    size: 59
preprocess_developer-labeled-commits:
  cmd: data-preprocessing/developer_labeled.py
  deps:
  - path: data-preprocessing/developer_labeled.py
    md5: d4c743a2b7723181d0284ea959fcbb99
    size: 1488
  - path: downloaded-data/developer-labeled-commits.zip
    md5: 691ca4dc06f945d2d3019e20ceb5cd5c
    size: 898671
  outs:
  - path: data/developer-labeled.csv
    md5: db835bc072a7fbfb2fa947c1d5dbb1aa
    size: 121817
bugginess_apply_heuristics__heuristics_bugginess__developer-labeled-commits:
  cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.bugginess
    --dataset developer-labeled-commits
  deps:
  - path: data/developer-labeled.csv
    md5: db835bc072a7fbfb2fa947c1d5dbb1aa
    size: 121817
  - path: heuristics/bugginess.py
    md5: a55520b99cbaad572a26886b52e74652
    size: 8900
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_developer-labeled-commits.pkl
    md5: fbbed1142254d6cd9b922290565a6313
    size: 2348040
  - path: metrics/bugginess/heuristics.bugginess/heuristic_metrics_developer-labeled-commits.json
    md5: 936006aa024d80022004b24caf729c4d
    size: 73
preprocess_fine-grained-refactorings:
  cmd: data-preprocessing/fine-grained-refactorings.py
  deps:
  - path: data-preprocessing/fine-grained-refactorings.py
    md5: 4e32da7ede9b4d34c07d3e54d2672f21
    size: 962
  - path: downloaded-data/fine-grained-refactorings.zip
    md5: 15d8fedcdef6f7f75df5c687c78cd791
    size: 119657
  outs:
  - path: data/fine-grained-refactorings.csv
    md5: 4b2fed41042a5ceb2e95738f35650beb
    size: 358328
bugginess_apply_heuristics__heuristics_bugginess__fine-grained-refactorings:
  cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.bugginess
    --dataset fine-grained-refactorings
  deps:
  - path: data/fine-grained-refactorings.csv
    md5: 4b2fed41042a5ceb2e95738f35650beb
    size: 358328
  - path: heuristics/bugginess.py
    md5: a55520b99cbaad572a26886b52e74652
    size: 8900
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/bugginess/heuristics.bugginess/heuristic_matrix_fine-grained-refactorings.pkl
    md5: b9c7d542ac5cc3e3aa9a2114d794a0f8
    size: 3708248
  - path: metrics/bugginess/heuristics.bugginess/heuristic_metrics_fine-grained-refactorings.json
    md5: 164d888c48bfb076fa97782a2aa703a8
    size: 31
bugginess_apply_heuristics__heuristics_manuallabels__developer-labeled-commits:
  cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.manuallabels
    --dataset developer-labeled-commits
  deps:
  - path: data/developer-labeled.csv
    md5: db835bc072a7fbfb2fa947c1d5dbb1aa
    size: 121817
  - path: heuristics/manuallabels.py
    md5: f338b2a285d76da97b3f53e9b167368a
    size: 278
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_developer-labeled-commits.pkl
    md5: ff48bc7a29abb4802a76e4d4882edd86
    size: 8503
  - path: metrics/bugginess/heuristics.manuallabels/heuristic_metrics_developer-labeled-commits.json
    md5: 6736afb27f1813d39b8769670a0d29c7
    size: 58
bugginess_apply_heuristics__heuristics_manuallabels__fine-grained-refactorings:
  cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.manuallabels
    --dataset fine-grained-refactorings
  deps:
  - path: data/fine-grained-refactorings.csv
    md5: 4b2fed41042a5ceb2e95738f35650beb
    size: 358328
  - path: heuristics/manuallabels.py
    md5: f338b2a285d76da97b3f53e9b167368a
    size: 278
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/bugginess/heuristics.manuallabels/heuristic_matrix_fine-grained-refactorings.pkl
    md5: e1466fb17a3ce93547649ebd5d6e2210
    size: 13007
  - path: metrics/bugginess/heuristics.manuallabels/heuristic_metrics_fine-grained-refactorings.json
    md5: 90747343662116155b09e2920b157b6c
    size: 17
bugginess_label_dataset_developer-labeled-commits:
  cmd: bohr porcelain label-dataset bugginess developer-labeled-commits
  deps:
  - path: data/developer-labeled.csv
    md5: db835bc072a7fbfb2fa947c1d5dbb1aa
    size: 121817
  - path: generated/bugginess/heuristic_matrix_developer-labeled-commits.pkl
    md5: b71eb34d2f0264c9d9cfc373727f1d31
    size: 2363860
  - path: generated/bugginess/label_model.pkl
    md5: d181a602cf14da0796fd6fa99aec6e06
    size: 1887708
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: labeled-datasets/bugginess/developer-labeled-commits.labeled.csv
    md5: c1393fcf2bc3153e8205c463820fe23f
    size: 133282
bugginess_label_dataset_fine-grained-refactorings:
  cmd: bohr porcelain label-dataset bugginess fine-grained-refactorings
  deps:
  - path: data/fine-grained-refactorings.csv
    md5: 4b2fed41042a5ceb2e95738f35650beb
    size: 358328
  - path: generated/bugginess/heuristic_matrix_fine-grained-refactorings.pkl
    md5: 675213aedbce431d370ce3e4568edb6b
    size: 3733076
  - path: generated/bugginess/label_model.pkl
    md5: d181a602cf14da0796fd6fa99aec6e06
    size: 1887708
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: labeled-datasets/bugginess/fine-grained-refactorings.labeled.csv
    md5: 626b6640b9accba529b052f7229e6025
    size: 375834
spacy_bugginess_apply_heuristics__heuristics_bugginess__fine-grained-refactorings:
  cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.bugginess
    --dataset fine-grained-refactorings
  deps:
  - path: data/fine-grained-refactorings.csv
    md5: 4b2fed41042a5ceb2e95738f35650beb
    size: 358328
  - path: heuristics/bugginess.py
    md5: a55520b99cbaad572a26886b52e74652
    size: 8900
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/spacy_bugginess/heuristics.bugginess/heuristic_matrix_fine-grained-refactorings.pkl
    md5: b9c7d542ac5cc3e3aa9a2114d794a0f8
    size: 3708248
  - path: metrics/spacy_bugginess/heuristics.bugginess/heuristic_metrics_fine-grained-refactorings.json
    md5: 164d888c48bfb076fa97782a2aa703a8
    size: 31
spacy_bugginess_apply_heuristics__heuristics_manuallabels__developer-labeled-commits:
  cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.manuallabels
    --dataset developer-labeled-commits
  deps:
  - path: data/developer-labeled.csv
    md5: db835bc072a7fbfb2fa947c1d5dbb1aa
    size: 121817
  - path: heuristics/manuallabels.py
    md5: f338b2a285d76da97b3f53e9b167368a
    size: 278
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/spacy_bugginess/heuristics.manuallabels/heuristic_matrix_developer-labeled-commits.pkl
    md5: ff48bc7a29abb4802a76e4d4882edd86
    size: 8503
  - path: metrics/spacy_bugginess/heuristics.manuallabels/heuristic_metrics_developer-labeled-commits.json
    md5: 6736afb27f1813d39b8769670a0d29c7
    size: 58
spacy_bugginess_apply_heuristics__heuristics_manuallabels__fine-grained-refactorings:
  cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.manuallabels
    --dataset fine-grained-refactorings
  deps:
  - path: data/fine-grained-refactorings.csv
    md5: 4b2fed41042a5ceb2e95738f35650beb
    size: 358328
  - path: heuristics/manuallabels.py
    md5: f338b2a285d76da97b3f53e9b167368a
    size: 278
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/spacy_bugginess/heuristics.manuallabels/heuristic_matrix_fine-grained-refactorings.pkl
    md5: e1466fb17a3ce93547649ebd5d6e2210
    size: 13007
  - path: metrics/spacy_bugginess/heuristics.manuallabels/heuristic_metrics_fine-grained-refactorings.json
    md5: 90747343662116155b09e2920b157b6c
    size: 17
spacy_bugginess_apply_heuristics__heuristics_bugginess__developer-labeled-commits:
  cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.bugginess
    --dataset developer-labeled-commits
  deps:
  - path: data/developer-labeled.csv
    md5: db835bc072a7fbfb2fa947c1d5dbb1aa
    size: 121817
  - path: heuristics/bugginess.py
    md5: a55520b99cbaad572a26886b52e74652
    size: 8900
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/spacy_bugginess/heuristics.bugginess/heuristic_matrix_developer-labeled-commits.pkl
    md5: fbbed1142254d6cd9b922290565a6313
    size: 2348040
  - path: metrics/spacy_bugginess/heuristics.bugginess/heuristic_metrics_developer-labeled-commits.json
    md5: 936006aa024d80022004b24caf729c4d
    size: 73
spacy_bugginess_apply_heuristics__heuristics_manuallabels__herzig:
  cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.manuallabels
    --dataset herzig
  deps:
  - path: data/herzig.csv
    md5: 69a17c08643aed84b874384a2a57c7ed
    size: 1483281
  - path: heuristics/manuallabels.py
    md5: f338b2a285d76da97b3f53e9b167368a
    size: 278
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/spacy_bugginess/heuristics.manuallabels/heuristic_matrix_herzig.pkl
    md5: 8e22aefcc1b79b3bc8c00ebd4503dca3
    size: 42479
  - path: metrics/spacy_bugginess/heuristics.manuallabels/heuristic_metrics_herzig.json
    md5: 6881c30e66d12aec85d162df31e5e04d
    size: 58
spacy_bugginess_apply_heuristics__heuristics_manuallabels__200k-commits:
  cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.manuallabels
    --dataset 200k-commits
  deps:
  - path: data/200k-commits-files.csv
    md5: bc989c140c305bed62a5a8b161883d3b
    size: 2284439219
  - path: data/200k-commits-issues.csv
    md5: da4b0d654f7ce1469857b9171a9647aa
    size: 96908075
  - path: data/200k-commits-manual-labels.csv
    md5: 447bf23d38df7f7e3007dc35f70cab91
    size: 1187
  - path: data/200k-commits.csv
    md5: 6ce10284e630c44110ffc483a7bb33df
    size: 71402002
  - path: heuristics/manuallabels.py
    md5: f338b2a285d76da97b3f53e9b167368a
    size: 278
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/spacy_bugginess/heuristics.manuallabels/heuristic_matrix_200k-commits.pkl
    md5: 7febb1ec7e27a0b380e83a4c732946ab
    size: 1651964
  - path: metrics/spacy_bugginess/heuristics.manuallabels/heuristic_metrics_200k-commits.json
    md5: b550e0fca5c368f3221fc11db0ba8a3e
    size: 36
spacy_bugginess_apply_heuristics__heuristics_bugginess__1151-commits:
  cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.bugginess
    --dataset 1151-commits
  deps:
  - path: data/1151-commits.csv
    md5: dd000fe19ba4aac9efa3a3856e2acc5e
    size: 346306
  - path: heuristics/bugginess.py
    md5: a55520b99cbaad572a26886b52e74652
    size: 8900
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/spacy_bugginess/heuristics.bugginess/heuristic_matrix_1151-commits.pkl
    md5: 732df7ec50a8165d7e9a1e79415064b5
    size: 2792584
  - path: metrics/spacy_bugginess/heuristics.bugginess/heuristic_metrics_1151-commits.json
    md5: 590f784b4669d243ce5bff2a8d09345b
    size: 73
spacy_bugginess_apply_heuristics__heuristics_bugginess__berger:
  cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.bugginess
    --dataset berger
  deps:
  - path: data/berger.csv
    md5: 126de41c9204a9e807e72406b1f9d631
    size: 62247
  - path: heuristics/bugginess.py
    md5: a55520b99cbaad572a26886b52e74652
    size: 8900
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/spacy_bugginess/heuristics.bugginess/heuristic_matrix_berger.pkl
    md5: e93deda462057f4a205960cf0d24d2ec
    size: 917768
  - path: metrics/spacy_bugginess/heuristics.bugginess/heuristic_metrics_berger.json
    md5: feb313c11f1c1afb0fc58ef5ad73ab6a
    size: 73
spacy_bugginess_apply_heuristics__heuristics_bugginess__200k-commits:
  cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.bugginess
    --dataset 200k-commits
  deps:
  - path: data/200k-commits-files.csv
    md5: bc989c140c305bed62a5a8b161883d3b
    size: 2284439219
  - path: data/200k-commits-issues.csv
    md5: da4b0d654f7ce1469857b9171a9647aa
    size: 96908075
  - path: data/200k-commits-manual-labels.csv
    md5: 447bf23d38df7f7e3007dc35f70cab91
    size: 1187
  - path: data/200k-commits.csv
    md5: 6ce10284e630c44110ffc483a7bb33df
    size: 71402002
  - path: heuristics/bugginess.py
    md5: a55520b99cbaad572a26886b52e74652
    size: 8900
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/spacy_bugginess/heuristics.bugginess/heuristic_matrix_200k-commits.pkl
    md5: fa93c70bb58f111b85db389bd0c74ad1
    size: 498669340
  - path: metrics/spacy_bugginess/heuristics.bugginess/heuristic_metrics_200k-commits.json
    md5: 958e51f4d2c35451cf7575eaea15e7a6
    size: 32
spacy_bugginess_apply_heuristics__heuristics_bugginess__herzig:
  cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.bugginess
    --dataset herzig
  deps:
  - path: data/herzig.csv
    md5: 69a17c08643aed84b874384a2a57c7ed
    size: 1483281
  - path: heuristics/bugginess.py
    md5: a55520b99cbaad572a26886b52e74652
    size: 8900
  - path: labels.py
    md5: 1404972881fc94fbf1039b625bd4ccc0
    size: 1859
  params:
    bohr.json:
      bohr_framework_version: 0.4.10
  outs:
  - path: generated/spacy_bugginess/heuristics.bugginess/heuristic_matrix_herzig.pkl
    md5: 15bf08d69bf793c3978a2ffe1458c76d
    size: 12608792
  - path: metrics/spacy_bugginess/heuristics.bugginess/heuristic_metrics_herzig.json
    md5: 525291ad999e1bee97789045c1ae8333
    size: 72
  1411. spacy_bugginess_apply_heuristics__heuristics_manuallabels__1151-commits:
  1412. cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.manuallabels
  1413. --dataset 1151-commits
  1414. deps:
  1415. - path: data/1151-commits.csv
  1416. md5: dd000fe19ba4aac9efa3a3856e2acc5e
  1417. size: 346306
  1418. - path: heuristics/manuallabels.py
  1419. md5: f338b2a285d76da97b3f53e9b167368a
  1420. size: 278
  1421. - path: labels.py
  1422. md5: 1404972881fc94fbf1039b625bd4ccc0
  1423. size: 1859
  1424. params:
  1425. bohr.json:
  1426. bohr_framework_version: 0.4.10
  1427. outs:
  1428. - path: generated/spacy_bugginess/heuristics.manuallabels/heuristic_matrix_1151-commits.pkl
  1429. md5: 6586d8bf9010aff3be65327facde2edc
  1430. size: 9975
  1431. - path: metrics/spacy_bugginess/heuristics.manuallabels/heuristic_metrics_1151-commits.json
  1432. md5: 452fdb0e2c252999419be5771a3774cc
  1433. size: 58
  1434. spacy_bugginess_apply_heuristics__heuristics_manuallabels__berger:
  1435. cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.manuallabels
  1436. --dataset berger
  1437. deps:
  1438. - path: data/berger.csv
  1439. md5: 126de41c9204a9e807e72406b1f9d631
  1440. size: 62247
  1441. - path: heuristics/manuallabels.py
  1442. md5: f338b2a285d76da97b3f53e9b167368a
  1443. size: 278
  1444. - path: labels.py
  1445. md5: 1404972881fc94fbf1039b625bd4ccc0
  1446. size: 1859
  1447. params:
  1448. bohr.json:
  1449. bohr_framework_version: 0.4.10
  1450. outs:
  1451. - path: generated/spacy_bugginess/heuristics.manuallabels/heuristic_matrix_berger.pkl
  1452. md5: e93c0bbc779f1cf928251c4613ef5cc6
  1453. size: 3767
  1454. - path: metrics/spacy_bugginess/heuristics.manuallabels/heuristic_metrics_berger.json
  1455. md5: c2fefb5ddd23aee9e2705356b8d131c1
  1456. size: 59
  1457. spacy_bugginess_combine_heuristics:
  1458. cmd: bohr porcelain apply-heuristics spacy_bugginess
  1459. deps:
  1460. - path: generated/spacy_bugginess/heuristics.bugginess/heuristic_matrix_1151-commits.pkl
  1461. md5: 732df7ec50a8165d7e9a1e79415064b5
  1462. size: 2792584
  1463. - path: generated/spacy_bugginess/heuristics.bugginess/heuristic_matrix_200k-commits.pkl
  1464. md5: fa93c70bb58f111b85db389bd0c74ad1
  1465. size: 498669340
  1466. - path: generated/spacy_bugginess/heuristics.bugginess/heuristic_matrix_berger.pkl
  1467. md5: e93deda462057f4a205960cf0d24d2ec
  1468. size: 917768
  1469. - path: generated/spacy_bugginess/heuristics.bugginess/heuristic_matrix_developer-labeled-commits.pkl
  1470. md5: fbbed1142254d6cd9b922290565a6313
  1471. size: 2348040
  1472. - path: generated/spacy_bugginess/heuristics.bugginess/heuristic_matrix_fine-grained-refactorings.pkl
  1473. md5: b9c7d542ac5cc3e3aa9a2114d794a0f8
  1474. size: 3708248
  1475. - path: generated/spacy_bugginess/heuristics.bugginess/heuristic_matrix_herzig.pkl
  1476. md5: 15bf08d69bf793c3978a2ffe1458c76d
  1477. size: 12608792
  1478. - path: generated/spacy_bugginess/heuristics.manuallabels/heuristic_matrix_1151-commits.pkl
  1479. md5: 6586d8bf9010aff3be65327facde2edc
  1480. size: 9975
  1481. - path: generated/spacy_bugginess/heuristics.manuallabels/heuristic_matrix_200k-commits.pkl
  1482. md5: 7febb1ec7e27a0b380e83a4c732946ab
  1483. size: 1651964
  1484. - path: generated/spacy_bugginess/heuristics.manuallabels/heuristic_matrix_berger.pkl
  1485. md5: e93c0bbc779f1cf928251c4613ef5cc6
  1486. size: 3767
  1487. - path: generated/spacy_bugginess/heuristics.manuallabels/heuristic_matrix_developer-labeled-commits.pkl
  1488. md5: ff48bc7a29abb4802a76e4d4882edd86
  1489. size: 8503
  1490. - path: generated/spacy_bugginess/heuristics.manuallabels/heuristic_matrix_fine-grained-refactorings.pkl
  1491. md5: e1466fb17a3ce93547649ebd5d6e2210
  1492. size: 13007
  1493. - path: generated/spacy_bugginess/heuristics.manuallabels/heuristic_matrix_herzig.pkl
  1494. md5: 8e22aefcc1b79b3bc8c00ebd4503dca3
  1495. size: 42479
  1496. - path: generated/spacy_bugginess/heuristics.spacy_bugginess/heuristic_matrix_1151-commits.pkl
  1497. md5: 7ce2e0846705b59c985ee6fdfec54688
  1498. size: 9964
  1499. - path: generated/spacy_bugginess/heuristics.spacy_bugginess/heuristic_matrix_200k-commits.pkl
  1500. md5: 6d11a8ffe074e1b1cad1f68282b7a821
  1501. size: 1651953
  1502. - path: generated/spacy_bugginess/heuristics.spacy_bugginess/heuristic_matrix_berger.pkl
  1503. md5: e6271ef87fe70277dc4075960da80517
  1504. size: 3756
  1505. - path: generated/spacy_bugginess/heuristics.spacy_bugginess/heuristic_matrix_developer-labeled-commits.pkl
  1506. md5: d6cf7bd8b27954b36508d2c7c6f82e77
  1507. size: 8492
  1508. - path: generated/spacy_bugginess/heuristics.spacy_bugginess/heuristic_matrix_fine-grained-refactorings.pkl
  1509. md5: b39471a6430f306caac12ef016e44e2f
  1510. size: 12996
  1511. - path: generated/spacy_bugginess/heuristics.spacy_bugginess/heuristic_matrix_herzig.pkl
  1512. md5: 6be4da5cf3cc5c31485cd7be9454a502
  1513. size: 42468
  1514. params:
  1515. bohr.json:
  1516. bohr_framework_version: 0.4.10
  1517. outs:
  1518. - path: generated/spacy_bugginess/analysis_1151-commits.csv
  1519. md5: 8ee4791afe967f2f28db8e0e67b8b967
  1520. size: 24412
  1521. - path: generated/spacy_bugginess/analysis_200k-commits.csv
  1522. md5: b0e3db0d766038c1cb7cace4d8ee05b4
  1523. size: 30511
  1524. - path: generated/spacy_bugginess/analysis_berger.csv
  1525. md5: 86d62319ebef9e88e8bfd3821fc5f474
  1526. size: 21556
  1527. - path: generated/spacy_bugginess/analysis_developer-labeled-commits.csv
  1528. md5: 94eaa2979edd2f944d49f0eb6fa24955
  1529. size: 22611
  1530. - path: generated/spacy_bugginess/analysis_fine-grained-refactorings.csv
  1531. md5: 11da2d246f2b90317a7a4e8a6dfd1fb2
  1532. size: 21134
  1533. - path: generated/spacy_bugginess/analysis_herzig.csv
  1534. md5: 1382ca926c2a28dfea7f7784ace51a5f
  1535. size: 25928
  1536. - path: generated/spacy_bugginess/heuristic_matrix_1151-commits.pkl
  1537. md5: 621742a07acb5b672e2e800d0f6c98c0
  1538. size: 2811348
  1539. - path: generated/spacy_bugginess/heuristic_matrix_200k-commits.pkl
  1540. md5: 3a986cbe681dfc500c2947272a4162fa
  1541. size: 501972078
  1542. - path: generated/spacy_bugginess/heuristic_matrix_berger.pkl
  1543. md5: ea85cfe60bdb18ef08697d3c379dad23
  1544. size: 924116
  1545. - path: generated/spacy_bugginess/heuristic_matrix_developer-labeled-commits.pkl
  1546. md5: b71eb34d2f0264c9d9cfc373727f1d31
  1547. size: 2363860
  1548. - path: generated/spacy_bugginess/heuristic_matrix_fine-grained-refactorings.pkl
  1549. md5: 675213aedbce431d370ce3e4568edb6b
  1550. size: 3733076
  1551. - path: generated/spacy_bugginess/heuristic_matrix_herzig.pkl
  1552. md5: 1cad55ab5b748d4cdbd78967b865e5f4
  1553. size: 12692573
  1554. - path: metrics/spacy_bugginess/analysis_1151-commits.json
  1555. md5: eb815cb21a83ac06f7bcd4f301631c61
  1556. size: 108882
  1557. - path: metrics/spacy_bugginess/analysis_200k-commits.json
  1558. md5: 7f65be3984bdc43ab235ec2f7e27fa16
  1559. size: 79011
  1560. - path: metrics/spacy_bugginess/analysis_berger.json
  1561. md5: 0d4220357cdbb454ac7eb476771f0b1f
  1562. size: 105410
  1563. - path: metrics/spacy_bugginess/analysis_developer-labeled-commits.json
  1564. md5: bbc26523d0a4b791c72505ccd9211cd7
  1565. size: 106487
  1566. - path: metrics/spacy_bugginess/analysis_fine-grained-refactorings.json
  1567. md5: dbb4be5a9d8a800cc4dc444f49960d05
  1568. size: 66082
  1569. - path: metrics/spacy_bugginess/analysis_herzig.json
  1570. md5: 9d01e6f21e752973860df5e02feff387
  1571. size: 110640
  1572. - path: metrics/spacy_bugginess/heuristic_metrics_1151-commits.json
  1573. md5: 09174a954ac3672c5c1993baaca0225a
  1574. size: 72
  1575. - path: metrics/spacy_bugginess/heuristic_metrics_200k-commits.json
  1576. md5: 637a087041dadcb73db461d355a37401
  1577. size: 32
  1578. - path: metrics/spacy_bugginess/heuristic_metrics_berger.json
  1579. md5: 5024c5ebf8dfc6d7e43d06453e132f3a
  1580. size: 73
  1581. - path: metrics/spacy_bugginess/heuristic_metrics_developer-labeled-commits.json
  1582. md5: 02d126b04812b490e556da550b615860
  1583. size: 72
  1584. - path: metrics/spacy_bugginess/heuristic_metrics_fine-grained-refactorings.json
  1585. md5: 668030d87b617448b3d76eefa599ce15
  1586. size: 32
  1587. - path: metrics/spacy_bugginess/heuristic_metrics_herzig.json
  1588. md5: 788a616725c217b3c2b2c853453e9ab9
  1589. size: 73
  1590. spacy_bugginess_train_label_model:
  1591. cmd: bohr porcelain train-label-model spacy_bugginess 200k-commits
  1592. deps:
  1593. - path: data/1151-commits.csv
  1594. md5: dd000fe19ba4aac9efa3a3856e2acc5e
  1595. size: 346306
  1596. - path: data/berger.csv
  1597. md5: 126de41c9204a9e807e72406b1f9d631
  1598. size: 62247
  1599. - path: data/developer-labeled.csv
  1600. md5: db835bc072a7fbfb2fa947c1d5dbb1aa
  1601. size: 121817
  1602. - path: data/herzig.csv
  1603. md5: 69a17c08643aed84b874384a2a57c7ed
  1604. size: 1483281
  1605. - path: generated/spacy_bugginess/heuristic_matrix_1151-commits.pkl
  1606. md5: 621742a07acb5b672e2e800d0f6c98c0
  1607. size: 2811348
  1608. - path: generated/spacy_bugginess/heuristic_matrix_200k-commits.pkl
  1609. md5: 3a986cbe681dfc500c2947272a4162fa
  1610. size: 501972078
  1611. - path: generated/spacy_bugginess/heuristic_matrix_berger.pkl
  1612. md5: ea85cfe60bdb18ef08697d3c379dad23
  1613. size: 924116
  1614. - path: generated/spacy_bugginess/heuristic_matrix_developer-labeled-commits.pkl
  1615. md5: b71eb34d2f0264c9d9cfc373727f1d31
  1616. size: 2363860
  1617. - path: generated/spacy_bugginess/heuristic_matrix_herzig.pkl
  1618. md5: 1cad55ab5b748d4cdbd78967b865e5f4
  1619. size: 12692573
  1620. params:
  1621. bohr.json:
  1622. bohr_framework_version: 0.4.10
  1623. outs:
  1624. - path: generated/spacy_bugginess/label_model.pkl
  1625. md5: 3c2bd4c79be1517f9f535d3bba2209d8
  1626. size: 1887708
  1627. - path: generated/spacy_bugginess/label_model_weights.csv
  1628. md5: 759620bb5537c183d19d8cd1841a356b
  1629. size: 20564
  1630. - path: metrics/spacy_bugginess/label_model_metrics.json
  1631. md5: dd00d4921a0a16b6d10233bfd6d301d6
  1632. size: 643
  1633. spacy_bugginess_apply_heuristics__heuristics_spacy_bugginess__1151-commits:
  1634. cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.spacy_bugginess
  1635. --dataset 1151-commits
  1636. deps:
  1637. - path: data/1151-commits.csv
  1638. md5: dd000fe19ba4aac9efa3a3856e2acc5e
  1639. size: 346306
  1640. - path: heuristics/spacy_bugginess.py
  1641. md5: 50f5460261d9730da2dc894987b06d01
  1642. size: 2182
  1643. - path: labels.py
  1644. md5: 1404972881fc94fbf1039b625bd4ccc0
  1645. size: 1859
  1646. params:
  1647. bohr.json:
  1648. bohr_framework_version: 0.4.10
  1649. outs:
  1650. - path: generated/spacy_bugginess/heuristics.spacy_bugginess/heuristic_matrix_1151-commits.pkl
  1651. md5: 7ce2e0846705b59c985ee6fdfec54688
  1652. size: 9964
  1653. - path: metrics/spacy_bugginess/heuristics.spacy_bugginess/heuristic_metrics_1151-commits.json
  1654. md5: 0e762c55e17b7fa3021f08d056f21d91
  1655. size: 74
  1656. spacy_bugginess_apply_heuristics__heuristics_spacy_bugginess__berger:
  1657. cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.spacy_bugginess
  1658. --dataset berger
  1659. deps:
  1660. - path: data/berger.csv
  1661. md5: 126de41c9204a9e807e72406b1f9d631
  1662. size: 62247
  1663. - path: heuristics/spacy_bugginess.py
  1664. md5: 50f5460261d9730da2dc894987b06d01
  1665. size: 2182
  1666. - path: labels.py
  1667. md5: 1404972881fc94fbf1039b625bd4ccc0
  1668. size: 1859
  1669. params:
  1670. bohr.json:
  1671. bohr_framework_version: 0.4.10
  1672. outs:
  1673. - path: generated/spacy_bugginess/heuristics.spacy_bugginess/heuristic_matrix_berger.pkl
  1674. md5: e6271ef87fe70277dc4075960da80517
  1675. size: 3756
  1676. - path: metrics/spacy_bugginess/heuristics.spacy_bugginess/heuristic_metrics_berger.json
  1677. md5: b1e3bb8f69724c21f03b27079215a275
  1678. size: 60
  1679. spacy_bugginess_apply_heuristics__heuristics_spacy_bugginess__200k-commits:
  1680. cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.spacy_bugginess
  1681. --dataset 200k-commits
  1682. deps:
  1683. - path: data/200k-commits-files.csv
  1684. md5: bc989c140c305bed62a5a8b161883d3b
  1685. size: 2284439219
  1686. - path: data/200k-commits-issues.csv
  1687. md5: da4b0d654f7ce1469857b9171a9647aa
  1688. size: 96908075
  1689. - path: data/200k-commits-manual-labels.csv
  1690. md5: 447bf23d38df7f7e3007dc35f70cab91
  1691. size: 1187
  1692. - path: data/200k-commits.csv
  1693. md5: 6ce10284e630c44110ffc483a7bb33df
  1694. size: 71402002
  1695. - path: heuristics/spacy_bugginess.py
  1696. md5: 50f5460261d9730da2dc894987b06d01
  1697. size: 2182
  1698. - path: labels.py
  1699. md5: 1404972881fc94fbf1039b625bd4ccc0
  1700. size: 1859
  1701. params:
  1702. bohr.json:
  1703. bohr_framework_version: 0.4.10
  1704. outs:
  1705. - path: generated/spacy_bugginess/heuristics.spacy_bugginess/heuristic_matrix_200k-commits.pkl
  1706. md5: 6d11a8ffe074e1b1cad1f68282b7a821
  1707. size: 1651953
  1708. - path: metrics/spacy_bugginess/heuristics.spacy_bugginess/heuristic_metrics_200k-commits.json
  1709. md5: 407f76665fb6f1d936772ba68a29d967
  1710. size: 32
  1711. spacy_bugginess_apply_heuristics__heuristics_spacy_bugginess__developer-labeled-commits:
  1712. cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.spacy_bugginess
  1713. --dataset developer-labeled-commits
  1714. deps:
  1715. - path: data/developer-labeled.csv
  1716. md5: db835bc072a7fbfb2fa947c1d5dbb1aa
  1717. size: 121817
  1718. - path: heuristics/spacy_bugginess.py
  1719. md5: 50f5460261d9730da2dc894987b06d01
  1720. size: 2182
  1721. - path: labels.py
  1722. md5: 1404972881fc94fbf1039b625bd4ccc0
  1723. size: 1859
  1724. params:
  1725. bohr.json:
  1726. bohr_framework_version: 0.4.10
  1727. outs:
  1728. - path: generated/spacy_bugginess/heuristics.spacy_bugginess/heuristic_matrix_developer-labeled-commits.pkl
  1729. md5: d6cf7bd8b27954b36508d2c7c6f82e77
  1730. size: 8492
  1731. - path: metrics/spacy_bugginess/heuristics.spacy_bugginess/heuristic_metrics_developer-labeled-commits.json
  1732. md5: 69e42befbee70dc12a86b507501242c2
  1733. size: 73
  1734. spacy_bugginess_apply_heuristics__heuristics_spacy_bugginess__fine-grained-refactorings:
  1735. cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.spacy_bugginess
  1736. --dataset fine-grained-refactorings
  1737. deps:
  1738. - path: data/fine-grained-refactorings.csv
  1739. md5: 4b2fed41042a5ceb2e95738f35650beb
  1740. size: 358328
  1741. - path: heuristics/spacy_bugginess.py
  1742. md5: 50f5460261d9730da2dc894987b06d01
  1743. size: 2182
  1744. - path: labels.py
  1745. md5: 1404972881fc94fbf1039b625bd4ccc0
  1746. size: 1859
  1747. params:
  1748. bohr.json:
  1749. bohr_framework_version: 0.4.10
  1750. outs:
  1751. - path: generated/spacy_bugginess/heuristics.spacy_bugginess/heuristic_matrix_fine-grained-refactorings.pkl
  1752. md5: b39471a6430f306caac12ef016e44e2f
  1753. size: 12996
  1754. - path: metrics/spacy_bugginess/heuristics.spacy_bugginess/heuristic_metrics_fine-grained-refactorings.json
  1755. md5: f51c715360d7d297b4479335a8795b20
  1756. size: 32
  1757. spacy_bugginess_apply_heuristics__heuristics_spacy_bugginess__herzig:
  1758. cmd: bohr porcelain apply-heuristics spacy_bugginess --heuristic-group heuristics.spacy_bugginess
  1759. --dataset herzig
  1760. deps:
  1761. - path: data/herzig.csv
  1762. md5: 69a17c08643aed84b874384a2a57c7ed
  1763. size: 1483281
  1764. - path: heuristics/spacy_bugginess.py
  1765. md5: 50f5460261d9730da2dc894987b06d01
  1766. size: 2182
  1767. - path: labels.py
  1768. md5: 1404972881fc94fbf1039b625bd4ccc0
  1769. size: 1859
  1770. params:
  1771. bohr.json:
  1772. bohr_framework_version: 0.4.10
  1773. outs:
  1774. - path: generated/spacy_bugginess/heuristics.spacy_bugginess/heuristic_matrix_herzig.pkl
  1775. md5: 6be4da5cf3cc5c31485cd7be9454a502
  1776. size: 42468
  1777. - path: metrics/spacy_bugginess/heuristics.spacy_bugginess/heuristic_metrics_herzig.json
  1778. md5: 59d2e6ad532ac50f87ff36e7f3af8613
  1779. size: 74
  1780. spacy_bugginess_label_dataset_200k-commits:
  1781. cmd: bohr porcelain label-dataset spacy_bugginess 200k-commits
  1782. deps:
  1783. - path: data/200k-commits.csv
  1784. md5: 6ce10284e630c44110ffc483a7bb33df
  1785. size: 71402002
  1786. - path: generated/spacy_bugginess/heuristic_matrix_200k-commits.pkl
  1787. md5: 3a986cbe681dfc500c2947272a4162fa
  1788. size: 501972078
  1789. - path: generated/spacy_bugginess/label_model.pkl
  1790. md5: 3c2bd4c79be1517f9f535d3bba2209d8
  1791. size: 1887708
  1792. params:
  1793. bohr.json:
  1794. bohr_framework_version: 0.4.10
  1795. outs:
  1796. - path: labeled-datasets/spacy_bugginess/200k-commits.labeled.csv
  1797. md5: afca892c48bd2b677ce4317ffdeabad9
  1798. size: 73227493
  1799. spacy_bugginess_label_dataset_1151-commits:
  1800. cmd: bohr porcelain label-dataset spacy_bugginess 1151-commits
  1801. deps:
  1802. - path: data/1151-commits.csv
  1803. md5: dd000fe19ba4aac9efa3a3856e2acc5e
  1804. size: 346306
  1805. - path: generated/spacy_bugginess/heuristic_matrix_1151-commits.pkl
  1806. md5: 621742a07acb5b672e2e800d0f6c98c0
  1807. size: 2811348
  1808. - path: generated/spacy_bugginess/label_model.pkl
  1809. md5: 3c2bd4c79be1517f9f535d3bba2209d8
  1810. size: 1887708
  1811. params:
  1812. bohr.json:
  1813. bohr_framework_version: 0.4.10
  1814. outs:
  1815. - path: labeled-datasets/spacy_bugginess/1151-commits.labeled.csv
  1816. md5: 7e0e9835e401bfa84c11ad0d7051d497
  1817. size: 359699
  1818. spacy_bugginess_label_dataset_herzig:
  1819. cmd: bohr porcelain label-dataset spacy_bugginess herzig
  1820. deps:
  1821. - path: data/herzig.csv
  1822. md5: 69a17c08643aed84b874384a2a57c7ed
  1823. size: 1483281
  1824. - path: generated/spacy_bugginess/heuristic_matrix_herzig.pkl
  1825. md5: 1cad55ab5b748d4cdbd78967b865e5f4
  1826. size: 12692573
  1827. - path: generated/spacy_bugginess/label_model.pkl
  1828. md5: 3c2bd4c79be1517f9f535d3bba2209d8
  1829. size: 1887708
  1830. params:
  1831. bohr.json:
  1832. bohr_framework_version: 0.4.10
  1833. outs:
  1834. - path: labeled-datasets/spacy_bugginess/herzig.labeled.csv
  1835. md5: c23b134060d6e6659bfaac22ad691a90
  1836. size: 1540065
  1837. spacy_bugginess_label_dataset_developer-labeled-commits:
  1838. cmd: bohr porcelain label-dataset spacy_bugginess developer-labeled-commits
  1839. deps:
  1840. - path: data/developer-labeled.csv
  1841. md5: db835bc072a7fbfb2fa947c1d5dbb1aa
  1842. size: 121817
  1843. - path: generated/spacy_bugginess/heuristic_matrix_developer-labeled-commits.pkl
  1844. md5: b71eb34d2f0264c9d9cfc373727f1d31
  1845. size: 2363860
  1846. - path: generated/spacy_bugginess/label_model.pkl
  1847. md5: 3c2bd4c79be1517f9f535d3bba2209d8
  1848. size: 1887708
  1849. params:
  1850. bohr.json:
  1851. bohr_framework_version: 0.4.10
  1852. outs:
  1853. - path: labeled-datasets/spacy_bugginess/developer-labeled-commits.labeled.csv
  1854. md5: c1393fcf2bc3153e8205c463820fe23f
  1855. size: 133282
  1856. spacy_bugginess_label_dataset_fine-grained-refactorings:
  1857. cmd: bohr porcelain label-dataset spacy_bugginess fine-grained-refactorings
  1858. deps:
  1859. - path: data/fine-grained-refactorings.csv
  1860. md5: 4b2fed41042a5ceb2e95738f35650beb
  1861. size: 358328
  1862. - path: generated/spacy_bugginess/heuristic_matrix_fine-grained-refactorings.pkl
  1863. md5: 675213aedbce431d370ce3e4568edb6b
  1864. size: 3733076
  1865. - path: generated/spacy_bugginess/label_model.pkl
  1866. md5: 3c2bd4c79be1517f9f535d3bba2209d8
  1867. size: 1887708
  1868. params:
  1869. bohr.json:
  1870. bohr_framework_version: 0.4.10
  1871. outs:
  1872. - path: labeled-datasets/spacy_bugginess/fine-grained-refactorings.labeled.csv
  1873. md5: 626b6640b9accba529b052f7229e6025
  1874. size: 375834
  1875. spacy_bugginess_label_dataset_berger:
  1876. cmd: bohr porcelain label-dataset spacy_bugginess berger
  1877. deps:
  1878. - path: data/berger.csv
  1879. md5: 126de41c9204a9e807e72406b1f9d631
  1880. size: 62247
  1881. - path: generated/spacy_bugginess/heuristic_matrix_berger.pkl
  1882. md5: ea85cfe60bdb18ef08697d3c379dad23
  1883. size: 924116
  1884. - path: generated/spacy_bugginess/label_model.pkl
  1885. md5: 3c2bd4c79be1517f9f535d3bba2209d8
  1886. size: 1887708
  1887. params:
  1888. bohr.json:
  1889. bohr_framework_version: 0.4.10
  1890. outs:
  1891. - path: labeled-datasets/spacy_bugginess/berger.labeled.csv
  1892. md5: 498ebdd9c39cb01c351c5805f2b448e1
  1893. size: 66772
  1894. bugginess_apply_heuristics__heuristics_spacy_bugginess__herzig:
  1895. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.spacy_bugginess
  1896. --dataset herzig
  1897. deps:
  1898. - path: data/herzig.csv
  1899. md5: 69a17c08643aed84b874384a2a57c7ed
  1900. size: 1483281
  1901. - path: heuristics/spacy_bugginess.py
  1902. md5: 53c7ba0d4d416a0e55a4a883bb07b780
  1903. size: 2416
  1904. - path: labels.py
  1905. md5: 1404972881fc94fbf1039b625bd4ccc0
  1906. size: 1859
  1907. params:
  1908. bohr.json:
  1909. bohr_framework_version: 0.4.10
  1910. outs:
  1911. - path: generated/bugginess/heuristics.spacy_bugginess/heuristic_matrix_herzig.pkl
  1912. md5: 6be4da5cf3cc5c31485cd7be9454a502
  1913. size: 42468
  1914. - path: metrics/bugginess/heuristics.spacy_bugginess/heuristic_metrics_herzig.json
  1915. md5: 59d2e6ad532ac50f87ff36e7f3af8613
  1916. size: 74
  1917. bugginess_apply_heuristics__heuristics_spacy_bugginess__1151-commits:
  1918. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.spacy_bugginess
  1919. --dataset 1151-commits
  1920. deps:
  1921. - path: data/1151-commits.csv
  1922. md5: dd000fe19ba4aac9efa3a3856e2acc5e
  1923. size: 346306
  1924. - path: heuristics/spacy_bugginess.py
  1925. md5: 53c7ba0d4d416a0e55a4a883bb07b780
  1926. size: 2416
  1927. - path: labels.py
  1928. md5: 1404972881fc94fbf1039b625bd4ccc0
  1929. size: 1859
  1930. params:
  1931. bohr.json:
  1932. bohr_framework_version: 0.4.10
  1933. outs:
  1934. - path: generated/bugginess/heuristics.spacy_bugginess/heuristic_matrix_1151-commits.pkl
  1935. md5: 7ce2e0846705b59c985ee6fdfec54688
  1936. size: 9964
  1937. - path: metrics/bugginess/heuristics.spacy_bugginess/heuristic_metrics_1151-commits.json
  1938. md5: 0e762c55e17b7fa3021f08d056f21d91
  1939. size: 74
  1940. bugginess_apply_heuristics__heuristics_spacy_bugginess__200k-commits:
  1941. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.spacy_bugginess
  1942. --dataset 200k-commits
  1943. deps:
  1944. - path: data/200k-commits-files.csv
  1945. md5: bc989c140c305bed62a5a8b161883d3b
  1946. size: 2284439219
  1947. - path: data/200k-commits-issues.csv
  1948. md5: da4b0d654f7ce1469857b9171a9647aa
  1949. size: 96908075
  1950. - path: data/200k-commits-manual-labels.csv
  1951. md5: 447bf23d38df7f7e3007dc35f70cab91
  1952. size: 1187
  1953. - path: data/200k-commits.csv
  1954. md5: 6ce10284e630c44110ffc483a7bb33df
  1955. size: 71402002
  1956. - path: heuristics/spacy_bugginess.py
  1957. md5: 53c7ba0d4d416a0e55a4a883bb07b780
  1958. size: 2416
  1959. - path: labels.py
  1960. md5: 1404972881fc94fbf1039b625bd4ccc0
  1961. size: 1859
  1962. params:
  1963. bohr.json:
  1964. bohr_framework_version: 0.4.10
  1965. outs:
  1966. - path: generated/bugginess/heuristics.spacy_bugginess/heuristic_matrix_200k-commits.pkl
  1967. md5: 6d11a8ffe074e1b1cad1f68282b7a821
  1968. size: 1651953
  1969. - path: metrics/bugginess/heuristics.spacy_bugginess/heuristic_metrics_200k-commits.json
  1970. md5: 407f76665fb6f1d936772ba68a29d967
  1971. size: 32
  1972. bugginess_apply_heuristics__heuristics_spacy_bugginess__berger:
  1973. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.spacy_bugginess
  1974. --dataset berger
  1975. deps:
  1976. - path: data/berger.csv
  1977. md5: 126de41c9204a9e807e72406b1f9d631
  1978. size: 62247
  1979. - path: heuristics/spacy_bugginess.py
  1980. md5: 53c7ba0d4d416a0e55a4a883bb07b780
  1981. size: 2416
  1982. - path: labels.py
  1983. md5: 1404972881fc94fbf1039b625bd4ccc0
  1984. size: 1859
  1985. params:
  1986. bohr.json:
  1987. bohr_framework_version: 0.4.10
  1988. outs:
  1989. - path: generated/bugginess/heuristics.spacy_bugginess/heuristic_matrix_berger.pkl
  1990. md5: e6271ef87fe70277dc4075960da80517
  1991. size: 3756
  1992. - path: metrics/bugginess/heuristics.spacy_bugginess/heuristic_metrics_berger.json
  1993. md5: b1e3bb8f69724c21f03b27079215a275
  1994. size: 60
  1995. bugginess_apply_heuristics__heuristics_spacy_bugginess__developer-labeled-commits:
  1996. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.spacy_bugginess
  1997. --dataset developer-labeled-commits
  1998. deps:
  1999. - path: data/developer-labeled.csv
  2000. md5: db835bc072a7fbfb2fa947c1d5dbb1aa
  2001. size: 121817
  2002. - path: heuristics/spacy_bugginess.py
  2003. md5: 53c7ba0d4d416a0e55a4a883bb07b780
  2004. size: 2416
  2005. - path: labels.py
  2006. md5: 1404972881fc94fbf1039b625bd4ccc0
  2007. size: 1859
  2008. params:
  2009. bohr.json:
  2010. bohr_framework_version: 0.4.10
  2011. outs:
  2012. - path: generated/bugginess/heuristics.spacy_bugginess/heuristic_matrix_developer-labeled-commits.pkl
  2013. md5: d6cf7bd8b27954b36508d2c7c6f82e77
  2014. size: 8492
  2015. - path: metrics/bugginess/heuristics.spacy_bugginess/heuristic_metrics_developer-labeled-commits.json
  2016. md5: 69e42befbee70dc12a86b507501242c2
  2017. size: 73
  2018. bugginess_apply_heuristics__heuristics_spacy_bugginess__fine-grained-refactorings:
  2019. cmd: bohr porcelain apply-heuristics bugginess --heuristic-group heuristics.spacy_bugginess
  2020. --dataset fine-grained-refactorings
  2021. deps:
  2022. - path: data/fine-grained-refactorings.csv
  2023. md5: 4b2fed41042a5ceb2e95738f35650beb
  2024. size: 358328
  2025. - path: heuristics/spacy_bugginess.py
  2026. md5: 53c7ba0d4d416a0e55a4a883bb07b780
  2027. size: 2416
  2028. - path: labels.py
  2029. md5: 1404972881fc94fbf1039b625bd4ccc0
  2030. size: 1859
  2031. params:
  2032. bohr.json:
  2033. bohr_framework_version: 0.4.10
  2034. outs:
  2035. - path: generated/bugginess/heuristics.spacy_bugginess/heuristic_matrix_fine-grained-refactorings.pkl
  2036. md5: b39471a6430f306caac12ef016e44e2f
  2037. size: 12996
  2038. - path: metrics/bugginess/heuristics.spacy_bugginess/heuristic_metrics_fine-grained-refactorings.json
  2039. md5: f51c715360d7d297b4479335a8795b20
  2040. size: 32