Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

chr6.log 17 KB

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
  1. [WARNING] No absolute output path provided, using current directory as prefix
  2. [INFO] CLAIR3 VERSION: v0.1-r6
  3. [INFO] BAM FILE PATH: /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6.bam
  4. [INFO] REFERENCE FILE PATH: /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/../../config/ref/GCA_000001405.15_GRCh38_no_alt_analysis_set.fasta
  5. [INFO] MODEL PATH: /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/ont_r104_e81_sup_g5015
  6. [INFO] OUTPUT FOLDER: /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6
  7. [INFO] PLATFORM: ont
  8. [INFO] THREADS: 48
  9. [INFO] BED FILE PATH: /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6.bed
  10. [INFO] VCF FILE PATH: EMPTY
  11. [INFO] CONTIGS: EMPTY
  12. [INFO] CONDA PREFIX: /home/OXFORDNANOLABS/cwright/miniconda3/envs/clair3
  13. [INFO] SAMTOOLS PATH: samtools
  14. [INFO] PYTHON PATH: python3
  15. [INFO] PYPY PATH: pypy3
  16. [INFO] PARALLEL PATH: parallel
  17. [INFO] WHATSHAP PATH: whatshap
  18. [INFO] CHUNK SIZE: 5000000
  19. [INFO] FULL ALIGN PROPORTION: 0.3
  20. [INFO] FULL ALIGN REFERENCE PROPORTION: 0.1
  21. [INFO] ENABLE FILEUP ONLY CALLING: False
  22. [INFO] ENABLE FAST MODE CALLING: False
  23. [INFO] ENABLE CALLING SNP CANDIDATES ONLY: False
  24. [INFO] ENABLE PRINTING REFERENCE CALLS: False
  25. [INFO] ENABLE OUTPUT GVCF: False
  26. [INFO] ENABLE HAPLOID PRECISE MODE: False
  27. [INFO] ENABLE HAPLOID SENSITIVE MODE: False
  28. [INFO] ENABLE INCLUDE ALL CTGS CALLING: False
  29. [INFO] ENABLE NO PHASING FOR FULL ALIGNMENT: False
  30. + /home/OXFORDNANOLABS/cwright/miniconda3/envs/clair3/bin/scripts/clair3.sh --bam_fn /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6.bam --ref_fn /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/../../config/ref/GCA_000001405.15_GRCh38_no_alt_analysis_set.fasta --threads 48 --model_path /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/ont_r104_e81_sup_g5015 --platform ont --output /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6 --bed_fn=/media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6.bed --vcf_fn=EMPTY --ctg_name=EMPTY --sample_name=SAMPLE --chunk_num=0 --chunk_size=5000000 --samtools=samtools --python=python3 --pypy=pypy3 --parallel=parallel --whatshap=whatshap --qual=2 --var_pct_full=0.3 --ref_pct_full=0.1 --snp_min_af=0 --indel_min_af=0 --pileup_only=False --gvcf=False --fast_mode=False --call_snp_only=False --print_ref_calls=False --haploid_precise=False --haploid_sensitive=False --include_all_ctgs=False --no_phasing_for_fa=False --pileup_model_prefix=pileup --fa_model_prefix=full_alignment
  31. [INFO] Check environment variables
  32. [INFO] Create folder /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6/log
  33. [INFO] Create folder /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6/tmp
  34. [INFO] Create folder /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6/tmp/split_beds
  35. [INFO] Create folder /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6/tmp/pileup_output
  36. [INFO] Create folder /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6/tmp/merge_output
  37. [INFO] Create folder /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6/tmp/phase_output
  38. [INFO] Create folder /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6/tmp/gvcf_tmp_output
  39. [INFO] Create folder /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6/tmp/full_alignment_output
  40. [INFO] Create folder /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6/tmp/phase_output/phase_vcf
  41. [INFO] Create folder /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6/tmp/phase_output/phase_bam
  42. [INFO] Create folder /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6/tmp/full_alignment_output/candidate_bed
  43. [INFO] --include_all_ctgs not enabled, use chr{1..22,X,Y} and {1..22,X,Y} by default
  44. [INFO] Call variant in contigs: chr6
  45. [INFO] Chunk number for each contig: 35
  46. [INFO] 1/7 Call variants using pileup model
  47. [INFO] Delay 1 seconds before starting variant calling ...
  48. [mpileup] 1 samples in 1 input files
  49. Calling variants ...
  50. Total processed positions in chr6 (chunk 19/35) : 13591
  51. Total time elapsed: 194.83 s
  52. [INFO] Delay 2 seconds before starting variant calling ...
  53. [mpileup] 1 samples in 1 input files
  54. Calling variants ...
  55. Total processed positions in chr6 (chunk 10/35) : 14808
  56. Total time elapsed: 197.73 s
  57. [INFO] Delay 3 seconds before starting variant calling ...
  58. [mpileup] 1 samples in 1 input files
  59. Calling variants ...
  60. Total processed positions in chr6 (chunk 29/35) : 12779
  61. Total time elapsed: 197.40 s
  62. [INFO] Delay 1 seconds before starting variant calling ...
  63. [mpileup] 1 samples in 1 input files
  64. Calling variants ...
  65. Total processed positions in chr6 (chunk 34/35) : 15600
  66. Total time elapsed: 205.93 s
  67. [INFO] Delay 0 seconds before starting variant calling ...
  68. [mpileup] 1 samples in 1 input files
  69. Calling variants ...
  70. Total processed positions in chr6 (chunk 20/35) : 12458
  71. Total time elapsed: 206.47 s
  72. [INFO] Delay 0 seconds before starting variant calling ...
  73. [mpileup] 1 samples in 1 input files
  74. Calling variants ...
  75. Total processed positions in chr6 (chunk 28/35) : 12097
  76. Total time elapsed: 206.55 s
  77. [INFO] Delay 2 seconds before starting variant calling ...
  78. [mpileup] 1 samples in 1 input files
  79. Calling variants ...
  80. Total processed positions in chr6 (chunk 9/35) : 15132
  81. Total time elapsed: 206.86 s
  82. [INFO] Delay 1 seconds before starting variant calling ...
  83. [mpileup] 1 samples in 1 input files
  84. Calling variants ...
  85. Total processed positions in chr6 (chunk 27/35) : 12934
  86. Total time elapsed: 206.97 s
  87. [INFO] Delay 0 seconds before starting variant calling ...
  88. [mpileup] 1 samples in 1 input files
  89. Calling variants ...
  90. Total processed positions in chr6 (chunk 18/35) : 13629
  91. Total time elapsed: 208.22 s
  92. [INFO] Delay 0 seconds before starting variant calling ...
  93. [mpileup] 1 samples in 1 input files
  94. Calling variants ...
  95. Total processed positions in chr6 (chunk 12/35) : 12612
  96. Total time elapsed: 208.10 s
  97. [INFO] Delay 4 seconds before starting variant calling ...
  98. [mpileup] 1 samples in 1 input files
  99. Calling variants ...
  100. Total processed positions in chr6 (chunk 26/35) : 12071
  101. Total time elapsed: 207.66 s
  102. [INFO] Delay 4 seconds before starting variant calling ...
  103. [mpileup] 1 samples in 1 input files
  104. Calling variants ...
  105. Total processed positions in chr6 (chunk 16/35) : 14568
  106. Total time elapsed: 208.77 s
  107. [INFO] Delay 4 seconds before starting variant calling ...
  108. [mpileup] 1 samples in 1 input files
  109. Calling variants ...
  110. Total processed positions in chr6 (chunk 4/35) : 15995
  111. Total time elapsed: 207.96 s
  112. [INFO] Delay 4 seconds before starting variant calling ...
  113. [mpileup] 1 samples in 1 input files
  114. Calling variants ...
  115. Total processed positions in chr6 (chunk 30/35) : 10807
  116. Total time elapsed: 208.14 s
  117. [INFO] Delay 4 seconds before starting variant calling ...
  118. [mpileup] 1 samples in 1 input files
  119. Calling variants ...
  120. Total processed positions in chr6 (chunk 25/35) : 13144
  121. Total time elapsed: 210.15 s
  122. [INFO] Delay 3 seconds before starting variant calling ...
  123. [mpileup] 1 samples in 1 input files
  124. Calling variants ...
  125. Total processed positions in chr6 (chunk 11/35) : 14349
  126. Total time elapsed: 211.05 s
  127. [INFO] Delay 1 seconds before starting variant calling ...
  128. [mpileup] 1 samples in 1 input files
  129. Calling variants ...
  130. Total processed positions in chr6 (chunk 15/35) : 13182
  131. Total time elapsed: 212.17 s
  132. [INFO] Delay 0 seconds before starting variant calling ...
  133. [mpileup] 1 samples in 1 input files
  134. Calling variants ...
  135. Total processed positions in chr6 (chunk 17/35) : 13371
  136. Total time elapsed: 212.99 s
  137. [INFO] Delay 2 seconds before starting variant calling ...
  138. [mpileup] 1 samples in 1 input files
  139. Calling variants ...
  140. Total processed positions in chr6 (chunk 5/35) : 15571
  141. Total time elapsed: 213.47 s
  142. [INFO] Delay 0 seconds before starting variant calling ...
  143. [mpileup] 1 samples in 1 input files
  144. Calling variants ...
  145. Total processed positions in chr6 (chunk 23/35) : 13575
  146. Total time elapsed: 213.50 s
  147. [INFO] Delay 4 seconds before starting variant calling ...
  148. [mpileup] 1 samples in 1 input files
  149. Calling variants ...
  150. Total processed positions in chr6 (chunk 24/35) : 13010
  151. Total time elapsed: 212.23 s
  152. [INFO] Delay 0 seconds before starting variant calling ...
  153. [mpileup] 1 samples in 1 input files
  154. Calling variants ...
  155. Total processed positions in chr6 (chunk 22/35) : 14340
  156. Total time elapsed: 214.06 s
  157. [INFO] Delay 3 seconds before starting variant calling ...
  158. [mpileup] 1 samples in 1 input files
  159. Calling variants ...
  160. Total processed positions in chr6 (chunk 21/35) : 12413
  161. Total time elapsed: 213.71 s
  162. [INFO] Delay 2 seconds before starting variant calling ...
  163. [mpileup] 1 samples in 1 input files
  164. Calling variants ...
  165. Total processed positions in chr6 (chunk 31/35) : 14610
  166. Total time elapsed: 214.29 s
  167. [INFO] Delay 3 seconds before starting variant calling ...
  168. [mpileup] 1 samples in 1 input files
  169. Calling variants ...
  170. Total processed positions in chr6 (chunk 6/35) : 18237
  171. Total time elapsed: 216.71 s
  172. [INFO] Delay 3 seconds before starting variant calling ...
  173. [mpileup] 1 samples in 1 input files
  174. Calling variants ...
  175. Total processed positions in chr6 (chunk 3/35) : 14603
  176. Total time elapsed: 218.69 s
  177. [INFO] Delay 2 seconds before starting variant calling ...
  178. [mpileup] 1 samples in 1 input files
  179. Calling variants ...
  180. Total processed positions in chr6 (chunk 33/35) : 17987
  181. Total time elapsed: 221.15 s
  182. [INFO] Delay 2 seconds before starting variant calling ...
  183. [mpileup] 1 samples in 1 input files
  184. Calling variants ...
  185. Total processed positions in chr6 (chunk 8/35) : 15418
  186. Total time elapsed: 223.15 s
  187. [INFO] Delay 3 seconds before starting variant calling ...
  188. [mpileup] 1 samples in 1 input files
  189. Calling variants ...
  190. Total processed positions in chr6 (chunk 1/35) : 17631
  191. Total time elapsed: 224.79 s
  192. [INFO] Delay 2 seconds before starting variant calling ...
  193. [mpileup] 1 samples in 1 input files
  194. Calling variants ...
  195. Total processed positions in chr6 (chunk 14/35) : 14615
  196. Total time elapsed: 225.05 s
  197. [INFO] Delay 3 seconds before starting variant calling ...
  198. [faidx] Truncated sequence: chr6:164925849-171806021
  199. [mpileup] 1 samples in 1 input files
  200. Calling variants ...
  201. Processed 20000 tensors
  202. Total processed positions in chr6 (chunk 35/35) : 24916
  203. Total time elapsed: 228.90 s
  204. [INFO] Delay 4 seconds before starting variant calling ...
  205. [mpileup] 1 samples in 1 input files
  206. Calling variants ...
  207. Total processed positions in chr6 (chunk 2/35) : 17101
  208. Total time elapsed: 229.85 s
  209. [INFO] Delay 2 seconds before starting variant calling ...
  210. [mpileup] 1 samples in 1 input files
  211. Calling variants ...
  212. Total processed positions in chr6 (chunk 32/35) : 14757
  213. Total time elapsed: 230.92 s
  214. [INFO] Delay 4 seconds before starting variant calling ...
  215. [mpileup] 1 samples in 1 input files
  216. Calling variants ...
  217. Processed 20000 tensors
  218. Processed 40000 tensors
  219. Total processed positions in chr6 (chunk 7/35) : 48252
  220. Total time elapsed: 253.98 s
  221. [INFO] Delay 1 seconds before starting variant calling ...
  222. [mpileup] 1 samples in 1 input files
  223. Calling variants ...
  224. Processed 20000 tensors
  225. Processed 40000 tensors
  226. Processed 60000 tensors
  227. Processed 80000 tensors
  228. Total processed positions in chr6 (chunk 13/35) : 80612
  229. Total time elapsed: 299.14 s
  230. real 6m13.093s
  231. user 118m44.594s
  232. sys 2m14.765s
  233. [INFO] 2/7 Select heterozygous SNP variants for Whatshap phasing and haplotagging
  234. [INFO] Select heterozygous pileup variants exceeding phasing quality cutoff 22
  235. [INFO] Total heterozygous SNP positions selected: chr6: 111456
  236. real 0m2.061s
  237. user 0m1.483s
  238. sys 0m0.184s
  239. [INFO] 3/7 Phase VCF file using Whatshap
  240. This is WhatsHap 1.0 running under Python 3.6.10
  241. Working on 1 samples from 1 family
  242. ======== Working on chromosome 'chr6'
  243. ---- Processing individual SAMPLE
  244. Using maximum coverage per sample of 15X
  245. Number of variants skipped due to missing genotypes: 0
  246. Number of remaining heterozygous variants: 111456
  247. Reading alignments and detecting alleles ...
  248. Found 698324 reads covering 111418 variants
  249. Kept 557487 reads that cover at least two variants each
  250. Reducing coverage to at most 15X by selecting most informative reads ...
  251. Selected 72202 reads covering 111352 variants
  252. Variants covered by at least one phase-informative read in at least one individual after read selection: 111352
  253. Phasing 1 sample by solving the MEC problem ...
  254. MEC cost: 269220
  255. No. of phased blocks: 352
  256. Largest component contains 2628 variants (2.4% of accessible variants) between position 375341 and 3504839
  257. ======== Writing VCF
  258. Done writing VCF
  259. Changed 16 genotypes while writing VCF
  260. == SUMMARY ==
  261. Maximum memory usage: 1.273 GB
  262. Time spent reading BAM/CRAM: 206.9 s
  263. Time spent parsing VCF: 2.3 s
  264. Time spent selecting reads: 93.8 s
  265. Time spent phasing: 811.9 s
  266. Time spent writing VCF: 2.7 s
  267. Time spent finding components: 84.7 s
  268. Time spent on rest: 4.3 s
  269. Total elapsed time: 1206.7 s
  270. real 20m11.270s
  271. user 19m59.584s
  272. sys 0m6.595s
  273. [INFO] 4/7 Haplotag input BAM file using Whatshap
  274. Found 1 samples in input VCF
  275. Keeping 1 samples for haplo-tagging
  276. Found 0 samples in BAM file
  277. Reading alignments and detecting alleles ...
  278. Found 693682 reads covering 111298 variants
  279. == SUMMARY ==
  280. Total alignments processed: 1543459
  281. Alignments that could be tagged: 692374
  282. Alignments spanning multiple phase sets: 0
  283. haplotag - total processing time: 859.563937664032
  284. real 14m23.638s
  285. user 14m1.966s
  286. sys 0m16.687s
  287. [INFO] 5/7 Select candidates for full-alignment calling
  288. [INFO] Set variants quality cutoff 21.0
  289. [INFO] Set reference calls quality cutoff 10.0
  290. [INFO] Low quality reference calls to be processed in chr6: 29344
  291. [INFO] Low quality variants to be processed in chr6: 94819
  292. real 0m1.676s
  293. user 0m1.285s
  294. sys 0m0.236s
  295. [INFO] 6/7 Call low-quality variants using full-alignment model
  296. [INFO] Delay 3 seconds before starting variant calling ...
  297. [faidx] Truncated sequence: chr6:166386783-171745418
  298. [mpileup] 1 samples in 1 input files
  299. Calling variants ...
  300. Total processed positions in chr6 (chunk 13/13) : 4163
  301. Total time elapsed: 107.95 s
  302. [INFO] Delay 2 seconds before starting variant calling ...
  303. [mpileup] 1 samples in 1 input files
  304. Calling variants ...
  305. Total processed positions in chr6 (chunk 6/13) : 10000
  306. Total time elapsed: 153.37 s
  307. [INFO] Delay 3 seconds before starting variant calling ...
  308. [mpileup] 1 samples in 1 input files
  309. Calling variants ...
  310. Total processed positions in chr6 (chunk 3/13) : 10000
  311. Total time elapsed: 171.86 s
  312. [INFO] Delay 1 seconds before starting variant calling ...
  313. [mpileup] 1 samples in 1 input files
  314. Calling variants ...
  315. Total processed positions in chr6 (chunk 5/13) : 10000
  316. Total time elapsed: 189.74 s
  317. [INFO] Delay 0 seconds before starting variant calling ...
  318. [mpileup] 1 samples in 1 input files
  319. Calling variants ...
  320. Total processed positions in chr6 (chunk 2/13) : 10000
  321. Total time elapsed: 208.55 s
  322. [INFO] Delay 4 seconds before starting variant calling ...
  323. [mpileup] 1 samples in 1 input files
  324. Calling variants ...
  325. Total processed positions in chr6 (chunk 1/13) : 10000
  326. Total time elapsed: 209.56 s
  327. [INFO] Delay 3 seconds before starting variant calling ...
  328. [mpileup] 1 samples in 1 input files
  329. Calling variants ...
  330. Total processed positions in chr6 (chunk 7/13) : 10000
  331. Total time elapsed: 210.31 s
  332. [INFO] Delay 1 seconds before starting variant calling ...
  333. [mpileup] 1 samples in 1 input files
  334. Calling variants ...
  335. Total processed positions in chr6 (chunk 8/13) : 10000
  336. Total time elapsed: 235.95 s
  337. [INFO] Delay 3 seconds before starting variant calling ...
  338. [mpileup] 1 samples in 1 input files
  339. Calling variants ...
  340. Total processed positions in chr6 (chunk 4/13) : 10000
  341. Total time elapsed: 250.71 s
  342. [INFO] Delay 2 seconds before starting variant calling ...
  343. [mpileup] 1 samples in 1 input files
  344. Calling variants ...
  345. Total processed positions in chr6 (chunk 12/13) : 10000
  346. Total time elapsed: 353.18 s
  347. [INFO] Delay 2 seconds before starting variant calling ...
  348. [mpileup] 1 samples in 1 input files
  349. Calling variants ...
  350. Total processed positions in chr6 (chunk 11/13) : 10000
  351. Total time elapsed: 397.82 s
  352. [INFO] Delay 3 seconds before starting variant calling ...
  353. [mpileup] 1 samples in 1 input files
  354. Calling variants ...
  355. Total processed positions in chr6 (chunk 9/13) : 10000
  356. Total time elapsed: 400.47 s
  357. [INFO] Delay 2 seconds before starting variant calling ...
  358. [mpileup] 1 samples in 1 input files
  359. Calling variants ...
  360. Total processed positions in chr6 (chunk 10/13) : 10000
  361. Total time elapsed: 419.11 s
  362. real 7m36.616s
  363. user 72m48.421s
  364. sys 1m44.224s
  365. [INFO] 7/7 Merge pileup VCF and full-alignment VCF
  366. [INFO] Pileup variants processed in chr6: 221616
  367. [INFO] Full-alignment variants processed in chr6: 99333
  368. real 0m3.533s
  369. user 0m3.102s
  370. sys 0m0.266s
  371. [INFO] Finish calling, output file: /media/groups/custflow/active/datasets/gm24385_q20_2021.10/extra_analysis/small_variants/chr6/merge_output.vcf.gz
  372. real 48m58.583s
  373. user 227m39.466s
  374. sys 4m30.038s
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...