README.txt 1.4 KB

16S rRNA Trimmed Data Set

This file contains ~72 million reads corresponding to deconvoluted, trimmed
16S sequences from SRA Study id SRP002395 (Human Microbiome Project 16S rRNA
454 Clinical Production Phase I). This represents 7518 preparations from 5034
samples. 16S variable region V3-5 was sequenced for all 5034 samples, with
variable regions V1-3 and V6-9 also sequenced for subsets of the samples. 18
body sub-sites are represented in this dataset.

This is a gzipped multi-FASTA file of reverse-complemented 454 clear ranges,
with the following subsequences removed:

1. initial "TCAG" (must have been present in the original read)
2. reverse barcode sequence (must have been present in the original read)
3. reverse primer sequence (must have been present in the original read)
4. forward primer sequence (if present within the clear range)
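
The four removal steps above can be illustrated with a minimal Python sketch.
This is not the pipeline that produced the file. It assumes exact string
matching (real primers often contain degenerate bases), assumes the three
required elements appear in the listed order at the start of the original
read, and assumes the forward primer, when present, sits at the start of the
reverse-complemented sequence. The function names and the toy sequences are
hypothetical.

```python
# Complement table for unambiguous DNA bases plus N (both cases).
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")

KEY = "TCAG"  # the initial key sequence named in the README


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def trim_read(read, barcode, reverse_primer, forward_primer):
    """Trim one read following the four steps listed in the README.

    Returns the trimmed, reverse-complemented sequence, or None when a
    required element (key, reverse barcode, reverse primer) is missing.
    Assumption: the required elements occur in this order at the 5' end.
    """
    rest = read
    for required in (KEY, barcode, reverse_primer):
        if not rest.startswith(required):
            return None  # "must have been present in the original read"
        rest = rest[len(required):]

    oriented = reverse_complement(rest)

    # Step 4 is optional: remove the forward primer only if it is present.
    if oriented.startswith(forward_primer):
        oriented = oriented[len(forward_primer):]
    return oriented


if __name__ == "__main__":
    # Toy sequences, made up for illustration only.
    barcode, rev_primer, fwd_primer = "AACC", "GGTT", "CATG"
    amplicon = "AAAGGGCCC"
    raw = KEY + barcode + rev_primer + reverse_complement(fwd_primer + amplicon)
    print(trim_read(raw, barcode, rev_primer, fwd_primer))
```

Returning None for reads that lack a required element mirrors the "must have
been present" wording; how such reads were actually handled is not stated
here.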

SRA runs containing a total of approximately 10,000 reads could not be
successfully converted to SFF by the sffdump utility in the NCBI SRA SDK and
have been excluded from this initial release.

Ongoing HMP 16S analyses are being performed on a dataset containing reads
from both SRA Study ids SRP002395 (Human Microbiome Project 16S rRNA 454
Clinical Production Phase I) and SRP002012 (Human Microbiome Project 454
Clinical Production Pilot, PPS). This dataset currently represents only the
former project. We are in the process of readying SRP002012 reads for release
on this site.
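
Because the data set is a gzipped multi-FASTA file with roughly 72 million
reads, it is usually more practical to stream it than to load it into memory.
The sketch below shows one way to do that with the Python standard library.
The function names are hypothetical, and the path passed to count_reads is
whatever name the downloaded file has locally; no particular header layout is
assumed beyond the leading ">".

```python
import gzip


def iter_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA text lines.

    Sequence lines belonging to one record are joined into a single string.
    """
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None and line:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def count_reads(path):
    """Count the records in a gzipped multi-FASTA file without loading it whole."""
    with gzip.open(path, "rt") as handle:
        return sum(1 for _ in iter_fasta(handle))
```

Since iter_fasta accepts any iterable of lines, the same function works on an
open gzip handle, a plain text file, or a list of strings in a test.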