Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
72dd099df6
Initial commit
1 year ago
d0b8891670
update readme automation
1 year ago
Storage Buckets

README.md

You have to be logged in to leave a comment. Sign In

GATK Test Data

Stream data with DDA:

from dagshub.streaming import DagsHubFilesystem

fs = DagsHubFilesystem(".", repo_url="https://dagshub.com/DagsHub-Datasets/gatk-test-data-dataset")

fs.listdir("s3://gatk-test-data")

Description:

The GATK test data resource bundle is a collection of files for resequencing human genomic data with the Broad Institute's Genome Analysis Toolkit (GATK).

Contact:

The GATK test data resource bundle is a collection of files for resequencing human genomic data with the Broad Institute's Genome Analysis Toolkit (GATK).

Update Frequency:

Every 3 months

Managed By:

Broad Institute

Resources:

  1. resource:
    • Description: The contents of this dataset is multi-modal and includes various types of genomic data, such as CRAMs/BAMs, whole-genome sequencing (WGS) data, exome data, RNA data, etc.
    • ARN: arn:aws:s3:::gatk-test-data
    • Region: us-east-1
    • Type: S3 Bucket

Tags:

aws-pds, biology, bioinformatics, cancer, genetic, genomic, life sciences

Tools & Applications:

  1. tools & applications:
Tip!

Press p or to see the previous file or, n or to see the next file

About

gatk-test-data-dataset is originate from the Registry of Open Data on AWS

Collaborators 5

Comments

Loading...