The Singapore Nanopore Expression Data Set

Stream data with DagsHub Direct Data Access (DDA):

from dagshub.streaming import DagsHubFilesystem

fs = DagsHubFilesystem(".", repo_url="https://dagshub.com/DagsHub-Datasets/sgnex-dataset")

fs.listdir("s3://sg-nex-data")
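A bucket listing can be long, so it may help to narrow it to the file types of interest. The sketch below is a minimal helper, not part of the dagshub client: `filter_by_suffix` and `SupportsListdir` are hypothetical names, and it assumes `listdir` returns plain entry names in the same way `os.listdir` does.

```python
from typing import Iterable, List, Protocol


class SupportsListdir(Protocol):
    """Any object with an os.listdir-style method (assumed to include DagsHubFilesystem)."""

    def listdir(self, path: str) -> List[str]: ...


def filter_by_suffix(fs: SupportsListdir, path: str, suffixes: Iterable[str]) -> List[str]:
    """Return the sorted entries under `path` whose names end with one of `suffixes`."""
    wanted = tuple(suffixes)
    # str.endswith accepts a tuple and matches if any suffix fits.
    return sorted(name for name in fs.listdir(path) if name.endswith(wanted))
```

With the `fs` object created above, `filter_by_suffix(fs, "s3://sg-nex-data", (".txt",))` would keep only entries ending in `.txt`, provided the assumption about `listdir` holds.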

Description:

The Singapore Nanopore Expression (SG-NEx) project is an international collaboration to generate reference transcriptomes and a comprehensive benchmark data set for long-read Nanopore RNA-Seq. Transcriptome profiling is done using PCR-cDNA sequencing (PCR-cDNA), amplification-free cDNA sequencing (direct cDNA), direct sequencing of native RNA (direct RNA), and short-read RNA-Seq. The SG-NEx core data includes five of the most commonly used cell lines and is extended with additional cell lines and samples that cover a broad range of human tissues. All core samples are sequenced with at least three high-quality replicates. For a subset of samples, spike-in RNAs are used and matched m6A profiling data is available.

Update Frequency:

Datasets will be updated periodically as additional data are generated.

Managed By:

https://www.a-star.edu.sg/gis

Resources:

  1. resource:
    • Description: Nanopore long-read RNA-Seq data and matched short-read RNA-Seq data from the Singapore Nanopore Expression Project (SG-NEx). The data includes raw signal data (fast5), basecalled reads (fastq), aligned reads (bam), processed data for RNA modification detection (json), reference genome annotation files (gtf and fa) and sample metadata (txt).
    • ARN: arn:aws:s3:::sg-nex-data
    • Region: ap-southeast-1
    • Type: S3 Bucket
    • Explore: Browse Bucket
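The ARN and region listed above are enough to reach the bucket directly with boto3. The following is a sketch under two assumptions: that the bucket allows anonymous (unsigned) reads, as is usual for Registry of Open Data buckets, and that boto3 is installed. `bucket_from_arn` and `list_top_level` are hypothetical helper names introduced for illustration.

```python
from typing import List


def bucket_from_arn(arn: str) -> str:
    """Extract the bucket name from an S3 ARN such as arn:aws:s3:::sg-nex-data."""
    prefix = "arn:aws:s3:::"
    if not arn.startswith(prefix):
        raise ValueError(f"not an S3 bucket ARN: {arn!r}")
    # Anything after the first "/" would be an object key, not the bucket name.
    return arn[len(prefix):].split("/", 1)[0]


def list_top_level(bucket: str, region: str) -> List[str]:
    """List top-level prefixes and objects of a bucket without credentials.

    Assumes the bucket permits anonymous access. Only the first page of
    results (up to 1000 entries) is returned.
    """
    # Imported lazily so bucket_from_arn works without boto3 installed.
    import boto3
    from botocore import UNSIGNED
    from botocore.config import Config

    s3 = boto3.client("s3", region_name=region, config=Config(signature_version=UNSIGNED))
    resp = s3.list_objects_v2(Bucket=bucket, Delimiter="/")
    prefixes = [p["Prefix"] for p in resp.get("CommonPrefixes", [])]
    keys = [o["Key"] for o in resp.get("Contents", [])]
    return prefixes + keys
```

Calling `list_top_level(bucket_from_arn("arn:aws:s3:::sg-nex-data"), "ap-southeast-1")` should return the top-level folders and files if anonymous access is enabled. Passing the region explicitly avoids relying on a local AWS configuration.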

Tags:

aws-pds, genomic, transcriptomics, life sciences, long read sequencing, short read sequencing, bioinformatics, fast5, fasta, fastq, bam

Tools & Applications:

  1. tools & applications:
    • Title: nf-core/nanoseq: A nanopore DNA and RNA-Seq demultiplexing, QC, alignment and analysis pipeline
    • URL: https://nf-co.re/nanoseq
    • AuthorName: Chelsea Sawyer et al.

About

sgnex-dataset originates from the Registry of Open Data on AWS.