University of British Columbia Sunflower Genome Dataset for Machine Learning

Install DagsHub:

pip install dagshub

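To stream this data directly from DagsHub:

from dagshub.streaming import DagsHubFilesystem

# Attach the DagsHub repository as a virtual filesystem rooted at the current directory
fs = DagsHubFilesystem(".", repo_url="https://dagshub.com/DagsHub-Datasets/ubc-sunflower-genome-dataset")

# List the entries available under the dataset's bucket path
fs.listdir("s3://ubc-sunflower-genome")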
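
Once a listing is available, it can help to get a rough overview of which kinds of files it contains. The sketch below groups entry names by file suffix. It assumes `listdir` returns plain name strings, as `os.listdir` does. The helper `group_by_suffix` and the sample file names are hypothetical and are not taken from the dataset itself.

```python
from collections import defaultdict
from pathlib import PurePosixPath


def group_by_suffix(names):
    """Group entry names by their combined, lowercased file suffix."""
    groups = defaultdict(list)
    for name in names:
        # Join all suffixes so "sample.vcf.gz" is keyed as ".vcf.gz";
        # entries without any suffix (e.g. directories) go under "(none)".
        suffix = "".join(PurePosixPath(name).suffixes).lower() or "(none)"
        groups[suffix].append(name)
    return dict(groups)


# Hypothetical listing for illustration only; in practice, pass the
# result of fs.listdir(...) from the snippet above.
example_listing = ["README", "cohort_a.vcf.gz", "sample_001.bam", "sample_002.bam"]
summary = group_by_suffix(example_listing)
for suffix, members in sorted(summary.items()):
    print(f"{suffix}: {len(members)} file(s)")
```

To apply it to the streamed data, replace `example_listing` with the list returned by the `fs.listdir(...)` call.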

Description

This dataset captures the genetic diversity of sunflowers, drawn from thousands of wild, cultivated, and landrace individuals distributed across North America. The data consist of raw sequences with associated botanical metadata, sequences aligned to three different reference genomes, and sets of SNPs computed across several cohorts.

Additional information

Update frequency

Twice per year.

Managed by

The Rieseberg Lab at the University of British Columbia

License

Public Domain

Related datasets

Allen Brain Observatory – Visual Coding AWS Public Data Set

Allen Cell Imaging Collections

Biological and Physical Sciences (BPS) Microscopy Benchmark Training Dataset

Cancer Cell Line Encyclopedia (CCLE)
