
University of British Columbia Sunflower Genome Dataset Dataset for Machine Learning
Install DagsHub:
pip install dagshub
To stream this data directly on DagsHub
from dagshub.streaming import DagsHubFilesystem
fs = DagsHubFilesystem(".", repo_url="https://dagshub.com/DagsHub-Datasets/ubc-sunflower-genome-dataset")
fs.listdir("s3://ubc-sunflower-genome")
Description
This dataset captures Sunflower’s genetic diversity originating from thousands of wild, cultivated, and landrace sunflower individuals distributed across North America. The data consists of raw sequences and associated botanical metadata, aligned sequences (to three different reference genomes), and sets of SNPs computed across several cohorts.
Additional information
Documentation
Update frequency
Twice per year.
Managed by
The Rieseberg Lab at the University of British Columbia
License
Public Domain