
stdpopsim species resources Dataset for Machine Learning

Install DagsHub:

pip install dagshub

To stream this data directly from DagsHub:

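from dagshub.streaming import DagsHubFilesystem

# Open the dataset repository as a filesystem rooted at the current directory
fs = DagsHubFilesystem(".", repo_url="https://dagshub.com/DagsHub-Datasets/stdpopsim_kern-dataset")

# List the top-level entries of the stdpopsim bucket
fs.listdir("s3://stdpopsim")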
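Once a listing is available, it is often useful to narrow it down to one kind of file. The following is a minimal sketch, not part of DagsHub or stdpopsim: the helper name filter_listing, the stand-in fake_listdir, and every entry name in it are hypothetical, and it assumes only that the listing call returns an iterable of names. With a live connection, fs.listdir could be passed in place of fake_listdir.

```python
from typing import Callable, Iterable, List, Tuple


def filter_listing(
    listdir: Callable[[str], Iterable[str]],
    path: str,
    suffixes: Tuple[str, ...],
) -> List[str]:
    """Return the sorted entries under `path` whose names end with one of `suffixes`."""
    names = [str(name) for name in listdir(path)]
    return sorted(name for name in names if name.endswith(suffixes))


# Hypothetical stand-in for fs.listdir so the sketch runs offline.
# The entry names below are made up and do not describe the real bucket.
def fake_listdir(path: str) -> List[str]:
    return [
        "genetic_maps",
        "README.txt",
        "example_map.tar.gz",
        "example_annotation.tar.gz",
    ]


# Keep only compressed archives from the (made-up) listing.
archives = filter_listing(fake_listdir, "s3://stdpopsim", (".tar.gz",))
print(archives)
```

Passing the listing function as an argument keeps the filter independent of any particular filesystem object, so the same helper works with os.listdir or a streaming client.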

Description

This dataset contains all resources (genome specifications, recombination maps, etc.) required for species-specific simulation with the stdpopsim package. These resources come originally from a variety of other consortia and published works but are consolidated here for ease of access and use. If you are interested in adding a new species to the stdpopsim resource, please raise an issue on the stdpopsim GitHub page to have the necessary files added here.

Additional information

Update frequency

Data will be added as new species, genome assemblies, and genetic map data for already included species become available.

Managed by

Andrew Kern & Jerome Kelleher

License

Please see the individual datasets compiled here for licensing details and make sure to cite the original sources of any elements of this data that you use.

Related datasets

Allen Brain Observatory – Visual Coding AWS Public Data Set

Allen Cell Imaging Collections

Biological and Physical Sciences (BPS) Microscopy Benchmark Training Dataset

Cancer Cell Line Encyclopedia (CCLE)
