
Human Cancer Models Initiative (HCMI) Cancer Model Development Center Dataset for Machine Learning
Install DagsHub:
pip install dagshub
To stream this data directly on DagsHub
from dagshub.streaming import DagsHubFilesystem
fs = DagsHubFilesystem(".", repo_url="https://dagshub.com/DagsHub-Datasets/hcmi-cmdc-dataset")
fs.listdir("s3://gdc-hcmi-cmdc-phs001486-2-open")
Description
The Human Cancer Models Initiative (HCMI) is an international consortium that is generating novel, next-generation, tumor-derived culture models annotated with genomic and clinical data. HCMI-developed models and related data are available as a community resource. The NCI is contributing to the initiative by supporting four Cancer Model Development Centers (CMDCs). CMDCs are tasked with producing next-generation cancer models from clinical samples. The cancer models include tumor types that are rare, originate from patients from underrepresented populations, lack precision therapy, or lack cancer model tools. Throughout the development process, the CMDCs utilize stringent internal QC measures to ensure both clinical and molecular integrity. These models are then annotated with clinical and genomic data and are available as a community resource.
Additional information
Update frequency
Genomic Data Commons (GDC) is source of truth for this dataset; GDC offers monthly data releases,
although this dataset may not be updated at every release.
Managed by
License
NIH Genomic Data Sharing Policy https://gdc.cancer.gov/access-data/data-access-policies