Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

General

open-data-registry aws-pds sustainability agriculture earth observation geospatial life sciences + 724

Task

disaster response classification image classification object detection autonomous vehicles machine translation vision + 490

 Open Source Data Science Datasets

Path: .

In the realm of predictive analysis for Hill Valley, logistic regression is a statistical technique employed to forecast binary outcomes related to various factors in the region

dataset model classification tabular scikit-learn object detection image classification information retrieval anomaly detection git mlflow github

Path: .

World Mortality Dataset: international data on all-cause mortality.

dataset tabular git github

DagsHub / IMDb

Updated 1 year ago

Path: .

Subsets of IMDb data are available for access to customers for personal and non-commercial use

dataset nlp tabular dvc git

Dean / RPPP

Updated 2 years ago

Path: raw

RPPP – Reddit Post Popularity Predictor A project with two goals: 1. Given a Reddit post, predict how popular it's going to be (what it's score will be) 2. Showcasing a remote working file system with DVC

dataset model nlp tabular dvc git

Path: .

Designing your first machine learning pipeline with few lines of codes and simple drag and drop using Orchest. In this project we will train binary classification model to predict epitope which is used for vaccine development.

dataset classification tabular scikit-learn git github