Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

General

open-data-registry aws-pds sustainability agriculture earth observation geospatial life sciences + 712

Task

disaster response classification image classification object detection autonomous vehicles machine translation vision + 490

 Open Source Data Science Datasets

Path: .

Showcasing DagsHub Annotations, Label Studio integration, Discussions, and other related features

dataset nlp audio computer vision tabular label studio

Path: .

Open-source audio datasets hosted on DagsHub

dataset audio git github

Path: .

Detect a person's gender from a voice file

dataset audio

Path: .

URDU emotions dataset by Siddique Latif :

dataset audio

Path: .

Good for wake word detection; a wide array of sounds that can be used for object detection research.

dataset audio

Path: .

Italian: 6 actors who played 14 sentences; 6 emotions: disgust, fear, anger, joy, surprise, sadness.

dataset audio

Path: .

DVC project for inputs from Urban Sounds source: https://urbansounddataset.weebly.com/index.html#welcome

dataset audio

Path: .

Speech Commands Dataset by TensorFlow and AIY teams:

dataset audio

Path: .

The Variably Intense Vocalizations of Affect and Emotion Corpus (VIVAE)

dataset audio

Path: .

Multi-track music dataset for music source separation.

dataset audio

Path: .

A labeled collection of 2000 environmental audio recordings suitable for benchmarking methods of environmental sound classification.

dataset audio

Path: .

Basic Arabic Vocal Emotions Dataset (BAVED) is a dataset that contains Arabic words spelled in different levels of emotions recorded in an audio/wav format.

dataset audio

Path: .

800 recording spoken by 10 actors (5 males and 5 females); 7 emotions: anger, neutral, fear, boredom, happiness, sadness, disgust.

dataset audio

Path: .

Children's Song Dataset is open source dataset for singing voice research. This dataset contains 50 Korean and 50 English songs sung by one Korean female professional pop singer.

dataset audio

Path: .

The DAPS (Device and Produced Speech) dataset is a collection of aligned versions of professionally produced studio speech recordings and recordings of the same speech on common consumer devices (tablet and smartphone) in real-world environments.

dataset audio

Path: .

4 speakers, 2,000 recordings (50 of each digit per speaker), English pronunciations.

dataset audio

Path: .

This is a public domain speech dataset consisting of 13,100 short audio clips of a single speaker reading passages from 7 non-fiction books. A transcription is provided for each clip. Clips vary in length from 1 to 10 seconds and have a total length of approximately 24 hours.

dataset audio