Audio Datasets

Audio data is a rich source of information that can be leveraged for advanced machine learning applications. By analyzing audio signals, models can learn to identify patterns and make predictions related to speech recognition, music classification, and sound event detection. This opens up new opportunities in fields such as healthcare, entertainment, and security, providing innovative solutions for some of the biggest challenges faced today. With the right tools and algorithms, audio data has the potential to revolutionize the way we interact with technology and the world around us.

Search datasets:

Filter results:

voice gender detection

lego-spoken-dialogue-corpus

free-spoken-digit-dataset

daps-dataset

children-song-dataset

emo-db

basic-arabic-vocal-emotions-dataset

esc50-dataset

musdb18-dataset

VIVAE

Speech Commands Dataset

UrbanSounds

EMOVO

Public Domain Sounds

URDU-Dataset

JL-Corpus

lj-speech-dataset

speech-accent-archive

Estonian-Emotional-Speech-Corpus

Acted-Emotional-Speech-Dynamic-Database

Toronto-emotional-speech-set-TESS

Deeply Nonverbal Vocalization Dataset

Att-HACK

CHiME-Home

EmoSynth

CREMA-D

Bird-Audio-Detection-challenge

CommonVoice

Flickr-Audio-Caption-Corpus

Golos

RSC

MS-SNSD

AudioMNIST

CMU-MOSI

zerospeech2021 dataset

LEGO-Spoken-Dialogue-Corpus

WARBLRB10k-10K Smartphone Recording Dataset

FSL4-4K User Contributed Loops

FSDnoisy18k-Open Audio Dataset

UrbanSound8K-Labeled Urban Sound Excerpts Dataset

Automatic Speech Recognition (ASR) Error Robustness

Voices Obscured in Complex Environmental Settings (VOiCES)

CoversBR

Improve your data quality for better AI

Easily curate and annotate your vision, audio, and document data with a single platform

Book A Demo

More categories

Biology

Computer Vision

Geology

NLP

Tabular

Urban