 Open Source Data Science Datasets

Parallel English speech samples from 177 countries

dataset audio

26 text passages read by 10 speakers, covering 4 main emotions: joy, sadness, anger, and neutral.

dataset audio

Speech Emotion Recognition (SER) is the process of extracting emotional paralinguistic information from speech.

dataset audio

Dataset from OpenSLR: http://www.openslr.org/99/

dataset audio

Emotional speech corpus with primary and secondary emotions.

dataset audio

Att-HACK: an expressive speech database with social attitudes

dataset audio

The dataset of the Zero Resource Speech Challenge 2021: http://www.zerospeech.com/

dataset audio

CMU Multimodal Opinion Sentiment Intensity (CMU-MOSI) is a dataset of opinion-level sentiment intensity in online videos. It contains 2199 opinion utterances, each annotated for sentiment on a seven-step Likert scale from very negative to very positive.

dataset audio

Microsoft Scalable Noisy Speech Dataset (MS-SNSD)

dataset audio

DagsHub-Datasets / RSC

RuneScape Classic sounds

dataset audio

Russian ASR dataset; see https://github.com/sberdevices/golos

dataset audio

The Flickr 8k Audio Caption Corpus contains 40,000 spoken captions of 8,000 natural images. It was collected in 2015 to investigate multimodal learning schemes for unsupervised speech pattern discovery.

dataset audio

Detecting bird sounds in audio is an important task for automatic wildlife monitoring, as well as in citizen science and audio library management.

dataset audio

Crowd-sourced Emotional Multimodal Actors Dataset

dataset audio dvc

EmoSynth is a dataset of 144 audio files, labelled by 40 listeners for perceived emotion along the dimensions of valence and arousal.

dataset audio

The CHiME-Home dataset is a collection of annotated domestic environment audio recordings.

dataset audio dvc git

The LEGOv2 database is a parameterized and annotated version of the CMU Let’s Go database from 2006 and 2007. This spoken dialogue corpus contains interactions captured by the CMU Let’s Go (LG) system at Carnegie Mellon University and is based on raw log files from that system. The corpus was parameterized and annotated by the Dialogue Systems Group at Ulm University, Germany.

dataset audio