Open Source Data Science Datasets

WARBLRB10k is a collection of 10,000 smartphone audio recordings from around the UK, crowdsourced by users of Warblr, the bird recognition app.

dataset audio dvc git

0 0 0

The FSL4 dataset contains ~4000 user-contributed loops uploaded to Freesound.

dataset audio dvc git

0 0 0

The FSDnoisy18k dataset is an open dataset containing 42.5 hours of audio across 20 sound event classes, including a small amount of manually-labeled data and a larger quantity of real-world noisy data.

dataset audio dvc git

0 0 0

Urban Sound 8K is an audio dataset that contains 8732 labeled sound excerpts (<=4s) of urban sounds from 10 classes.

dataset audio dvc git

0 0 0

Dataset from OpenSLR. LibriSpeech is a corpus of read speech based on LibriVox's public domain audiobooks. Its purpose is to enable the training and testing of automatic speech recognition (ASR) systems.

dataset audio speech recognition

0 0 0

Urban Sound 8K is an audio dataset that contains 8732 labeled sound excerpts (<=4s) of urban sounds from 10 classes.

dataset audio dvc git

1 0 0

dataset audio dvc git

0 0 0

The FSL4 dataset contains ~4000 user-contributed loops uploaded to Freesound.

dataset audio dvc git

0 0 0

WARBLRB10k is a collection of 10,000 smartphone audio recordings from around the UK, crowdsourced by users of Warblr, the bird recognition app.

dataset audio dvc git

0 0 0

The LEGOv2 database is a parameterized and annotated version of the CMU Let’s Go database. This spoken dialogue corpus contains interactions captured from the CMU Let’s Go (LG) System by Carnegie Mellon University in 2006 and 2007, and it is based on raw log files from the LG system. The corpus was parameterized and annotated by the Dialogue Systems Group at Ulm University, Germany.

dataset audio dvc git

0 0 0

Automatic speech recognition using Facebook's wav2vec2-xls-r-300m model and the mozilla-foundation common_voice_8_0 Urdu dataset.

dataset model audio pytorch transfer learning

2 9 1

The CHiME-Home dataset is a collection of annotated domestic environment audio recordings.

dataset audio dvc git

1 0 0

Emotion expression is an essential part of human interaction. The same text can hold different meanings when expressed with different emotions. Thus, understanding the text alone is not enough to get the meaning of an utterance. Both acted and natural corpora have been used to detect emotions from speech.

dataset audio

0 0 0

EmoSynth is a dataset of 144 audio files that have been labelled by 40 listeners for perceived emotion along the dimensions of Valence and Arousal.

dataset audio

0 0 0

Ukrainian voice dataset consisting of 6,843 short audio clips.

dataset audio speech recognition

0 0 0

Crowd-sourced Emotional Multimodal Actors Dataset

dataset audio dvc

0 1 0

Detecting bird sounds in audio is an important task for automatic wildlife monitoring, as well as in citizen science and audio library management.

dataset audio

1 0 0

No description

dataset audio

0 0 0

The Flickr 8k Audio Caption Corpus contains 40,000 spoken captions of 8,000 natural images. It was collected in 2015 to investigate multimodal learning schemes for unsupervised speech pattern discovery.

dataset audio

0 0 0

Russian ASR dataset; see https://github.com/sberdevices/golos

dataset audio

0 0 0
