kinkusuma
Interested in machine learning development
Updated 3 years ago
Parallel English speech samples from 177 countries
dataset audio
Updated 3 years ago
This is a public domain speech dataset consisting of 13,100 short audio clips of a single speaker reading passages from 7 non-fiction books. A transcription is provided for each clip. Clips vary in length from 1 to 10 seconds and have a total length of approximately 24 hours.
dataset audio
Updated 3 years ago
347 dialogs with 9,083 system-user exchanges; emotions classified as garbage, non-angry, slightly angry and very angry.
dataset audio
Updated 3 years ago
4 speakers, 2,000 recordings (50 of each digit per speaker), English pronunciations.
dataset audio
Updated 3 years ago
The DAPS (Device and Produced Speech) dataset is a collection of aligned versions of professionally produced studio speech recordings and recordings of the same speech on common consumer devices (tablet and smartphone) in real-world environments.
dataset audio
Updated 3 years ago
Children's Song Dataset is an open source dataset for singing voice research. It contains 50 Korean and 50 English songs sung by one Korean female professional pop singer.
dataset audio
Updated 3 years ago
Basic Arabic Vocal Emotions Dataset (BAVED) is a dataset of Arabic words spoken at different levels of emotion, recorded in WAV audio format.
dataset audio
Updated 3 years ago
A labeled collection of 2,000 environmental audio recordings suitable for benchmarking methods of environmental sound classification.
dataset audio
Updated 3 years ago
Multi-track music dataset for music source separation.
dataset audio