

Deeply Vocal Characterizer Dataset

Sample Dataset


General Information

This dataset is a sample of a larger corpus, the Vocal Characterizer Dataset: a human nonverbal vocal sound dataset consisting of 56.7 hours of short clips from 1419 speakers, crowdsourced from the general public in South Korea. The dataset also includes metadata such as age, sex, noise level, and quality of utterance. The 16 classes of human nonverbal sounds are ‘teeth-chattering’, ‘teeth-grinding’, ‘tongue-clicking’, ‘nose-blowing’, ‘coughing’, ‘yawning’, ‘throat-clearing’, ‘sighing’, ‘lip-popping’, ‘lip-smacking’, ‘panting’, ‘crying’, ‘laughing’, ‘sneezing’, ‘moaning’, and ‘screaming’.

Device: Android phones
Volume: ~57 hours, ~70,000 utterances, ~18 GB, ~1500 speakers
Format: wav/h5 (16/44.1 kHz, 16-bit, mono)
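
As a quick sanity check of the format listed above, the following minimal sketch reads a clip's header with Python's standard-library `wave` module. It assumes that "16/44.1 kHz" means each clip is sampled at either 16 kHz or 44.1 kHz; the function name `check_wav_format` and the `EXPECTED_RATES` constant are illustrative and not part of the dataset.

```python
import wave

# Assumption: "16/44.1 kHz" means a clip uses one of these two sample rates.
EXPECTED_RATES = {16000, 44100}


def check_wav_format(path):
    """Read a WAV header and report whether it matches the listed format."""
    with wave.open(str(path), "rb") as wf:
        info = {
            "sample_rate": wf.getframerate(),
            "bits": wf.getsampwidth() * 8,   # sample width is given in bytes
            "channels": wf.getnchannels(),
            "duration_s": wf.getnframes() / wf.getframerate(),
        }
    # Listed format: 16-bit, mono, 16 kHz or 44.1 kHz.
    info["matches_spec"] = (
        info["sample_rate"] in EXPECTED_RATES
        and info["bits"] == 16
        and info["channels"] == 1
    )
    return info
```

Calling `check_wav_format` on any clip returns a small dictionary; a `matches_spec` value of `False` would point to a file that deviates from the listed format.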

The sample dataset here consists of 723 utterances (ca. 1% of the whole corpus) and is free to use under CC BY-NC-ND 4.0. To access the complete dataset, which is available under a more restrictive license, please get in touch via http://deeplyinc.com/us. See the dataset description and the original GitHub repository for a more detailed account.

Structure


├── Nonverbal_Vocalization.json
├── coughing
│   ├── 0C1S_4_8_0_27_0_1_1.wav
│   ├── ...
├── crying
│   ├── 1TCO_11_10_0_20_0_0_0.wav
│   ├── ...
├── ...
├── tongue-clicking
│   ├── 06RU_2_7_1_38_0_0_0.wav
│   ├── ...
├── yawning
│   ├── 0DYI_5_10_1_12_0_1_0.wav
│   ├── ...
└── Deeply\ Nonverbal\ Vocalization\ Dataset\ description_Eng.pdf
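
Given the layout above (one directory per class, each holding `.wav` clips), a minimal sketch for counting clips per class could look like this. The function name `count_clips` is illustrative, and the sketch assumes every class directory sits directly under the dataset root.

```python
from collections import Counter
from pathlib import Path


def count_clips(root):
    """Return {class_directory_name: number_of_wav_files} for a dataset root."""
    counts = Counter()
    # Match <root>/<class>/<clip>.wav; top-level files such as the JSON
    # metadata and the PDF description are skipped by the pattern.
    for wav_path in Path(root).glob("*/*.wav"):
        counts[wav_path.parent.name] += 1
    return dict(counts)
```

Pass the local path of the downloaded sample to `count_clips`; the returned dictionary maps each class directory name to its number of clips.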
    

Filename convention

{speaker_ID}_{class}_{trial}_{sex}_{age}_{location}_{quality}_{noise}.wav

Class: {0: ‘teeth-chattering’, 1: ‘teeth-grinding’, 2: ‘tongue-clicking’, 3: ‘nose-blowing’, 
    4: ‘coughing’, 5: ‘yawning’, 6: ‘throat-clearing’, 7: ‘sighing’, 8: ‘lip-popping’, 
    9: ‘lip-smacking’, 10: ‘panting’, 11: ‘crying’, 12: ‘laughing’, 13: ‘sneezing’, 14: ‘moaning’, 15: ‘screaming’}
Sex: {0: ‘Female’, 1: ‘Male’}
Location: {0: ‘indoor’, 1: ‘outdoor’}
Quality: {0: ‘High’, 1: ‘Low’}
Noise: {0: ‘Noiseless’, 1: ‘Noisy’}
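
The filename convention and code tables above can be decoded programmatically. The following is a minimal sketch, not an official loader: the names `ClipInfo` and `parse_filename` are illustrative, and it assumes that speaker IDs contain no underscores and that the age field is an integer (the README does not state its unit).

```python
from dataclasses import dataclass
from pathlib import Path

# Code tables copied from the filename convention.
CLASSES = {
    0: "teeth-chattering", 1: "teeth-grinding", 2: "tongue-clicking",
    3: "nose-blowing", 4: "coughing", 5: "yawning", 6: "throat-clearing",
    7: "sighing", 8: "lip-popping", 9: "lip-smacking", 10: "panting",
    11: "crying", 12: "laughing", 13: "sneezing", 14: "moaning",
    15: "screaming",
}
SEX = {0: "Female", 1: "Male"}
LOCATION = {0: "indoor", 1: "outdoor"}
QUALITY = {0: "High", 1: "Low"}
NOISE = {0: "Noiseless", 1: "Noisy"}


@dataclass(frozen=True)
class ClipInfo:
    speaker_id: str
    label: str
    trial: int
    sex: str
    age: int
    location: str
    quality: str
    noise: str


def parse_filename(path):
    """Decode {speaker_ID}_{class}_{trial}_{sex}_{age}_{location}_{quality}_{noise}.wav."""
    parts = Path(path).stem.split("_")
    if len(parts) != 8:
        # Assumption: speaker IDs never contain underscores.
        raise ValueError(f"expected 8 underscore-separated fields, got {len(parts)}: {path}")
    speaker_id = parts[0]
    cls, trial, sex, age, location, quality, noise = (int(p) for p in parts[1:])
    return ClipInfo(
        speaker_id=speaker_id,
        label=CLASSES[cls],
        trial=trial,
        sex=SEX[sex],
        age=age,
        location=LOCATION[location],
        quality=QUALITY[quality],
        noise=NOISE[noise],
    )
```

For example, `parse_filename("coughing/0C1S_4_8_0_27_0_1_1.wav")` yields speaker `0C1S`, label `coughing`, trial 8, sex `Female`, age 27, location `indoor`, quality `Low`, and noise `Noisy`. The decoded label matches the directory name, which makes a simple consistency check possible.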

License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Contact

Tel: (+82) 70-7459-0704
Web: http://deeplyinc.com/
Email: contact@deeplyinc.com


About

Dataset from OpenSLR: http://www.openslr.org/99/
