Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

README.md 3.2 KB

You have to be logged in to leave a comment. Sign In

Deeply Vocal Characterizer Dataset

Sample Dataset


General Information

This dataset is a sample of the bigger corpus The Vocal Characterizer Dataset. The latter is a human nonverbal vocal sound dataset consisting of 56.7 hours of short clips from 1419 speakers, crowdsourced by the general public in South Korea. Also, the dataset includes metadata such as age, sex, noise level, and quality of utterance. 16 classes of Included human nonverbal sound contain ‘teeth-chattering’, ‘teeth-grinding’, ‘tongue-clicking’, ‘nose-blowing’, ‘coughing’, ‘yawning’, ‘throat clearing’, ‘sighing’, ‘lip-popping’, ‘lip-smacking’, ‘panting’, ’crying’, ‘laughing’, ‘sneezing’, ‘moaning’, and ‘screaming’.

Device Android phones
Volume ~ 57 hours, ~ 70,000 utterances,
~ 18 GB, ~ 1500 speakers
Format wav/h5(16/44.1kHz, 16-bit, mono)

The sample dataset here consists of 723 utterances (ca. 1% of the whole corpus) and is free to use under CC BY-NC-ND 4.0. For accessing the complete dataset under a more restrictive license please contact http://deeplyinc.com/us. See the dataset description and the original Github repository for a more detailed account.

Structure


├── Nonverbal_Vocalization.json
├── coughing
├────── 0C1S_4_8_0_27_0_1_1.wav
│   ├── ...
├── crying
│   ├── 1TCO_11_10_0_20_0_0_0.wav
│   ├── ...
├── ...
├── ...
├── tongue-clicking
│   ├── 06RU_2_7_1_38_0_0_0.wav
│   ├── ...
└── yawning
    ├── 0DYI_5_10_1_12_0_1_0.wav
    ├── ...
└── Deeply\ Nonverbal\ Vocalization\ Dataset\ description_Eng.pdf
    

Filename convention

{speaker_ID}_{class}_{trial}_{sex}_{age}_{location}_{quality}_{noise}.wav

Class: {0: ‘teeth-chattering’, 1: ‘teeth-grinding’, 2: ‘tongue-clicking’, 3: ‘nose-blowing’, 
    4: ‘coughing’, 5: ‘yawning’, 6: ‘throat-clearing’, 7: ‘sighing’, 8: ‘lip-popping’, 
    9: ‘lip-smacking’, 10: ‘panting’, 11: ‘crying’, 12: ‘laughing’, 13: ‘sneezing’, 14: ‘moaning’, 15: screaming’}
Sex: {0: ‘Female’, 1: ‘Male’}
Location: {0: ‘indoor’, 1: ‘outdoor’}
Quality: {0: ‘High’, 1: ‘Low’}
Noise: {0: ‘Noiseless’, 1: ‘Noisy’}

License

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Contact

Tel: (+82) 70-7459-0704Web: http://deeplyinc.com/Email: contact@deeplyinc.com

Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...