EmoSynth: The Emotional Synthetic Audio Dataset

1. General information

The ability of sound to enhance human wellbeing has been known since ancient civilizations, and methods can be found today across domains of health and within a variety of cultures. EmoSynth is a dataset of 144 audio files which have been labelled by 40 listeners for their perceived emotion, with regard to the dimensions of Valence and Arousal.

A similar version of the dataset is available on DagsHub (EmoSynth), so you can preview the dataset before downloading it.

2. Organization of the dataset

The dataset is small (106MB) and simple to navigate, as it has only one folder, which contains the synthetic audio files. There is also an audio_labels.csv file, which contains details about the classification of the audio based on the dimensions of Valence and Arousal. Each audio file is approximately 5 seconds long and 430 KB in size.

For the best experience, keep your volume high when listening to the sounds.
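To check the stated clip length for yourself, the duration of a WAV file can be read from its header. The following is a minimal sketch using Python's standard `wave` module. The helper names `wav_duration_seconds` and `make_silent_wav` are made up for illustration, and the stand-in clip (16-bit mono at 44100 Hz) is only an assumption for the demo, not a statement about the format of the EmoSynth files.

```python
import io
import wave


def wav_duration_seconds(source):
    """Return the duration of a WAV file in seconds.

    `source` may be a file path or a binary file-like object.
    """
    with wave.open(source, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def make_silent_wav(seconds=5.0, rate=44100):
    """Build an in-memory WAV of silence as a stand-in for a real clip.

    The 16-bit mono format is an assumption made for this demo only.
    """
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 2 bytes per sample (16-bit)
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(seconds * rate))
    buffer.seek(0)
    return buffer


if __name__ == "__main__":
    # Replace the stand-in with a real path, e.g. "Audio-Data/s1_a0_d1.wav".
    print(wav_duration_seconds(make_silent_wav()))
```

Passing a path such as `Audio-Data/s1_a0_d1.wav` instead of the in-memory stand-in should report a value close to 5 seconds if the file matches the description above.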

<root directory>
    |
    .- README.md
    |
    .- meta.txt
    |
    .- citation.txt
    |
    .- audio_labels.csv
    |
    .- Audio-Data/
          |
          .- s1_a0_d1.wav
          |
          .- s1_a0_d2.wav
          |
          .- s1_a1_d1.wav
          | ...

meta.txt: contains information about how the labels were collected.
citation.txt: contains citation data for the research paper.
audio_labels.csv: contains the labels for each audio file, based on perceived listener ratings from 1 to 5.
- audio_file: name of the WAV audio file
- valence: average valence rating (1~5)
- arousal: average arousal rating (1~5)
- round_val: rounded mean valence rating
- round_ar: rounded mean arousal rating
- round_val_sd: rounded standard deviation of the valence ratings
- round_ar_sd: rounded standard deviation of the arousal ratings
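The columns listed above can be read with Python's standard `csv` module. The sketch below assumes that audio_labels.csv is comma-separated, has a header row with exactly these column names, and stores file names with the `.wav` extension; none of this is confirmed here. The sample rows are made up to mimic the layout and are not real EmoSynth ratings, and `load_labels` and `filter_by_rating` are hypothetical helper names.

```python
import csv
import io

# Made-up rows that only mimic the documented column layout; the numbers
# are NOT real EmoSynth ratings.
SAMPLE_CSV = """audio_file,valence,arousal,round_val,round_ar,round_val_sd,round_ar_sd
s1_a0_d1.wav,2.4,3.1,2,3,1,1
s1_a0_d2.wav,3.6,1.8,4,2,1,1
"""

NUMERIC_COLUMNS = ("valence", "arousal", "round_val", "round_ar",
                   "round_val_sd", "round_ar_sd")


def load_labels(file_obj):
    """Read label rows into a dict keyed by audio file name."""
    labels = {}
    for row in csv.DictReader(file_obj):
        name = row.pop("audio_file")
        labels[name] = {col: float(row[col]) for col in NUMERIC_COLUMNS}
    return labels


def filter_by_rating(labels, column, minimum):
    """Return the sorted file names whose `column` value is at least `minimum`."""
    return sorted(name for name, values in labels.items()
                  if values[column] >= minimum)


if __name__ == "__main__":
    # For the real file: with open("audio_labels.csv", newline="") as f: ...
    labels = load_labels(io.StringIO(SAMPLE_CSV))
    print(filter_by_rating(labels, "arousal", 3.0))
```

Keying the rows by file name makes it easy to look up the ratings for a clip found in the Audio-Data folder.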

3. Results

Results on the dataset show that Arousal correlates moderately with fundamental frequency, and that the sine waveform is perceived as significantly different from square and sawtooth waveforms when evaluating perceived Arousal. The general results suggest that isolated synthetic audio can be modelled as a means of evoking affective states of emotion.

Acknowledgments

First, I would like to thank Alice Baird, Emilia Parada-Cabaleiro, Cameron Fraser, Simone Hantke, and Bjorn Schuller for publishing the dataset on Zenodo and explaining the results. Secondly, I would like to thank Zenodo for maintaining an amazing collection of open datasets.

Alice Baird; Emilia Parada-Cabaleiro, Aug 20, 2019

Original Dataset: EmoSynth | Zenodo

DAGsHub Dataset: kingabzpro/EmoSynth

Photo by Jonathan Borba on Unsplash

This open source contribution is part of DagsHub x Hacktoberfest
