MS-SNSD

The dataset is posted to DagsHub, where you may preview it before downloading it.

About:

This dataset contains a large collection of clean speech files and a variety of environmental noise files in .wav format, sampled at 16 kHz. Its main application is training deep neural network (DNN) models to suppress background noise, but it can also be used for other audio and speech applications. We provide a recipe for mixing clean speech and noise at various signal-to-noise ratio (SNR) conditions to generate a large noisy speech dataset. The SNR conditions and the number of hours of data can be configured to suit the application. The dataset will continue to grow, as we encourage researchers and practitioners to contribute more clean speech and noise clips. It is intended to help researchers and practitioners in academia and industry develop better models. We also provide a test set, distinct from the training set, for evaluating the developed models.
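To make the idea of mixing at a chosen SNR concrete, here is a minimal sketch in plain Python. It is not the project's own mixing recipe: the function names `rms`, `mix_at_snr`, and `measure_snr_db` are hypothetical, the inputs are synthetic stand-ins for real clips, and the SNR is assumed to be defined as the ratio of RMS levels of the clean signal and the scaled noise.

```python
import math
import random


def rms(samples):
    """Root-mean-square level of a sequence of float samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def mix_at_snr(clean, noise, snr_db):
    """Scale `noise` so the clean-to-noise RMS ratio equals `snr_db` (in dB),
    then add it to `clean`. Returns (noisy, scaled_noise).
    Hypothetical helper for illustration only."""
    if len(clean) != len(noise):
        raise ValueError("clean and noise must have the same length")
    noise_rms = rms(noise)
    if noise_rms == 0:
        raise ValueError("noise clip is silent")
    # SNR_dB = 20 * log10(rms_clean / rms_noise)  =>  solve for rms_noise.
    target_noise_rms = rms(clean) / (10 ** (snr_db / 20))
    gain = target_noise_rms / noise_rms
    scaled_noise = [gain * n for n in noise]
    noisy = [c + n for c, n in zip(clean, scaled_noise)]
    return noisy, scaled_noise


def measure_snr_db(clean, noise):
    """SNR in dB between a clean signal and a noise signal."""
    return 20 * math.log10(rms(clean) / rms(noise))


# Synthetic stand-ins for real clips: one second at 16 kHz.
SAMPLE_RATE = 16000
clean = [math.sin(2 * math.pi * 440 * t / SAMPLE_RATE) for t in range(SAMPLE_RATE)]
rng = random.Random(0)  # fixed seed so the example is repeatable
noise = [rng.uniform(-1.0, 1.0) for _ in range(SAMPLE_RATE)]

noisy, scaled_noise = mix_at_snr(clean, noise, snr_db=10.0)
print(measure_snr_db(clean, scaled_noise))
```

A real pipeline would also need to handle clips of different lengths and guard against clipping after the sum; both are left out here to keep the sketch short.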

Structure:

The audio files are in .wav format and sampled at 16 kHz.
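If you add or download clips, a quick check that a file matches the stated 16 kHz sampling rate can be sketched with Python's standard `wave` module. The helper name `is_16khz_wav` and the path in the usage comment are hypothetical, and the sketch assumes uncompressed PCM .wav files, which is what the `wave` module reads.

```python
import wave


def is_16khz_wav(path, expected_rate=16000):
    """Return True if the .wav file at `path` has the expected sample rate.
    Hypothetical helper; assumes an uncompressed PCM .wav file."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate() == expected_rate


# Usage (the path is a placeholder, not a file shipped with the dataset):
# print(is_16khz_wav("some_clip.wav"))
```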

Citation:

@article{reddy2019scalable,
  title={A Scalable Noisy Speech Dataset and Online Subjective Test Framework},
  author={Reddy, Chandan KA and Beyrami, Ebrahim and Pool, Jamie and Cutler, Ross and Srinivasan, Sriram and Gehrke, Johannes},
  journal={Proc. Interspeech 2019},
  pages={1816--1820},
  year={2019}
}

Dataset licenses

MICROSOFT PROVIDES THE DATASETS ON AN "AS IS" BASIS. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, GUARANTEES OR CONDITIONS WITH RESPECT TO YOUR USE OF THE DATASETS. TO THE EXTENT PERMITTED UNDER YOUR LOCAL LAW, MICROSOFT DISCLAIMS ALL LIABILITY FOR ANY DAMAGES OR LOSSES, INCLUDING DIRECT, CONSEQUENTIAL, SPECIAL, INDIRECT, INCIDENTAL OR PUNITIVE, RESULTING FROM YOUR USE OF THE DATASETS.

The datasets are provided under the original terms under which Microsoft received them. See below for more information about each dataset.

The datasets used in this project are licensed as follows:

  1. Clean speech:
  2. Noise:

You may get the dataset by clicking on the link
