Interpreting and Explaining Deep Neural Networks for Classification of Audio Signals

image

Image credit: Christian Lillelund on Kaggle

Deep neural networks have been successfully applied to problems in many domains. Understanding their inner workings with respect to feature selection and decision making remains challenging, however, so trained models are often regarded as black boxes. Layer-wise Relevance Propagation (LRP) addresses this issue by identifying the features a model relies on, which offers a deeper understanding and interpretation of trained networks. This repository contains the code and data used in "Interpreting and Explaining Deep Neural Networks for Classification of Audio Signals" (https://arxiv.org/abs/1807.03418).

Repository structure

data (audioMNIST)

  • The dataset consists of 30,000 audio samples of spoken digits (0-9) from 60 different speakers.
  • There is one directory per speaker, holding that speaker's audio recordings.
  • Additionally, "audioMNIST_meta.txt" provides meta information such as the gender and age of each speaker.
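The layout described above (one directory per speaker, each holding that speaker's recordings) can be checked with a few lines of Python. This is a minimal sketch, not code from this repository: the function name `count_recordings`, the `.wav` extension, and the `data` path in the usage note are assumptions made for illustration.

```python
from pathlib import Path


def count_recordings(data_root, pattern="*.wav"):
    """Return {speaker_directory_name: number_of_matching_files}.

    Assumes one sub-directory per speaker under data_root, as the
    README describes. The "*.wav" pattern is an assumption.
    """
    root = Path(data_root)
    counts = {}
    # Only directories are treated as speakers; loose files such as
    # the meta file are skipped.
    for speaker_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        counts[speaker_dir.name] = sum(1 for _ in speaker_dir.glob(pattern))
    return counts
```

Calling `count_recordings("data")` on a local copy (hypothetical path) returns one entry per speaker directory. If the 30,000 recordings are spread evenly over the 60 speakers, each entry would be 500, so any deviation points to an incomplete download.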

models

  • Two model architectures and their training parameters, in the format of the Caffe deep learning framework.
  • A Bash script to train and test the models.

recording_scripts

  • Scripts to gather further audio samples.

preprocessing_data.py

  • A Python script that preprocesses the provided audio recordings and stores them in a format suitable for the provided Caffe models.
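To give a feel for what such a preprocessing step can involve, the sketch below reads one recording and brings it to a fixed length. It is not the logic of `preprocessing_data.py`: the helper names, the 16-bit mono PCM requirement, and the idea that the models expect fixed-length input are all assumptions made for illustration. Only the Python standard library is used.

```python
import array
import sys
import wave


def read_pcm16_mono(path):
    """Read a 16-bit mono WAV file; return (sample_rate, list_of_ints).

    Assumption: recordings are 16-bit mono PCM. Other formats are rejected.
    """
    with wave.open(str(path), "rb") as wav:
        if wav.getsampwidth() != 2 or wav.getnchannels() != 1:
            raise ValueError("expected 16-bit mono PCM")
        rate = wav.getframerate()
        samples = array.array("h")
        samples.frombytes(wav.readframes(wav.getnframes()))
    # WAV data is little-endian; swap on big-endian hosts.
    if sys.byteorder == "big":
        samples.byteswap()
    return rate, samples.tolist()


def pad_or_trim(samples, target_len):
    """Zero-pad at the end, or truncate, to exactly target_len samples."""
    if len(samples) >= target_len:
        return list(samples[:target_len])
    return list(samples) + [0] * (target_len - len(samples))
```

A call such as `pad_or_trim(read_pcm16_mono(path)[1], target_len)` yields equally sized inputs. The appropriate `target_len` and the on-disk output format depend on the model definitions in `models`, so check those before relying on this sketch.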

Reference

If you use the provided audioMNIST dataset for your project, please cite our paper:

@ARTICLE{becker2018interpreting,
  author    = {Becker, S\"oren and Ackermann, Marcel and Lapuschkin, Sebastian and M\"uller, Klaus-Robert and Samek, Wojciech},
  title     = {Interpreting and Explaining Deep Neural Networks for Classification of Audio Signals},
  journal   = {CoRR},
  volume    = {abs/1807.03418},
  year      = {2018},
  archivePrefix = {arXiv},
  eprint    = {1807.03418},
}

Dataset DAGsHub Repository

This open source contribution is part of DagsHub x Hacktoberfest
