Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
General:  hacktoberfest Type:  dataset Data Domain:  audio
L-theorist 0be320781b
updated README.md
2 years ago
b54567917d
set dvc remote
2 years ago
a226489fb4
added data folders to dvc tracking
2 years ago
a226489fb4
added data folders to dvc tracking
2 years ago
696b94e23e
added research preprint and license
2 years ago
0be320781b
updated README.md
2 years ago
696b94e23e
added research preprint and license
2 years ago
a226489fb4
added data folders to dvc tracking
2 years ago
1d8a03f400
upload train/crowd/1
2 years ago
Storage Buckets
Data Pipeline
Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

README.md

You have to be logged in to leave a comment. Sign In

Golos - Russian ASR dataset

Note: This is at the moment only a sample of the whole dataset:

train/test/crowd/0
train/test/crowd/1
test
*.jsonl

General Information

Golos is a Russian corpus suitable for speech research. The dataset mainly consists of recorded audio files manually annotated on the crowd-sourcing platform. The total duration of the audio is about 1240 hours.

For more overview see the research article. For a detailed account and acoustic models please consult the original github repository.

Folder structure

.
├── test
│   ├── crowd
│   │   └── files
│   └── farfield
│       └── files
└── train
    ├── crowd
    │   ├── 0
    │   ├── 1   (not uploaded yet)
    │   ├── 2   (not uploaded yet)
    │   ├── 3   (not uploaded yet)
    │   ├── 4   (not uploaded yet)
    │   ├── 5   (not uploaded yet)
    │   ├── 6   (not uploaded yet)
    │   ├── 7   (not uploaded yet)
    │   ├── 8   (not uploaded yet)
    │   └── 9   (not uploaded yet)
    └── farfield
    ├──── 1hour.jsonl
    ├──── 10hours.jsonl
    ├──── 10min.jsonl
    ├──── 100hours.jsonl
    ├──── manifest.jsonl

Full Dataset structure

Domain Train files Train hours Test files Test hours
Crowd 979 796 1 095 9 994 11.2
Farfield 124 003 132.4 1 916 1.4
Total 1 103 799 1 227.4 11 910 12.6

License

Creative Commons License
This work is licensed under a variant of Creative Commons Attribution-ShareAlike 4.0 International License.

Please see the specific license.

Authors and Credits

Alexander Denisenko
Angelina Kovalenko
Fedor Minkin
Nikolay Karpov

You can cite the data using the following BibTeX entry:

@article{karpov2021golos, title={Golos: Russian Dataset for Speech Research}, author={Karpov, Nikolay and Denisenko, Alexander and Minkin, Fedor}, journal={arXiv preprint arXiv:2106.10161}, year={2021} }

Tip!

Press p or to see the previous file or, n or to see the next file

About

Russian ASR dataset, see https://github.com/sberdevices/golos

Collaborators 1

Comments

Loading...