4 Branches 1 Releases

.dvc

1c78367383

sota

2 years ago

Data

Eval Results

a61ebcbc62

eval and markdown

2 years ago

Model

.dvcignore

73d6051127

first test

2 years ago

.gitignore

73d6051127

first test

2 years ago

Data.dvc

73d6051127

first test

2 years ago

LICENSE

b9df3647ca

Initial commit

2 years ago

Model.dvc

73d6051127

first test

2 years ago

README.md

a61ebcbc62

eval and markdown

2 years ago

eval.py

a61ebcbc62

eval and markdown

2 years ago

metrics.csv

de1a0f3d8c

Sota Results

2 years ago

notebook.ipynb

de1a0f3d8c

Sota Results

2 years ago

params.yml

de1a0f3d8c

Sota Results

2 years ago

requirements.txt

e2f5996ee7

changed requirment

2 years ago

run_eval.sh

28739914e3

eval edit

2 years ago

DagsHub Storage

Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

You have to be logged in to leave a comment.

Urdu-ASR-SOTA

Automatic Speech Recognition using Facebook wav2vec2-xls-r-300m model and mozilla-foundation common_voice_8_0 Urdu Dataset.

wav2vec2-large-xls-r-300m-Urdu

This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the common_voice dataset.

It achieves the following results on the evaluation set:

Loss: 0.9889
Wer: 0.5607
Cer: 0.2370

Evaluation Commands

To evaluate on mozilla-foundation/common_voice_8_0 with split test

python3 ./eval.py --model_id ./Model --dataset ./Data --config ur --split test --chunk_length_s 5.0 --stride_length_s 1.0 --log_outputs

import torch
from datasets import load_dataset, Audio
from transformers import pipeline
import torchaudio.functional as F
model = "Model"
data = load_dataset("Data", "ur", split="test", delimiter="\t")
def path_adjust(batch):
    batch["path"] = "Data/ur/clips/" + str(batch["path"])
    return batch
data = data.map(path_adjust)
sample_iter = iter(data.cast_column("path", Audio(sampling_rate=16_000)))
sample = next(sample_iter)

asr = pipeline("automatic-speech-recognition", model=model)
prediction = asr(
            sample["path"]["array"], chunk_length_s=5, stride_length_s=1)
prediction
# => {'text': 'اب یہ ونگین لمحاتانکھار دلمیں میںفوث کریلیا اجائ'}

Eval results on Common Voice 8 "test" (WER):

Without LM	With LM (run `./eval.py`)
56.21	46.37

Tip!

Press p or to see the previous file or, n or to see the next file

README.md

Urdu-ASR-SOTA

wav2vec2-large-xls-r-300m-Urdu

Evaluation Commands

Eval results on Common Voice 8 "test" (WER):

Comments

Use Google Cloud Storage!

Specify your Google Storage bucket

Service Account Key

Congratulations!

Use AWS S3 as storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use any S3 compatible storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use Azure Cloud Storage!

Specify your Azure Storage bucket

Access key (If needed)

Congratulations!

kingabzpro / Urdu-ASR-SOTA

README.md

Urdu-ASR-SOTA

wav2vec2-large-xls-r-300m-Urdu

Evaluation Commands

Eval results on Common Voice 8 "test" (WER):

Comments

Use Google Cloud Storage!

Specify your Google Storage bucket

Service Account Key

Congratulations!

Use AWS S3 as storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use any S3 compatible storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use Azure Cloud Storage!

Specify your Azure Storage bucket

Access key (If needed)

Congratulations!

kingabzpro
/
Urdu-ASR-SOTA