General:  deep learning Type:  model Task:  speech recognition Data Domain:  audio Framework:  pytorch Integration:  dvc git mlflow arxiv

(Speech to Text) Conformer Model

Model Architecture

Unsupervised Training

Semi-Supervised Training

Multi-Head Attention Module

Convolution Module

Feed Forward Module

Folder Structure

assets
configs
preprocessing
|---processor.py
pretraining
|---byol.py
src
|---modules
|---|---decoder.py
|---|---encoder.py
|---utils
|---|---activation.py
|---|---attention.py
|---|---block.py
|---|---convolution.py
|---|---ffn.py
|---|---masking.py
|---|---position.py
|---conformer.py
|---loss.py
|---metric.py
vocabulary
.gitignore
build_lm.py
build_onnx.py
dataset.py
infer.py
pretrain.py
README.md
requirements.txt
test.py
train.py

Setup Environment

git clone https://dagshub.com/Tri_0401/Conformer.git
cd Conformer
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Pull Dataset (CSV Files)

dvc pull

Setup KenLM

The setup instructions follow the KenLM repository: https://github.com/kpu/kenlm.git

sudo apt install cmake build-essential libboost-system-dev libboost-thread-dev libboost-program-options-dev libboost-test-dev
git clone https://github.com/kpu/kenlm.git
cd kenlm
mkdir -p build
cd build
cmake ..
make -j 4

Train ASR Model

# Train the Conformer model on the selected GPU
CUDA_VISIBLE_DEVICES={index} python3 train.py --device cuda --batch_size {train_batch_size} --val_batch_size {val_batch_size} --num_epochs {number_of_epochs}
# Build the n-gram language model with KenLM's lmplz
kenlm/build/bin/lmplz -o {n_gram} --text {text_path} --arpa {lm_path}
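
The two commands above take several placeholders, so it can help to assemble them programmatically before running them. The following is a minimal sketch, not part of the repository: the helper names `train_command` and `lmplz_command` are hypothetical, and the example values (GPU 0, batch sizes 16 and 8, 50 epochs, a 3-gram model, `corpus.txt`, `lm.arpa`) are assumptions chosen only for illustration. The flags themselves are copied from the commands above.

```python
import shlex


def train_command(gpu_index, batch_size, val_batch_size, num_epochs):
    """Return (env, argv) for the train.py invocation shown in this README.

    Hypothetical helper: it only fills in the README's placeholders.
    """
    env = {"CUDA_VISIBLE_DEVICES": str(gpu_index)}
    argv = [
        "python3", "train.py",
        "--device", "cuda",
        "--batch_size", str(batch_size),
        "--val_batch_size", str(val_batch_size),
        "--num_epochs", str(num_epochs),
    ]
    return env, argv


def lmplz_command(n_gram, text_path, lm_path):
    """Return argv for the KenLM lmplz invocation shown in this README."""
    return [
        "kenlm/build/bin/lmplz",
        "-o", str(n_gram),
        "--text", text_path,
        "--arpa", lm_path,
    ]


if __name__ == "__main__":
    # Example values are assumptions, not settings recommended by the repository.
    env, argv = train_command(gpu_index=0, batch_size=16, val_batch_size=8, num_epochs=50)
    prefix = " ".join(f"{key}={value}" for key, value in env.items())
    print(prefix, shlex.join(argv))
    print(shlex.join(lmplz_command(3, "corpus.txt", "lm.arpa")))
```

To actually launch a command rather than print it, the argument list can be passed to `subprocess.run(argv, env={**os.environ, **env}, check=True)`. Passing a list avoids shell quoting problems with paths that contain spaces.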