Detect and recognize the American Sign Language alphabet in real-time images using Yolov5 and ZenML

Problem statement: One of the most anticipated capabilities of machine learning and AI is helping people with disabilities. Members of the deaf community cannot do what most of the population takes for granted, and are often placed in degrading situations because of the challenges they face every day. In this ZenML project you will see how computer vision can help bridge that gap for the deaf and hard of hearing: by learning the American Sign Language alphabet, a model can understand the meaning of each sign.

This project uses ZenML to create a pipeline that trains a model to detect and recognize the American Sign Language alphabet in real-time images using Yolov5, MLflow and the Vertex AI Platform.

The purpose of this repository is to demonstrate how ZenML empowers you to build, track and deploy a computer vision pipeline using some of the most popular tools in the industry:

  • By offering you a framework and template on which you can base your own work
  • By using Yolov5, a custom-code object detection model
  • By integrating with tools like MLflow to track the hyperparameters and metrics of the model
  • By allowing you to train your model on Google Vertex AI Platform with minimal effort

Note: This project is based on Interactive ABC's with American Sign Language. The main difference is that this project uses ZenML to create a pipeline that trains a model to detect and recognize the American Sign Language alphabet in real-time images using Yolov5, MLflow and the Vertex AI Platform.

📓 Explanation of the project

In order to build a model that can detect and recognize the American Sign Language alphabet in real-time images, we need to complete the following steps:

  1. Download the dataset from Roboflow
  2. Augment the training and validation sets using Albumentations
  3. Train the model from a pretrained Yolov5 checkpoint in a GPU environment, tracking the hyperparameters and metrics with MLflow, by leveraging Google's Vertex AI step operator stack component.
  4. Load the model in a different pipeline that deploys the model using BentoML and the provided ZenML integration.
  5. Create an inference pipeline that will use the deployed model to detect and recognize the American Sign Language alphabet in test images from the first pipeline.

Requirements

In order to follow this tutorial, you need to have the following software installed on your local machine:

🚀 Remote ZenML Server

For advanced use cases, such as using a remote orchestrator or step operators like Vertex AI, or sharing stacks and pipeline information with a team, you need a separate, non-local ZenML Server that is accessible from your machine as well as from any stack components that need access to it. You can read more about this use case in the ZenML documentation.

There are two ways to get access to a remote ZenML Server:

  1. Deploy and manage the server manually on your own cloud.
  2. Sign up for ZenML Enterprise and get access to a hosted version of the ZenML Server with no setup required.

🐍 Set up the Python environment

Let's jump into the Python packages you need. Within the Python environment of your choice, run:

git clone https://github.com/zenml-io/zenml-projects.git
git submodule update --init --recursive
cd zenml-projects/sign-language-detection-yolov5
pip install -r requirements.txt
pip install -r yolov5/requirements.txt

Starting with ZenML 0.20.0, ZenML comes bundled with a React-based dashboard that lets you observe your stacks, stack components and pipeline DAGs. Since this project relies on the remote ZenML Server described above, you first need to install the optional dependencies for the ZenML server.
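
ZenML's server extra is the usual way to pull these in (adjust the quoting to your shell):

pip install "zenml[server]"

Then connect to your deployed ZenML Server and initialize the repository: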

zenml connect --url=$ZENML_SERVER_URL
zenml init

👣 Step-by-step guide to setting up your GCP project

I will show how to create the Google Cloud resources for this project using the gcloud CLI. If you don't have the CLI set up yet, install and initialize it first.

1. Make sure you are in the correct GCP project

List the current configuration and check that project is set to your GCP project:

gcloud config list

If not, use:

gcloud config set project <PROJECT_ID>

2. Set permissions to create and manage Vertex AI custom jobs and to access data from BigQuery

Create a service account:

gcloud iam service-accounts create <NAME>

# Example:
gcloud iam service-accounts create zenml-sa

Grant permission to the service account:

gcloud projects add-iam-policy-binding <PROJECT_ID> --member="serviceAccount:<SA-NAME>@<PROJECT_ID>.iam.gserviceaccount.com" --role=<ROLE>

# Example:
gcloud projects add-iam-policy-binding zenml-vertex-ai --member="serviceAccount:zenml-sa@zenml-vertex-ai.iam.gserviceaccount.com" --role=roles/storage.admin
gcloud projects add-iam-policy-binding zenml-vertex-ai --member="serviceAccount:zenml-sa@zenml-vertex-ai.iam.gserviceaccount.com" --role=roles/aiplatform.admin

Generate a key file:

gcloud iam service-accounts keys create <FILE-NAME>.json --iam-account=<SA-NAME>@<PROJECT_ID>.iam.gserviceaccount.com

# Example:
gcloud iam service-accounts keys create credentials.json --iam-account=zenml-sa@zenml-vertex-ai.iam.gserviceaccount.com

Make sure that these credentials are available in your environment so that the relevant permissions are picked up for GCP authentication:

export GOOGLE_APPLICATION_CREDENTIALS="<PATH_TO_JSON_KEY_FILE>"
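
Optionally, you can verify that the key works by activating the service account with gcloud (a standard gcloud command, shown here with the example file name from above):

gcloud auth activate-service-account --key-file=credentials.json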

3. Create a GCP bucket

Vertex AI and ZenML will use this bucket for output of any artifacts from the training run:

gsutil mb -l <REGION> gs://<BUCKET-NAME>

# Example:
gsutil mb -l europe-west1 gs://zenml-bucket
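
You can confirm that the bucket was created (assuming the example name above):

gsutil ls -b gs://zenml-bucket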

4. Configure and enable Container Registry in GCP

ZenML will use this registry to push the job images that Vertex AI will run.

a) Enable Container Registry
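
One way to enable it, assuming the gcloud CLI (you can also use the Cloud Console):

gcloud services enable containerregistry.googleapis.com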

b) Authenticate your local docker cli with your GCP container registry:
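
If your local Docker is not yet authenticated against GCR, the usual way is the standard gcloud helper:

gcloud auth configure-docker

You can then verify the setup by pushing a test image: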

docker pull busybox
docker tag busybox gcr.io/<PROJECT-ID>/busybox
docker push gcr.io/<PROJECT-ID>/busybox

5. Enable Vertex AI API

To be able to use custom Vertex AI jobs, you first need to enable the Vertex AI API in your Google Cloud project.
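
You can do this from the Cloud Console or, assuming the gcloud CLI, with:

gcloud services enable aiplatform.googleapis.com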

👣 Set up the components required for the ZenML stack

Set a GCP bucket as your artifact store:

zenml artifact-store register <NAME> --flavor=gcp --path=<GCS_BUCKET_PATH>

# Example:
zenml artifact-store register gcp-store --flavor=gcp --path=gs://zenml-bucket

Create a Vertex step operator:

zenml step-operator register <NAME> \
    --flavor=vertex \
    --project=<PROJECT-ID> \
    --region=<REGION> \
    --machine_type=<MACHINE-TYPE> \
    --accelerator_type=<ACCELERATOR-TYPE> \
    --accelerator_count=<ACCELERATOR-COUNT> \
    --service_account_path=<SERVICE-ACCOUNT-KEY-FILE-PATH>

# Example:
zenml step-operator register vertex \
    --flavor=vertex \
    --project=zenml-core \
    --region=europe-west1 \
    --machine_type=n1-standard-4 \
    --accelerator_type=NVIDIA_TESLA_K80 \
    --accelerator_count=1 \
    --service_account_path=credentials.json

A list of available machine types can be found in the GCP documentation.

Register a container registry:

zenml container-registry register <NAME> --type=default --uri=gcr.io/<PROJECT-ID>/<IMAGE>

# Example:
zenml container-registry register gcr_registry --type=default --uri=gcr.io/zenml-vertex-ai/busybox

Register a remote MLflow tracking server:

zenml experiment-tracker register <NAME> --flavor=mlflow --tracking_uri=<MLFLOW-TRACKING-SERVER-URI> --tracking_username=<USERNAME> --tracking_password=<PASSWORD>

# Example:
zenml experiment-tracker register mlflow --flavor=mlflow --tracking_uri=http://mlflow_zenml_yolo:5000 --tracking_username=admin --tracking_password=admin
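
If you don't have a remote MLflow tracking server yet, you can start a minimal one yourself with the standard mlflow CLI (the URI and credentials above are placeholders you should replace with your server's values):

mlflow server --host 0.0.0.0 --port 5000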

Register the new stack (change names accordingly):

zenml stack register vertex_mlflow_stack \
    -o default \
    -c gcr_registry \
    -a gcp-store \
    -s vertex \
    -e mlflow

View all your stacks: zenml stack list

Activate the stack:

zenml stack set vertex_mlflow_stack

Install relevant integrations:

zenml integration install bentoml gcp -y
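
The stack also uses an MLflow experiment tracker, so depending on your environment you may need the MLflow integration as well:

zenml integration install mlflow -y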

In order to run the project, simply run:

python run.py -c train_and_deploy_and_predict

📙 Resources & References

A blog explaining this project in depth is forthcoming!

If you prefer video, there is also a recorded walkthrough that explains the project.

Training Pipeline

The training pipeline is made up of the following steps:

  • data_loader.py: Loads the data from the Roboflow platform using the given API key and saves it to the artifact store as a dictionary containing all information about each set, using the zenml.artifacts.DatasetArtifact class.
  • train_augmenter.py: Loads the training set data from the artifact store and performs data augmentation using the albumentations library. It then saves the augmented data to the artifact store as a zenml.artifacts.DatasetArtifact class.
  • vald_augmenter.py: Loads the validation set data from the artifact store and performs data augmentation using the albumentations library. It then saves the augmented data to the artifact store as a zenml.artifacts.DatasetArtifact class.
  • trainer.py: Loads the augmented training and validation data from the artifact store and trains the model using the yolov5 library in a custom Vertex AI job, tracking the training process with the mlflow library. It then saves the trained model to the artifact store as a zenml.artifacts.ModelArtifact class.

Deployment Pipeline

The Deployment pipeline is made up of the following steps:

  • model_loader.py: Loads the trained model from the previously trained pipeline and saves it locally.
  • deployment_triggeer.py: Triggers the deployment process once the model is loaded locally.
  • bento_builder.py: Builds a BentoML bundle from the model, saves it to the artifact store, and passes it to the next step, bento_deployer.py.
  • bento_deployer.py: Deploys the BentoML bundle to the Vertex AI endpoint.

Inference Pipeline

The Inference pipeline is made up of the following steps:

  • inference_loader.py: Loads the test set data from the first step of the training pipeline and saves it locally.
  • prediction_service_loader.py: Loads the ZenML prediction service in order to make predictions on the test set data.
  • predictor.py: Runs the prediction service on the test set data and prints the results.
