GIF Analyzer
Table of contents
- Introduction
- Architecture
- Contributing
Introduction
GIF Analyzer is a computer vision project that recognizes which TV show a GIF came from.
When designing this project, I had the following goals in mind:
- Apply transfer learning to video classification models (hey, GIFs are the simplest videos)
- Deploy the model as an API on AWS using only always-free resources (and no cheating with free trials)
- Automate a CI/CD pipeline for model serving using free tools (alas, AWS automatic deployment requires paid S3 buckets)
Architecture
The above design goals guided my architecture, as shown in the diagram below.
The main building blocks are as follows:
- TensorFlow: it includes a high-level Keras API that is easy to work with, and has plenty of tutorials about transfer learning for video classification
- ONNX: it is a format for machine learning models that is optimized for fast inference and also has a small runtime library
- MLflow: it is a popular MLOps tool which can track, version control, and serve machine learning models
- DagsHub: it is a version control platform for data scientists that generously hosts an MLflow server and many other fancy tools (DVC, Label Studio, FiftyOne) for every repository
- FastAPI: it is one of the fastest Python frameworks for building APIs and has become a de facto industry standard
- AWS Lambda: it is the only always-free compute resource on AWS; note that a Lambda deployment package is capped at 250 MB (unzipped), which has to fit all your library dependencies
- Terraform: it is a go-to tool for infrastructure as code solutions and makes automating cloud deployment a breeze
- GitHub Actions: it is a built-in CI/CD tool on GitHub that has a large number of predefined actions on their marketplace
Training workflow
Training of the machine learning model is represented on the left side of the diagram and would typically be performed by an ML Scientist, if the team has one.
The training workflow consists of the following steps, which are usually done in a Jupyter notebook (I prefer the VS Code version):
- Model experimentation that includes data collection (using Giphy API), data preprocessing (using Pillow), and model training (using TensorFlow Keras API)
- Model conversion to the ONNX format once the candidate model is chosen
- Model registration in the MLflow model registry (using DagsHub URI) and assigning it a challenger alias
Serving workflow
Serving of the machine learning model is represented on the right side of the diagram and would typically be performed by an ML Engineer.
The serving workflow consists of the following steps, which are usually done in an IDE of your choice (I prefer VS Code):
- Coding an API (using FastAPI) and adding an ASGI wrapper for AWS Lambda (using Mangum)
- Writing deployment configuration to AWS Lambda (using Terraform and AWS modules)
- Setting up a CI/CD pipeline (using GitHub Actions) that will load the model from MLflow, deploy it as an API to staging/production after successful testing, and finally update its status in MLflow
Contributing
If there are any open Issues that you would like to work on, please reach out to me on LinkedIn and I can add you as a collaborator. Alternatively, you can fork the repository and submit a pull request after completing the issue. If you choose to fork, you won't be able to contribute to the serving workflow unless you create your own Terraform and AWS accounts.
Environment
Make sure that you have Conda installed (I prefer Miniforge).
To install the development environment, run the following in your terminal:
conda env create -f environment.yml
Some tools, like the Terraform AWS modules, don't work on Windows, so I highly recommend using WSL2 if you are on a Windows machine.
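For orientation, an environment file for this stack might look roughly like the sketch below. The package names follow the tools listed above, but the exact contents and versions of the repository's environment.yml are assumptions, not the pinned file.

```yaml
# Illustrative sketch only; see environment.yml in the repo for the
# authoritative, pinned dependency list.
name: gif-analyzer
channels:
  - conda-forge
dependencies:
  - python
  - pip
  - pip:
      - tensorflow
      - tf2onnx
      - mlflow
      - fastapi
      - mangum
      - onnxruntime
      - pillow
```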