You have to be logged in to leave a comment.

🚀 MLflow Model Serving - Complete Guide

This guide provides all the commands needed to set up and run the iris model serving API locally and with Docker.

🏗️ Quick Setup (Automated)

Run the automated setup script:

./setup.sh

📋 Manual Setup Commands

1. Install Dependencies

pip install -r requirements.txt

2. Train the Model

python simple_train.py

3. Start the API Server

python app.py

The API will be available at:

Main API: http://localhost:8000
Interactive Docs: http://localhost:8000/docs
OpenAPI Schema: http://localhost:8000/openapi.json

4. Test the API

Test Welcome Endpoint

curl http://localhost:8000/

Test Prediction Endpoint

curl -X POST "http://localhost:8000/predict" \
     -H "Content-Type: application/json" \
     -d '{"features": [5.1, 3.5, 1.4, 0.2]}'

Expected response:

{
  "prediction": 0,
  "features": [5.1, 3.5, 1.4, 0.2]
}

🐳 Docker Deployment

1. Build Docker Image

docker build -t iris-model-api .

2. Run Docker Container

docker run -p 8000:8000 iris-model-api

3. Test Docker Container

# Test welcome endpoint
curl http://localhost:8000/

# Test prediction
curl -X POST "http://localhost:8000/predict" \
     -H "Content-Type: application/json" \
     -d '{"features": [5.9, 3.0, 5.1, 1.8]}'

🌐 Cloud Deployment Options

Deploy to Heroku

# Install Heroku CLI first
heroku create your-iris-api
git push heroku main

Deploy to Railway

# Connect your GitHub repo to Railway
# Set PORT environment variable to 8000

Deploy to Google Cloud Run

gcloud run deploy iris-api \
  --source . \
  --platform managed \
  --region us-central1 \
  --allow-unauthenticated

📊 API Endpoints

GET `/`

Returns a welcome message.

Response:

{
  "message": "Welcome to the Iris Model Prediction API"
}

POST `/predict`

Makes predictions using the trained iris model.

Request Body:

{
  "features": [5.1, 3.5, 1.4, 0.2]
}

Response:

{
  "prediction": 0,
  "features": [5.1, 3.5, 1.4, 0.2]
}

Iris Classes:

0: Iris Setosa
1: Iris Versicolor
2: Iris Virginica

🔧 Troubleshooting

Model Not Found Error

If you get a "Model not loaded" error:

# Train a new model
python simple_train.py

# Restart the API
python app.py

Port Already in Use

If port 8000 is busy:

# Kill existing processes
pkill -f "python app.py"

# Or use a different port
uvicorn app:app --host 0.0.0.0 --port 8001

Docker Build Issues

Ensure you have:

Docker installed and running
All files in the current directory
Model file exists in models/iris_model.pkl

📈 Performance Testing

Load Testing with curl

# Simple load test
for i in {1..100}; do
  curl -s -X POST "http://localhost:8000/predict" \
       -H "Content-Type: application/json" \
       -d '{"features": [5.1, 3.5, 1.4, 0.2]}' &
done
wait

Using Apache Bench

# Install apache2-utils first
ab -n 1000 -c 10 -T "application/json" \
   -p test_payload.json http://localhost:8000/predict

Create test_payload.json:

{ "features": [5.1, 3.5, 1.4, 0.2] }

🔍 Monitoring

Health Check Endpoint

Add this to your app.py for monitoring:

@app.get("/health")
async def health_check():
    return {
        "status": "healthy",
        "model_loaded": model is not None,
        "timestamp": datetime.now().isoformat()
    }

View Logs

# Docker logs
docker logs <container_id>

# Local logs
tail -f /var/log/iris-api.log

🚀 Next Steps

Add Authentication: Implement API keys or JWT tokens
Add Logging: Use structured logging for better monitoring
Add Caching: Cache predictions for repeated requests
Add Batch Processing: Support multiple predictions in one request
Add Model Versioning: Support A/B testing with multiple models
Add Metrics: Implement Prometheus metrics for monitoring

📝 Example Integration

Python Client

import requests

def predict_iris(features):
    response = requests.post(
        "http://localhost:8000/predict",
        json={"features": features}
    )
    return response.json()

# Usage
result = predict_iris([5.1, 3.5, 1.4, 0.2])
print(f"Predicted class: {result['prediction']}")

JavaScript Client

async function predictIris(features) {
  const response = await fetch("http://localhost:8000/predict", {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ features: features }),
  });
  return await response.json();
}

// Usage
predictIris([5.1, 3.5, 1.4, 0.2]).then((result) =>
  console.log("Prediction:", result.prediction)
);

🎯 Summary

You now have a complete model serving solution that:

✅ Loads a trained scikit-learn model
✅ Serves predictions via REST API
✅ Includes interactive documentation
✅ Can be containerized with Docker
✅ Ready for cloud deployment
✅ Includes comprehensive testing commands

The API is production-ready and can handle real-time predictions for the Iris dataset classification task.

Tip!

Press p or to see the previous file or, n or to see the next file

yahiaehab10 / MLFlow_demo connected to https://github.com/yahiaehab10/MLFlow_demo.git

SERVING_GUIDE.md 5.2 KB Permalink History Raw

🚀 MLflow Model Serving - Complete Guide

🏗️ Quick Setup (Automated)

📋 Manual Setup Commands

1. Install Dependencies

2. Train the Model

3. Start the API Server

4. Test the API

Test Welcome Endpoint

Test Prediction Endpoint

🐳 Docker Deployment

1. Build Docker Image

2. Run Docker Container

3. Test Docker Container

🌐 Cloud Deployment Options

Deploy to Heroku

Deploy to Railway

Deploy to Google Cloud Run

📊 API Endpoints

GET /

POST /predict

🔧 Troubleshooting

Model Not Found Error

Port Already in Use

Docker Build Issues

📈 Performance Testing

Load Testing with curl

Using Apache Bench

🔍 Monitoring

Health Check Endpoint

View Logs

🚀 Next Steps

📝 Example Integration

Python Client

JavaScript Client

🎯 Summary

Comments

Use AWS S3 as storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use Google Cloud Storage!

Specify your Google Storage bucket

Service Account Key

Congratulations!

Use Azure Cloud Storage!

Specify your Azure Storage bucket

Access key (If needed)

Congratulations!

Use any S3 compatible storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

yahiaehab10
/
MLFlow_demo
connected to https://github.com/yahiaehab10/MLFlow_demo.git

SERVING_GUIDE.md 5.2 KB

Permalink History Raw

GET `/`

POST `/predict`