
🧠 Brain Tumor Classification with MRI Scans

This repository contains a machine learning project for classifying brain tumors using MRI images. The model is trained to detect and categorize four types of brain conditions from axial brain scan images.

👤 Author

Daniel Egbo

🧩 Problem Statement

Brain tumors pose a serious health challenge, requiring timely and accurate diagnosis to improve treatment outcomes. Magnetic Resonance Imaging (MRI), especially T1-weighted contrast-enhanced scans, is widely used for brain tumor detection. However, manual interpretation of MRI scans is time-consuming and can be subject to inter-observer variability.

This project aims to automate the classification of brain tumors from MRI images using a machine learning model. The objective is to accurately categorize axial brain scans into one of four classes — glioma, meningioma, pituitary tumor, or no tumor — thereby assisting radiologists in diagnosis and reducing diagnostic delays.

🗂️ Dataset Description

The dataset used in this project is sourced from the Kaggle Brain Tumor MRI Dataset. It consists of T1-weighted contrast-enhanced MRI images captured in the axial plane. The images are grouped into four categories, each representing a distinct medical condition:

glioma – a type of tumor that arises from glial cells in the brain.
meningioma – typically a slow-growing tumor that forms on the meninges, the membranes covering the brain and spinal cord.
pituitary – tumors originating in the pituitary gland, located at the base of the brain.
notumor – MRI scans that show no evidence of tumor.

Each category is stored in a separate subdirectory, and the images are in JPEG format. The dataset is balanced and suitable for supervised image classification tasks.
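As a quick sanity check on the layout described above, the following minimal sketch counts the JPEG files in each class subdirectory. The root path `data/Training` and the helper name `count_images_per_class` are assumptions for illustration, not names taken from the repository.

```python
from pathlib import Path

# Class names as listed in the dataset description above.
CLASSES = ("glioma", "meningioma", "notumor", "pituitary")
JPEG_SUFFIXES = {".jpg", ".jpeg"}


def count_images_per_class(root):
    """Return {class_name: number of JPEG files} for each class subdirectory."""
    root = Path(root)
    counts = {}
    for name in CLASSES:
        class_dir = root / name
        if not class_dir.is_dir():
            counts[name] = 0  # a missing folder is reported as zero images
            continue
        counts[name] = sum(
            1
            for path in class_dir.iterdir()
            if path.is_file() and path.suffix.lower() in JPEG_SUFFIXES
        )
    return counts


if __name__ == "__main__":
    # "data/Training" is a hypothetical location; adjust to where the data lives.
    for name, n in count_images_per_class("data/Training").items():
        print(f"{name}: {n}")
```

Comparing the four counts is a simple way to confirm how balanced a given split actually is before training.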

✅ Requirements

This project is built using a modern MLOps stack and requires the following tools and libraries:

Python 3.10+ — Core programming language for data preprocessing, model training, and orchestration
torch — Deep learning framework used to build and train the brain tumor classification model
torchvision — Utilities for image transformations and loading image datasets
scikit-learn — Metrics, evaluation tools, and utilities for model validation
MLflow — For experiment tracking, model logging, and registry management
prefect — Workflow orchestration to manage the ML pipeline as reproducible tasks and flows
Docker — Containerization of the training and inference environments
AWS ECR (Elastic Container Registry) — Storage for Docker images used in deployment
AWS ECS (Elastic Container Service) — For deploying the trained model as a scalable web service

🚀 Getting Started

1. Clone the repository

git clone https://github.com/Danselem/brain_mri.git
cd brain_mri

The project uses a Makefile and Astral uv. See the Astral uv documentation for details of the package and how to install it.

2. Create and activate a virtual environment

To create and activate an environment:

make init

3. ⚙️ Install dependencies

make install

4. Fetch Data

make fetch-data

This fetches the data from Kaggle and stores it in the data directory. Ensure you have a Kaggle account and have set up your API key.
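Before running the fetch step, it can help to confirm that Kaggle credentials are visible. The sketch below checks the commonly documented locations: the `KAGGLE_USERNAME` and `KAGGLE_KEY` environment variables, or a `kaggle.json` file under `~/.kaggle` (or under `KAGGLE_CONFIG_DIR` if set). The function name and its `env` and `home` parameters are hypothetical, and how `make fetch-data` itself reads credentials is not stated in this README.

```python
import os
from pathlib import Path


def kaggle_credentials_available(env=None, home=None):
    """Return True if Kaggle API credentials appear to be configured."""
    env = os.environ if env is None else env
    # Option 1: credentials supplied through environment variables.
    if env.get("KAGGLE_USERNAME") and env.get("KAGGLE_KEY"):
        return True
    # Option 2: a kaggle.json token file, by default under ~/.kaggle,
    # or under KAGGLE_CONFIG_DIR when that variable is set.
    config_dir = env.get("KAGGLE_CONFIG_DIR")
    if config_dir:
        base = Path(config_dir)
    else:
        base = (Path(home) if home is not None else Path.home()) / ".kaggle"
    return (base / "kaggle.json").is_file()


if __name__ == "__main__":
    if kaggle_credentials_available():
        print("Kaggle credentials found.")
    else:
        print("No Kaggle credentials found; set up your API key first.")
```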

5. Set up MLflow server

There are two options for setting up MLflow:

Use AWS EC2 and S3 – Ensure Terraform is installed on your PC and your AWS credentials are configured with aws configure. Next, cd infra and follow the instructions in infra for a complete setup of AWS resources, including EC2, RDS, S3, Kinesis, Lambda, etc.
Use DagsHub – Sign up at DagsHub, obtain an API key, and create a project repo. After that, run the command to create a .env file:

make env

Next, fill the .env file with the right information.
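A small check like the one below can catch an incomplete .env file before the pipeline starts. It is only a sketch: the three keys are standard MLflow environment variables, but the keys that this project's .env template actually expects are not listed in this README, so treat `REQUIRED_KEYS` as an assumption and adjust it to match the generated file.

```python
from pathlib import Path

# Assumed keys: standard MLflow environment variables. The project's own
# .env template may use different or additional names.
REQUIRED_KEYS = (
    "MLFLOW_TRACKING_URI",
    "MLFLOW_TRACKING_USERNAME",
    "MLFLOW_TRACKING_PASSWORD",
)


def parse_env_text(text):
    """Parse simple KEY=VALUE lines, skipping blanks and # comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(values, required=REQUIRED_KEYS):
    """Return the required keys that are absent or empty."""
    return [key for key in required if not values.get(key)]


if __name__ == "__main__":
    env_path = Path(".env")
    if env_path.is_file():
        print("Missing keys:", missing_keys(parse_env_text(env_path.read_text())))
    else:
        print("No .env file found in the current directory.")
```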

6. Start the orchestrator

This project uses Prefect to run the ML pipeline. To start the Prefect server, run the command:

make prefect

This will start a Prefect server running at http://127.0.0.1:4200.
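To confirm the server is reachable before launching the pipeline, a minimal sketch such as the following can be used. It assumes the default local address http://127.0.0.1:4200 and that the server exposes a health endpoint at `/api/health`, as recent Prefect versions do; the function names are hypothetical.

```python
import http.client
import urllib.request


def health_url(base_url="http://127.0.0.1:4200"):
    """Build the health-check URL for a Prefect server base URL."""
    return base_url.rstrip("/") + "/api/health"


def server_is_up(base_url="http://127.0.0.1:4200", timeout=2.0):
    """Return True if the health endpoint answers with HTTP 200."""
    try:
        with urllib.request.urlopen(health_url(base_url), timeout=timeout) as resp:
            return resp.status == 200
    except (OSError, http.client.HTTPException):
        # Connection refused, timeout, HTTP error, or malformed response.
        return False


if __name__ == "__main__":
    print("Prefect server reachable:", server_is_up())
```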

7. Run the ML Pipeline

To run the pipeline:

make pipeline

This loads the data, transforms it, and starts the hyperparameter tuning. See the image below for the Prefect modeling pipeline.

It also logs the ML experiments in DagsHub and registers the best model. For example, see below.

All experiments run for this project can be accessed in DagsHub.
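The load, transform, and tune stages of the pipeline can be pictured with a small plain-Python sketch. It uses an exhaustive grid search and a toy scoring function as stand-ins; the project's actual tuning strategy, search space, and Prefect task names are not shown in this README, so every name below is hypothetical.

```python
from itertools import product


def load_data():
    # Stand-in for reading MRI images: (pixel_value, label) pairs.
    return [(0, 0), (64, 0), (192, 1), (255, 1)]


def transform(samples):
    # Scale pixel values to [0, 1], similar in spirit to image normalization.
    return [(x / 255.0, y) for x, y in samples]


def evaluate(samples, threshold, flip):
    # Toy "model": predict class 1 when the value exceeds the threshold,
    # optionally flipping the prediction. Returns accuracy.
    correct = 0
    for x, y in samples:
        pred = int(x > threshold)
        if flip:
            pred = 1 - pred
        correct += int(pred == y)
    return correct / len(samples)


def tune(samples, grid):
    # Exhaustive grid search: score every combination, keep the best one.
    best_params, best_score = None, -1.0
    keys = sorted(grid)
    for combo in product(*(grid[k] for k in keys)):
        params = dict(zip(keys, combo))
        score = evaluate(samples, **params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


def pipeline():
    samples = transform(load_data())
    grid = {"threshold": [0.1, 0.5, 0.9], "flip": [False, True]}
    return tune(samples, grid)


if __name__ == "__main__":
    print(pipeline())
```

In an orchestrated version, each of these functions would typically become a task and `pipeline` the flow that wires them together, with the best parameters and score logged to the tracking server.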

8. Fetch and serve the best model

make fetch-best-model

The above command fetches the registered model from the DagsHub MLflow server and saves it in the models directory. With this, we are ready to serve the model.

9. Serve the model locally

Test the local deployment

make serve_local

10. Build the Docker container

make build

11. Start and run the Docker container

make run

12. Push the container to AWS ECR

make ecr

This uses the ecr bash script to create the container and push it to AWS ECR. Here is the sample below:

13. Deploy the container to AWS ECS

make ecs

This uses the ecs bash script to deploy the container to AWS ECS. Here is the sample below:

🧪 Testing with Pytest

To test your setup and run the unit tests:

make test

📊 Evaluation

Accuracy and loss plots
Confusion matrix

Performance metrics are saved in the MLflow server.
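For readers unfamiliar with these metrics, the sketch below shows how a confusion matrix and accuracy are computed for the four classes. It is a pure-Python stand-in written for illustration, not the project's evaluation code; it follows the same convention as scikit-learn's `confusion_matrix`, with rows for true labels and columns for predicted labels.

```python
CLASSES = ["glioma", "meningioma", "notumor", "pituitary"]


def confusion_matrix(y_true, y_pred, labels=CLASSES):
    """Return a matrix where entry [i][j] counts true class i predicted as j."""
    index = {label: i for i, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for true_label, pred_label in zip(y_true, y_pred):
        matrix[index[true_label]][index[pred_label]] += 1
    return matrix


def accuracy(matrix):
    """Fraction of samples on the diagonal (correct predictions)."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return correct / total if total else 0.0


if __name__ == "__main__":
    # Made-up labels purely to exercise the functions.
    y_true = ["glioma", "glioma", "notumor", "pituitary"]
    y_pred = ["glioma", "meningioma", "notumor", "pituitary"]
    cm = confusion_matrix(y_true, y_pred)
    for label, row in zip(CLASSES, cm):
        print(f"{label:>10}: {row}")
    print("accuracy:", accuracy(cm))
```

Off-diagonal entries show which classes are confused with each other, which a single accuracy number hides.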

📚 References

Masoud Nickparvar, Brain Tumor MRI Dataset – Kaggle Dataset
Related works on medical image classification with deep learning

📜 License

This project is licensed under the MIT License and is intended for educational and research purposes only. Please refer to the dataset's license on Kaggle for usage terms.

🙋🏽‍♀️ Contact

Made with 💻 by Daniel Egbo. Feel free to reach out with questions, issues, or suggestions.
