Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
3acde0d4da
updated
1 month ago
3acde0d4da
updated
1 month ago
f0d88c8039
mlflow updated for ydf
3 months ago
f830366300
documentation
1 month ago
6c793e12cb
model rerun
1 month ago
6c793e12cb
model rerun
1 month ago
8ade705bef
experiment updated
1 month ago
src
13ca0dafc1
read_html function added to utils
2 months ago
74c108a7c4
prediction completed
3 months ago
3acde0d4da
updated
1 month ago
77be392fbf
removed model and reports from dvc
3 months ago
3acde0d4da
updated
1 month ago
78874064b7
ydf added
3 months ago
ca73fac6f7
readme updated
3 months ago
3acde0d4da
updated
1 month ago
e9cc54884b
dvc updated
1 month ago
3acde0d4da
updated
1 month ago
85446ac015
requiements updated
1 month ago
41f2190f5c
updated readme
3 months ago
09850cd0ba
dagshub integrated
3 months ago
05c12219d5
mlfow rerun
3 months ago
3acde0d4da
updated
1 month ago
05c12219d5
mlfow rerun
3 months ago
3 months ago
1b60eb4c72
dvc integrated
3 months ago
Storage Buckets
Data Pipeline
Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

README.md

You have to be logged in to leave a comment. Sign In

Thyroid-Disease-Prediction 🩺

Thyroid Disease Prediction ML project

Data Description ℹ️

From Garavan Institute
Documentation: as given by Ross Quinlan
6 databases from the Garavan Institute in Sydney, Australia
Approximately the following for each database:

    ** 2800 training (data) instances and 972 test instances
    ** Plenty of missing data
    ** 29 or so attributes, either Boolean or continuously-valued 

2 additional databases, also from Ross Quinlan, are also here

    ** Hypothyroid.data and sick-euthyroid.data
    ** Quinlan believes that these databases have been corrupted
    ** Their format is highly similar to the other databases 

Workflows 🔧

1. Update config.yaml
2. Update schema.yaml
3. Update params.yaml
4. Update entity
5. Update configuration manager (configuration.py) in src config
6. Update components
7. Update pipeline
8. Update the main.py
9. Update the app.py

How to run the project

Static Badge

  1. Clone the project
git clone https://github.com/tejas05in/Thyroid-Disease-Prediction.git
  1. Change into the project directory
cd /Thyroid-Disease-Prediction
  1. Create a conda environment
conda create -p env python==3.11.4 -y
  1. Activate the conda environment
conda activate env/
  1. Install the requirements
pip install -r requirements.txt
  1. Start the streamlit app
streamlit run app.py
# This will redirect you to the end point in your browser

Static Badge

Initialization of training pipeline

python main.py

MLflow experiments 🔱

Optional: Run this to export the environemnt variables which will log the results in mlflow at dagshub:

Static Badge

export MLFLOW_TRACKING_URI=https://dagshub.com/tejas05in/Thyroid-Disease-Prediction.mlflow \
export MLFLOW_TRACKING_USERNAME=tejas05in \
export MLFLOW_TRACKING_PASSWORD=9efcb5c7b79d0e949378459b922b1462a80fa413

If the above variables are not exported then mlfow will store the results locally and it can be accessed by passing

mlflow ui --port 5000

alt text

docker pull tejas05in/tdpapp
docker run -p 5000:5000 tdpapp:latest

Model Drift Reports and Tests

  • Drift Report : Contains information about the dataset and model drift parameters
  • Tests : Contains information about the various tests performed on the dataset , model and features

Dagshub Repository :

Repo link : Directs you to the Dagshub repository

Pipline Version Control :

Static Badge

Tip!

Press p or to see the previous file or, n or to see the next file

About

Thyroid Disease Prediction ML project

Collaborators 1

Comments

Loading...