Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
Integration:  dvc git
Arjun Vikram e5d4d941ca
Include instructions for Docker-ECS integration deployment
1 year ago
471f4b6e1c
update dvc remote
2 years ago
3805020d34
renamed folders
2 years ago
8e051f419c
minor processing code optimization
1 year ago
e5d4d941ca
Include instructions for Docker-ECS integration deployment
1 year ago
966f2f19d0
removed summary, updated classification threshold
1 year ago
77e47a0526
no directory bug fix
1 year ago
src
3de7e3c30e
updated colab setup scripts
1 year ago
3805020d34
renamed folders
2 years ago
0929a309c5
Update DVC remote
2 years ago
b3fff765de
resolves #51
1 year ago
bce95ec156
init Git and DVC
2 years ago
72438f8522
Add note about deployment to README
1 year ago
966f2f19d0
removed summary, updated classification threshold
1 year ago
fdedcebc5a
1 epoch training
2 years ago
4f6eb43b5e
Add Streamlit client for remote deployment on SageMaker and API
1 year ago
891b90ab8b
sidebar expanded
2 years ago
Storage Buckets
Data Pipeline
Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

README.md

You have to be logged in to leave a comment. Sign In

CheXNet

A Python3 (TensorFlow) reimplementation of CheXNet paper - Classification and Localization of Thoracic Diseases

Open In Colab

Prerequisites

  • Python 3.8+

  • TensorFlow 2.5+

  • All the specified requirements in the text file

Deployment

This model is deployed with a REST API at http://chexnet-1.public.dagshubusercontent.com:8080/invocations. A streamlit client for the deployed model is provided in deployment/src/streamlit_client.py

To deploy this model yourself, follow the instructions in deployment/README.md.

Training

  1. Clone this repository.

  2. Install requirements.txt using pip install -r requirements.txt.

  3. Use DVC to pull the files that are stored on the DagsHub remote storage by running dvc pull

  4. Modify the code as you wish.

  5. Run dvc repro to run the pipeline and train the model.

The resultant heatmaps from the evaluation stages should look something like 👇

target-image

The ChestX-ray14 Dataset

[comment]: The 45GB dataset comprises 112,120 frontal-view chest X-ray images organized 14 different folders, based on individual tarballs. There is a master list of images with target labels, image IDs, patient IDs alongside additional patient metadata. The dataset is available on the Kaggle platform.

1GB-Dataset

To use a smaller, subsampled dataset, run the following commands from the repository's root directory:

git update-index --assume-unchanged data_labeling/data.dvc
curl https://dagshub.com/nirbarazida/CheXNet/raw/d74cccbd0957c41dac7560565d2b7b4df3c0d195/data_labeling/data.dvc -o data_labeling/data.dvc

Acknowledgements

Note: *If you are adding/removing/moving files to different directories, it can affect the DVC pipeline, and therefore the *dvc repro command might not run properly.

Tip!

Press p or to see the previous file or, n or to see the next file

About

Classifying chest X-Ray images for Pneumonia

Collaborators 3

Comments

Loading...