You have to be logged in to leave a comment.

Easily train or fine-tune SOTA computer vision models with one open source training library

Fill our 4-question quick survey! We will raffle free SuperGradients swag between those who will participate -> Fill Survey

Website • Why Use SG? • User Guide • Docs • Getting Started Notebooks • Transfer Learning • Pretrained Models • Community • License • Deci Platform

SuperGradients

Introduction

Welcome to SuperGradients, a free, open-source training library for PyTorch-based deep learning models. SuperGradients allows you to train or fine-tune SOTA pre-trained models for all the most commonly applied computer vision tasks with just one training library. We currently support object detection, image classification and semantic segmentation for videos and images.

Docs and full user guide

Why use SuperGradients?

Built-in SOTA Models

Easily load and fine-tune production-ready, pre-trained SOTA models that incorporate best practices and validated hyper-parameters for achieving best-in-class accuracy.

Easily Reproduce our Results

Why do all the grind work, if we already did it for you? leverage tested and proven recipes & code examples for a wide range of computer vision models generated by our team of deep learning experts. Easily configure your own or use plug & play hyperparameters for training, dataset, and architecture.

Production Readiness and Ease of Integration

All SuperGradients models’ are production ready in the sense that they are compatible with deployment tools such as TensorRT (Nvidia) and OpenVINO (Intel) and can be easily taken into production. With a few lines of code you can easily integrate the models into your codebase.

What's New

【07/08/2022】DDRNet23 - new pre-trained checkpoints and recipes for Cityscapes with SOTA mIoU scores (~1% above paper)🎯
【27/07/2022】YOLOX models (object detection) - recipes and pre-trained checkpoints.
【07/07/2022】SSD Lite MobileNet V2,V1 - Training recipes and pre-trained checkpoints on COCO - Tailored for edge devices! 📱
【07/07/2022】 STDC - new pre-trained checkpoints and recipes for Cityscapes with super SOTA mIoU scores (~2.5% above paper)🎯
【16/06/2022】 ResNet50 - new pre-trained checkpoint and recipe for ImageNet top-1 score of 81.9 💪
【09/06/2022】 ViT models (Vision Transformer) - Training recipes and pre-trained checkpoints (ViT, BEiT).
【09/06/2022】 Knowledge Distillation support - training module and notebook.
【06/04/2022】 Integration with professional tools - Weights and Biases and DagsHub.
【09/03/2022】 New quick start and transfer learning example notebooks for Semantic Segmentation.
【07/02/2022】 We added RegSeg recipes and pre-trained models to our Semantic Segmentation models.
【01/02/2022】 We added issue templates for feature requests and bug reporting.
【20/01/2022】 STDC family - new recipes added with even higher mIoU💪

Check out SG full release notes.

Coming soon

Single class detectors (recipes, pre-trained checkpoints) for edge devices deployment.
Single class segmentation (recipes, pre-trained checkpoints) for edge devices deployment.
QAT capabilities (Quantization Aware Training).
Dali implementation.
Integration with more professional tools.
Improved pre-trained checkpoints and recipes (DDRNet, ResNet, RegSeg, etc.)

Table of Content

Getting Started
- Quick Start Notebook - Classification example
- Quick Start Notebook - Semantic segmentation example

Transfer Learning
- Transfer Learning with SG Notebook - Semantic segmentation example
Knowledge Distillation Training
- Knowledge Distillation Training Quick Start with SG Notebook - ResNet18 example
Installation Methods
- Prerequisites
- Quick Installation
Computer Vision Models - Pretrained Checkpoints
Implemented Model Architectures
Contributing
Citation
Community
License
Deci Platform

Getting Started

Start Training with Just 1 Command Line

The most simple and straightforward way to start training SOTA performance models with SuperGradients reproducible recipes. Just define your dataset path and where you want your checkpoints to be saved and you are good to go from your terminal!

python -m super_gradients.train_from_recipe --config-name=imagenet_regnetY architecture=regnetY800 dataset_interface.data_dir=<YOUR_Imagenet_LOCAL_PATH> ckpt_root_dir=<CHEKPOINT_DIRECTORY>

Quickly Load Pre-Trained Weights for Your Desired Model with SOTA Performance

Want to try our pre-trained models on your machine? Import SuperGradients, initialize your Trainer, and load your desired architecture and pre-trained weights from our SOTA model zoo

# The pretrained_weights argument will load a pre-trained architecture on the provided dataset
# This is an example of loading COCO-2017 pre-trained weights for a YOLOX Nano object detection model
    
import super_gradients
from super_gradients.training import Trainer

trainer = SgModel(experiment_name="yoloxn_coco_experiment",ckpt_root_dir=<CHECKPOINT_DIRECTORY>)
trainer.build_model(architecture="yolox_n", arch_params={"pretrained_weights": "coco", num_classes": 80})

Quick Start Notebook - Classification

Get started with our quick start notebook for image classification tasks on Google Colab for a quick and easy start using free GPU hardware.

Classification Quick Start in Google Colab

Download notebook

View source on GitHub

Quick Start Notebook - Semantic Segmentation

Get started with our quick start notebook for semantic segmentation tasks on Google Colab for a quick and easy start using free GPU hardware.

Segmentation Quick Start in Google Colab

Download notebook

View source on GitHub

Transfer Learning

Transfer Learning with SG Notebook - Semantic Segmentation

Learn more about SuperGradients transfer learning or fine tuning abilities with our Citiscapes pre-trained RegSeg48 fine tuning into a sub-dataset of Supervisely example notebook on Google Colab for an easy to use tutorial using free GPU hardware

Segmentation Transfer Learning in Google Colab

Download notebook

View source on GitHub

Knowledge Distillation Training

Knowledge Distillation Training Quick Start with SG Notebook - ResNet18 example

Knowledge Distillation is a training technique that uses a large model, teacher model, to improve the performance of a smaller model, the student model. Learn more about SuperGradients knowledge distillation training with our pre-trained BEiT base teacher model and Resnet18 student model on CIFAR10 example notebook on Google Colab for an easy to use tutorial using free GPU hardware

KD Training in Google Colab

Download notebook

View source on GitHub

Installation Methods

Prerequisites

General requirements

Python 3.7, 3.8 or 3.9 installed.
torch>=1.9.0
- https://pytorch.org/get-started/locally/
The python packages that are specified in requirements.txt;

To train on nvidia GPUs

Nvidia CUDA Toolkit >= 11.2
CuDNN >= 8.1.x
Nvidia Driver with CUDA >= 11.2 support (≥460.x)

Quick Installation

Install stable version using PyPi

See in PyPi

pip install super-gradients

That's it !

Install using GitHub

pip install git+https://github.com/Deci-AI/super-gradients.git@stable

Computer Vision Models - Pretrained Checkpoints

Pretrained Classification PyTorch Checkpoints

Model	Dataset	Resolution	Top-1	Top-5	Latency (HW)*_T4	Latency (Production)**_T4	Latency (HW)*_{Jetson Xavier NX}	Latency (Production)**_{Jetson Xavier NX}	Latency _{Cascade Lake}
ViT base	ImageNet21K	224x224	84.15	-	4.46ms	4.60ms	- *	-	57.22ms
ViT large	ImageNet21K	224x224	85.64	-	12.81ms	13.19ms	- *	-	187.22ms
BEiT	ImageNet21K	224x224	-	-	-ms	-ms	- *	-	-ms
EfficientNet B0	ImageNet	224x224	77.62	93.49	0.93ms	1.38ms	- *	-	3.44ms
RegNet Y200	ImageNet	224x224	70.88	89.35	0.63ms	1.08ms	2.16ms	2.47ms	2.06ms
RegNet Y400	ImageNet	224x224	74.74	91.46	0.80ms	1.25ms	2.62ms	2.91ms	2.87ms
RegNet Y600	ImageNet	224x224	76.18	92.34	0.77ms	1.22ms	2.64ms	2.93ms	2.39ms
RegNet Y800	ImageNet	224x224	77.07	93.26	0.74ms	1.19ms	2.77ms	3.04ms	2.81ms
ResNet 18	ImageNet	224x224	70.6	89.64	0.52ms	0.95ms	2.01ms	2.30ms	4.56ms
ResNet 34	ImageNet	224x224	74.13	91.7	0.92ms	1.34ms	3.57ms	3.87ms	7.64ms
ResNet 50	ImageNet	224x224	81.91	93.0	1.03ms	1.44ms	4.78ms	5.10ms	9.25ms
MobileNet V3_large-150 epochs	ImageNet	224x224	73.79	91.54	0.67ms	1.11ms	2.42ms	2.71ms	1.76ms
MobileNet V3_large-300 epochs	ImageNet	224x224	74.52	91.92	0.67ms	1.11ms	2.42ms	2.71ms	1.76ms
MobileNet V3_small	ImageNet	224x224	67.45	87.47	0.55ms	0.96ms	2.01ms *	2.35ms	1.06ms
MobileNet V2_w1	ImageNet	224x224	73.08	91.1	0.46 ms	0.89ms	1.65ms *	1.90ms	1.56ms

NOTE:

Latency (HW)* - Hardware performance (not including IO)

Latency (Production)** - Production Performance (including IO)

Performance measured for T4 and Jetson Xavier NX with TensorRT, using FP16 precision and batch size 1

Performance measured for Cascade Lake CPU with OpenVINO, using FP16 precision and batch size 1

Pretrained Object Detection PyTorch Checkpoints

Model	Dataset	Resolution	mAP^val 0.5:0.95	Latency (HW)*_T4	Latency (Production)**_T4	Latency (HW)*_{Jetson Xavier NX}	Latency (Production)**_{Jetson Xavier NX}	Latency _{Cascade Lake}
SSD lite MobileNet v2	COCO	320x320	21.5	0.77ms	1.40ms	5.28ms	6.44ms	4.13ms
SSD lite MobileNet v1	COCO	320x320	24.3	1.55ms	2.84ms	8.07ms	9.14ms	22.76ms
YOLOX nano	COCO	640x640	26.77	2.47ms	4.09ms	11.49ms	12.97ms	-
YOLOX tiny	COCO	640x640	37.18	3.16ms	4.61ms	15.23ms	19.24ms	-
YOLOX small	COCO	640x640	40.47	3.58ms	4.94ms	18.88ms	22.48ms	-
YOLOX medium	COCO	640x640	46.4	6.40ms	7.65ms	39.22ms	44.5ms	-
YOLOX large	COCO	640x640	49.25	10.07ms	11.12ms	68.73ms	77.01ms	-

NOTE:

Latency (HW)* - Hardware performance (not including IO)

Latency (Production)** - Production Performance (including IO)

Latency performance measured for T4 and Jetson Xavier NX with TensorRT, using FP16 precision and batch size 1

Latency performance measured for Cascade Lake CPU with OpenVINO, using FP16 precision and batch size 1

Pretrained Semantic Segmentation PyTorch Checkpoints

Model	Dataset	Resolution	mIoU	Latency b1_T4	Latency b1_T4 including IO
DDRNet 23	Cityscapes	1024x2048	80.26	7.62ms	25.94ms
DDRNet 23 slim	Cityscapes	1024x2048	78.01	3.56ms	22.80ms
STDC 1-Seg50	Cityscapes	512x1024	75.11	2.83ms	12.57ms
STDC 1-Seg75	Cityscapes	768x1536	77.8	5.71ms	26.70ms
STDC 2-Seg50	Cityscapes	512x1024	76.44	3.74ms	13.89ms
STDC 2-Seg75	Cityscapes	768x1536	78.93	7.35ms	28.18ms
RegSeg (exp48)	Cityscapes	1024x2048	78.15	13.09ms	41.88ms
Larger RegSeg (exp53)	Cityscapes	1024x2048	79.2	24.82ms	51.87ms
ShelfNet LW 34	COCO Segmentation (21 classes from PASCAL including background)	512x512	65.1	-	-

NOTE: Performance measured on T4 GPU with TensorRT, using FP16 precision and batch size 1 (latency), and not including IO

Implemented Model Architectures

Image Classification

DensNet (Densely Connected Convolutional Networks) - Densely Connected Convolutional Networks https://arxiv.org/pdf/1608.06993.pdf
DPN - Dual Path Networks https://arxiv.org/pdf/1707.01629
EfficientNet - https://arxiv.org/abs/1905.11946
GoogleNet - https://arxiv.org/pdf/1409.4842
LeNet - https://yann.lecun.com/exdb/lenet/
MobileNet - Efficient Convolutional Neural Networks for Mobile Vision Applications https://arxiv.org/pdf/1704.04861
MobileNet v2 - https://arxiv.org/pdf/1801.04381
MobileNet v3 - https://arxiv.org/pdf/1905.02244
PNASNet - Progressive Neural Architecture Search Networks https://arxiv.org/pdf/1712.00559
Pre-activation ResNet - https://arxiv.org/pdf/1603.05027
RegNet - https://arxiv.org/pdf/2003.13678.pdf
RepVGG - Making VGG-style ConvNets Great Again https://arxiv.org/pdf/2101.03697.pdf
ResNet - Deep Residual Learning for Image Recognition https://arxiv.org/pdf/1512.03385
ResNeXt - Aggregated Residual Transformations for Deep Neural Networks https://arxiv.org/pdf/1611.05431
SENet - Squeeze-and-Excitation Networkshttps://arxiv.org/pdf/1709.01507
ShuffleNet - https://arxiv.org/pdf/1707.01083
ShuffleNet v2 - Efficient Convolutional Neural Network for Mobile Deviceshttps://arxiv.org/pdf/1807.11164
VGG - Very Deep Convolutional Networks for Large-scale Image Recognition https://arxiv.org/pdf/1409.1556

Object Detection

Semantic Segmentation

DDRNet (Deep Dual-resolution Networks) - https://arxiv.org/pdf/2101.06085.pdf
LadderNet - Multi-path networks based on U-Net for medical image segmentation https://arxiv.org/pdf/1810.07810
RegSeg - Rethink Dilated Convolution for Real-time Semantic Segmentation https://arxiv.org/pdf/2111.09957
ShelfNet - https://arxiv.org/pdf/1811.11254
STDC - Rethinking BiSeNet For Real-time Semantic Segmentation https://arxiv.org/pdf/2104.13188

Documentation

Check SuperGradients Docs for full documentation, user guide, and examples.

Contributing

To learn about making a contribution to SuperGradients, please see our Contribution page.

Our awesome contributors:

Made with contrib.rocks.

Citation

If you are using SuperGradients library or benchmarks in your research, please cite SuperGradients deep learning training library.

Community

If you want to be a part of SuperGradients growing community, hear about all the exciting news and updates, need help, request for advanced features, or want to file a bug or issue report, we would love to welcome you aboard!

Slack is the place to be and ask questions about SuperGradients and get support. Click here to join our Slack
To report a bug, file an issue on GitHub.
Join the SG Newsletter for staying up to date with new features and models, important announcements, and upcoming events.
For a short meeting with us, use this link and choose your preferred time.

License

This project is released under the Apache 2.0 license.

Deci Platform

Deci Platform is our end to end platform for building, optimizing and deploying deep learning models to production.

Sign up for our FREE Community Tier to enjoy immediate improvement in throughput, latency, memory footprint and model size.

Features:

Automatically compile and quantize your models with just a few clicks (TensorRT, OpenVINO).
Gain up to 10X improvement in throughput, latency, memory and model size.
Easily benchmark your models’ performance on different hardware and batch sizes.
Invite co-workers to collaborate on models and communicate your progress.
Deci supports all common frameworks and Hardware, from Intel CPUs to Nvidia's GPUs and Jetsons.

Tip!

Press p or to see the previous file or, n or to see the next file

Deci-AI / super-gradients connected to https://github.com/Deci-AI/super-gradients.git

README.md 33 KB History Raw

Fill our 4-question quick survey! We will raffle free SuperGradients swag between those who will participate -> Fill Survey

SuperGradients

Introduction

Why use SuperGradients?

What's New

Coming soon

Table of Content

Getting Started

Start Training with Just 1 Command Line

Quickly Load Pre-Trained Weights for Your Desired Model with SOTA Performance

Quick Start Notebook - Classification

Quick Start Notebook - Semantic Segmentation

Transfer Learning

Transfer Learning with SG Notebook - Semantic Segmentation

Knowledge Distillation Training

Knowledge Distillation Training Quick Start with SG Notebook - ResNet18 example

Installation Methods

Prerequisites

Quick Installation

Computer Vision Models - Pretrained Checkpoints

Pretrained Classification PyTorch Checkpoints

Pretrained Object Detection PyTorch Checkpoints

Pretrained Semantic Segmentation PyTorch Checkpoints

Implemented Model Architectures

Image Classification

Object Detection

Semantic Segmentation

Documentation

Contributing

Citation

Community

License

Deci Platform

Comments

Use AWS S3 as storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use Google Cloud Storage!

Specify your Google Storage bucket

Service Account Key

Congratulations!

Use Azure Cloud Storage!

Specify your Azure Storage bucket

Access key (If needed)

Congratulations!

Use any S3 compatible storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Deci-AI
/
super-gradients
connected to https://github.com/Deci-AI/super-gradients.git

README.md 33 KB

History Raw