Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

#211 SG-136: Apply ema only on student (KD)

Merged
Louis Dupont merged 1 commits into Deci-AI:master from deci-ai:feature/SG-136_use_ema_only_on_kd_student
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
  1. # Shelfnet34_lw recipe for COCO segmentation 21 classes from PASCAL.
  2. # Reaches ~65.1 mIOU
  3. # Trained using 4 X 2080 Ti using DDP- takes ~ 2d 7h with batch size of 8 and batch accumulate of 3 (i.e effective batch
  4. # size is 4*8*3 = 96)
  5. # Logs and tensorboards: s3://deci-pretrained-models/shelfnet34_coco_segmentation_tensorboard/
  6. # python train_from_recipe_example/train_from_recipe.py --config-name=coco_segmentation_shelfnet_lw
  7. defaults:
  8. - training_hyperparams: coco_segmentation_shelfnet_lw_train_params
  9. - dataset_params: coco_segmentation_dataset_params
  10. - arch_params: shelfnet34_lw_arch_params
  11. - checkpoint_params: default_checkpoint_params
  12. sub_classes:
  13. _target_: super_gradients.training.utils.segmentation_utils.coco_sub_classes_inclusion_tuples_list
  14. dataset_interface:
  15. _target_: super_gradients.training.datasets.dataset_interfaces.dataset_interface.CoCoSegmentationDatasetInterface
  16. dataset_params: ${dataset_params}
  17. dataset_classes_inclusion_tuples_list: ${sub_classes}
  18. data_loader_num_workers: 8
  19. load_checkpoint: True
  20. checkpoint_params:
  21. load_checkpoint: ${load_checkpoint}
  22. strict_load: True
  23. load_weights_only: True
  24. load_backbone: True
  25. source_ckpt_folder_name: resnet_34
  26. experiment_name: coco_segmentation_21_subclass_shelfnet34
  27. multi_gpu:
  28. _target_: super_gradients.training.sg_model.MultiGPUMode
  29. value: 'DDP'
  30. ckpt_root_dir:
  31. sg_model:
  32. _target_: super_gradients.SgModel
  33. experiment_name: ${experiment_name}
  34. model_checkpoints_location: ${model_checkpoints_location}
  35. ckpt_root_dir: ${ckpt_root_dir}
  36. multi_gpu: ${multi_gpu}
  37. architecture: shelfnet34_lw
Discard
Tip!

Press p or to see the previous file or, n or to see the next file