You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* LR scheduler ideas from [AllenNLP](https://github.com/allenai/allennlp/tree/master/allennlp/training/learning_rate_schedulers) and [FAIRseq](https://github.com/pytorch/fairseq/tree/master/fairseq/optim/lr_scheduler)
15
-
* Random Erasing from [Zhun Zhong](https://github.com/zhunzhong07/Random-Erasing/blob/master/transforms.py)
14
+
* LR scheduler ideas from [AllenNLP](https://github.com/allenai/allennlp/tree/master/allennlp/training/learning_rate_schedulers), [FAIRseq](https://github.com/pytorch/fairseq/tree/master/fairseq/optim/lr_scheduler), and SGDR: Stochastic Gradient Descent with Warm Restarts (https://arxiv.org/abs/1608.03983)
15
+
* Random Erasing from [Zhun Zhong](https://github.com/zhunzhong07/Random-Erasing/blob/master/transforms.py) (https://arxiv.org/abs/1708.04896)
16
16
17
17
## Models
18
18
@@ -30,12 +30,13 @@ I've included a few of my favourite models, but this is not an exhaustive collec
30
30
* DPN (from [me](https://github.com/rwightman/pytorch-dpn-pretrained), weights hosted by Cadene)
* Generic MobileNet (from my standalone [GenMobileNet](https://github.com/rwightman/genmobilenet-pytorch)) - A generic model that implements many of the mobile optimized architecture search derived models that utilize similar DepthwiseSeparable and InvertedResidual blocks
33
-
* MNASNet B1, A1 (Squeeze-Excite), and Small
34
-
* MobileNet-V1
35
-
* MobileNet-V2
36
-
* MobileNet-V3 (work in progress, validating config)
37
-
* ChamNet (details hard to find, currently an educated guess)
38
-
* FBNet-C (TODO A/B variants)
33
+
* MNASNet B1, A1 (Squeeze-Excite), and Small (https://arxiv.org/abs/1807.11626)
34
+
* MobileNet-V1 (https://arxiv.org/abs/1704.04861)
35
+
* MobileNet-V2 (https://arxiv.org/abs/1801.04381)
36
+
* MobileNet-V3 (https://arxiv.org/abs/1905.02244) -- work in progress, validating config
37
+
* ChamNet (https://arxiv.org/abs/1812.08934) -- specific arch details hard to find, currently an educated guess
38
+
* FBNet-C (https://arxiv.org/abs/1812.03443) -- TODO A/B variants
39
+
* Single-Path NAS (https://arxiv.org/abs/1904.02877) -- pixel1 variant
39
40
40
41
The full list of model strings that can be passed to model factory via `--model` arg for train, validation, inference scripts:
0 commit comments