You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+4-35Lines changed: 4 additions & 35 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,6 +2,9 @@
2
2
3
3
## What's New
4
4
5
+
### May 12, 2020
6
+
* Add ResNeSt models (code adapted from https://github.com/zhanghang1989/ResNeSt, paper https://arxiv.org/abs/2004.08955))
7
+
5
8
### May 3, 2020
6
9
* Pruned EfficientNet B1, B2, and B3 (https://arxiv.org/abs/2002.08258) contributed by [Yonathan Aflalo](https://github.com/yoniaflalo)
7
10
@@ -70,41 +73,6 @@
70
73
* Add RandAugment trained EfficientNet-B0 weight with 77.7 top-1. Trained by [Michael Klachko](https://github.com/michaelklachko) with this code and recent hparams (see Training section)
71
74
* Add `avg_checkpoints.py` script for post training weight averaging and update all scripts with header docstrings and shebangs.
72
75
73
-
### Dec 30, 2019
74
-
* Merge [Dushyant Mehta's](https://github.com/mehtadushy) PR for SelecSLS (Selective Short and Long Range Skip Connections) networks. Good GPU memory consumption and throughput. Original: https://github.com/mehtadushy/SelecSLS-Pytorch
75
-
76
-
### Dec 28, 2019
77
-
* Add new model weights and training hparams (see Training Hparams section)
78
-
*`efficientnet_b3` - 81.5 top-1, 95.7 top-5 at default res/crop, 81.9, 95.8 at 320x320 1.0 crop-pct
79
-
* trained with RandAugment, ended up with an interesting but less than perfect result (see training section)
80
-
*`seresnext26d_32x4d`- 77.6 top-1, 93.6 top-5
81
-
* deep stem (32, 32, 64), avgpool downsample
82
-
* stem/dowsample from bag-of-tricks paper
83
-
*`seresnext26t_32x4d`- 78.0 top-1, 93.7 top-5
84
-
* deep tiered stem (24, 48, 64), avgpool downsample (a modified 'D' variant)
85
-
* stem sizing mods from Jeremy Howard and fastai devs discussing ResNet architecture experiments
86
-
87
-
### Dec 23, 2019
88
-
* Add RandAugment trained MixNet-XL weights with 80.48 top-1.
89
-
*`--dist-bn` argument added to train.py, will distribute BN stats between nodes after each train epoch, before eval
90
-
91
-
### Dec 4, 2019
92
-
* Added weights from the first training from scratch of an EfficientNet (B2) with my new RandAugment implementation. Much better than my previous B2 and very close to the official AdvProp ones (80.4 top-1, 95.08 top-5).
93
-
94
-
### Nov 29, 2019
95
-
* Brought EfficientNet and MobileNetV3 up to date with my https://github.com/rwightman/gen-efficientnet-pytorch code. Torchscript and ONNX export compat excluded.
96
-
* AdvProp weights added
97
-
* Official TF MobileNetv3 weights added
98
-
* EfficientNet and MobileNetV3 hook based 'feature extraction' classes added. Will serve as basis for using models as backbones in obj detection/segmentation tasks. Lots more to be done here...
99
-
* HRNet classification models and weights added from https://github.com/HRNet/HRNet-Image-Classification
100
-
* Consistency in global pooling, `reset_classifer`, and `forward_features` across models
101
-
*`forward_features` always returns unpooled feature maps now
102
-
* Reasonable chance I broke something... let me know
103
-
104
-
### Nov 22, 2019
105
-
* Add ImageNet training RandAugment implementation alongside AutoAugment. PyTorch Transform compatible format, using PIL. Currently training two EfficientNet models from scratch with promising results... will update.
106
-
*`drop-connect` cmd line arg finally added to `train.py`, no need to hack model fns. Works for efficientnet/mobilenetv3 based models, ignored otherwise.
107
-
108
76
## Introduction
109
77
110
78
For each competition, personal, or freelance project involving images + Convolution Neural Networks, I build on top of an evolving collection of code and models. This repo contains a (somewhat) cleaned up and paired down iteration of that code. Hopefully it'll be of use to others.
@@ -130,6 +98,7 @@ Included models:
130
98
* Instagram trained / ImageNet tuned ResNeXt101-32x8d to 32x48d from from [facebookresearch](https://pytorch.org/hub/facebookresearch_WSL-Images_resnext/)
0 commit comments