|
1 | | -# PyTorch Image Models, etc |
| 1 | +# PyTorch Image Models |
| 2 | +- [Sponsors](#sponsors) |
| 3 | +- [What's New](#whats-new) |
| 4 | +- [Introduction](#introduction) |
| 5 | +- [Models](#models) |
| 6 | +- [Features](#features) |
| 7 | +- [Results](#results) |
| 8 | +- [Getting Started (Documentation)](#getting-started-documentation) |
| 9 | +- [Train, Validation, Inference Scripts](#train-validation-inference-scripts) |
| 10 | +- [Awesome PyTorch Resources](#awesome-pytorch-resources) |
| 11 | +- [Licenses](#licenses) |
| 12 | +- [Citing](#citing) |
| 13 | + |
| 14 | +## Sponsors |
| 15 | + |
| 16 | +A big thank you to my [GitHub Sponsors](https://github.com/sponsors/rwightman) for their support! |
| 17 | + |
| 18 | +In addition to the sponsors at the link above, I've received hardware and/or cloud resources from |
| 19 | +* Nvidia (https://www.nvidia.com/en-us/) |
| 20 | +* TFRC (https://www.tensorflow.org/tfrc) |
| 21 | + |
| 22 | +I'm fortunate to be able to dedicate significant time and money of my own supporting this and other open source projects. However, as the projects increase in scope, outside support is needed to continue with the current trajectory of hardware, infrastructure, and electricty costs. |
2 | 23 |
|
3 | 24 | ## What's New |
4 | 25 |
|
|
120 | 141 | * Support for native Torch AMP and channels_last memory format added to train/validate scripts (`--channels-last`, `--native-amp` vs `--apex-amp`) |
121 | 142 | * Models tested with channels_last on latest NGC 20.08 container. AdaptiveAvgPool in attn layers changed to mean((2,3)) to work around bug with NHWC kernel. |
122 | 143 |
|
123 | | -### Aug 12, 2020 |
124 | | -* New/updated weights from training experiments |
125 | | - * EfficientNet-B3 - 82.1 top-1 (vs 81.6 for official with AA and 81.9 for AdvProp) |
126 | | - * RegNetY-3.2GF - 82.0 top-1 (78.9 from official ver) |
127 | | - * CSPResNet50 - 79.6 top-1 (76.6 from official ver) |
128 | | -* Add CutMix integrated w/ Mixup. See [pull request](https://github.com/rwightman/pytorch-image-models/pull/218) for some usage examples |
129 | | -* Some fixes for using pretrained weights with `in_chans` != 3 on several models. |
130 | | - |
131 | | -### Aug 5, 2020 |
132 | | -Universal feature extraction, new models, new weights, new test sets. |
133 | | -* All models support the `features_only=True` argument for `create_model` call to return a network that extracts feature maps from the deepest layer at each stride. |
134 | | -* New models |
135 | | - * CSPResNet, CSPResNeXt, CSPDarkNet, DarkNet |
136 | | - * ReXNet |
137 | | - * (Modified Aligned) Xception41/65/71 (a proper port of TF models) |
138 | | -* New trained weights |
139 | | - * SEResNet50 - 80.3 top-1 |
140 | | - * CSPDarkNet53 - 80.1 top-1 |
141 | | - * CSPResNeXt50 - 80.0 top-1 |
142 | | - * DPN68b - 79.2 top-1 |
143 | | - * EfficientNet-Lite0 (non-TF ver) - 75.5 (submitted by [@hal-314](https://github.com/hal-314)) |
144 | | -* Add 'real' labels for ImageNet and ImageNet-Renditions test set, see [`results/README.md`](results/README.md) |
145 | | -* Test set ranking/top-n diff script by [@KushajveerSingh](https://github.com/KushajveerSingh) |
146 | | -* Train script and loader/transform tweaks to punch through more aug arguments |
147 | | -* README and documentation overhaul. See initial (WIP) documentation at https://rwightman.github.io/pytorch-image-models/ |
148 | | -* adamp and sgdp optimizers added by [@hellbell](https://github.com/hellbell) |
149 | | - |
150 | | -### June 11, 2020 |
151 | | -Bunch of changes: |
152 | | -* DenseNet models updated with memory efficient addition from torchvision (fixed a bug), blur pooling and deep stem additions |
153 | | -* VoVNet V1 and V2 models added, 39 V2 variant (ese_vovnet_39b) trained to 79.3 top-1 |
154 | | -* Activation factory added along with new activations: |
155 | | - * select act at model creation time for more flexibility in using activations compatible with scripting or tracing (ONNX export) |
156 | | - * hard_mish (experimental) added with memory-efficient grad, along with ME hard_swish |
157 | | - * context mgr for setting exportable/scriptable/no_jit states |
158 | | -* Norm + Activation combo layers added with initial trial support in DenseNet and VoVNet along with impl of EvoNorm and InplaceAbn wrapper that fit the interface |
159 | | -* Torchscript works for all but two of the model types as long as using Pytorch 1.5+, tests added for this |
160 | | -* Some import cleanup and classifier reset changes, all models will have classifier reset to nn.Identity on reset_classifer(0) call |
161 | | -* Prep for 0.1.28 pip release |
162 | 144 |
|
163 | 145 | ## Introduction |
164 | 146 |
|
@@ -271,9 +253,13 @@ Several (less common) features that I often utilize in my projects are included. |
271 | 253 |
|
272 | 254 | Model validation results can be found in the [documentation](https://rwightman.github.io/pytorch-image-models/results/) and in the [results tables](results/README.md) |
273 | 255 |
|
274 | | -## Getting Started |
| 256 | +## Getting Started (Documentation) |
| 257 | + |
| 258 | +My current [documentation](https://rwightman.github.io/pytorch-image-models/) for `timm` covers the basics. |
| 259 | + |
| 260 | +[timmdocs](https://fastai.github.io/timmdocs/) is quickly becoming a much more comprehensive set of documentation for `timm`. A big thanks to [Aman Arora](https://github.com/amaarora) for his efforts creating timmdocs. |
275 | 261 |
|
276 | | -See the [documentation](https://rwightman.github.io/pytorch-image-models/) |
| 262 | +[paperswithcode](https://paperswithcode.com/lib/timm) is a good resource for browsing the models within `timm`. |
277 | 263 |
|
278 | 264 | ## Train, Validation, Inference Scripts |
279 | 265 |
|
|
0 commit comments