Commit 41dc49a

Vision Transformer refactoring and Rel Pos impl
1 parent b7cb8d0 commit 41dc49a

5 files changed, +544 -83 lines changed

README.md

Lines changed: 9 additions & 0 deletions
@@ -23,6 +23,15 @@ I'm fortunate to be able to dedicate significant time and money of my own suppor
 
 ## What's New
 
+
+### May 2, 2022
+* Vision Transformer experiments adding Relative Position (Swin-V2 log-coord) (`vision_transformer_relpos.py`) and Residual Post-Norm branches (from Swin-V2) (`vision_transformer*.py`)
+* `vit_relpos_base_patch32_plus_rpn_256` - 79.5 @ 256, 80.6 @ 320 -- rel pos + extended width + res-post-norm, no class token, avg pool
+* `vit_relpos_base_patch16_224` - 82.5 @ 224, 83.6 @ 320 -- rel pos, layer scale, no class token, avg pool
+* `vit_base_patch16_rpn_224` - 82.3 @ 224 -- rel pos + res-post-norm, no class token, avg pool
+* Vision Transformer refactor to remove representation layer that was only used in initial vit and rarely used since with newer pretrain (ie `How to Train Your ViT`)
+* `vit_*` models support removal of class token, use of global average pool, use of fc_norm (ala beit, mae).
+
 ### April 22, 2022
 * `timm` models are now officially supported in [fast.ai](https://www.fast.ai/)! Just in time for the new Practical Deep Learning course. `timmdocs` documentation link updated to [timm.fast.ai](http://timm.fast.ai/).
 * Two more model weights added in the TPU trained [series](https://github.com/rwightman/pytorch-image-models/releases/tag/v0.1-tpu-weights). Some In22k pretrain still in progress.

timm/models/__init__.py

Lines changed: 1 addition & 0 deletions
@@ -49,6 +49,7 @@
 from .visformer import *
 from .vision_transformer import *
 from .vision_transformer_hybrid import *
+from .vision_transformer_relpos import *
 from .volo import *
 from .vovnet import *
 from .xception import *
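
Note on the "Swin-V2 log-coord" wording in the README entry: this most likely refers to the log-spaced relative coordinates described in the Swin Transformer V2 paper, where a signed patch offset d is mapped to sign(d) * log(1 + |d|) before being fed to the position-bias network. The sketch below only illustrates that mapping in plain Python. The function names `log_coord` and `relative_log_coords` are hypothetical and are not taken from `vision_transformer_relpos.py`, whose actual implementation (not shown in this excerpt) may normalize or scale the coordinates differently.

```python
import math


def log_coord(delta: int) -> float:
    """Map a signed relative offset to a log-spaced coordinate.

    Assumed formulation (Swin-V2 paper): sign(d) * log(1 + |d|).
    """
    return math.copysign(math.log1p(abs(delta)), delta)


def relative_log_coords(height: int, width: int) -> dict:
    """Build a lookup of log-spaced coordinates for a patch grid.

    Keys are every (dy, dx) offset that can occur between two patches
    in a height x width grid; values are the transformed (dy, dx) pair.
    """
    table = {}
    for dy in range(-(height - 1), height):
        for dx in range(-(width - 1), width):
            table[(dy, dx)] = (log_coord(dy), log_coord(dx))
    return table


if __name__ == "__main__":
    # A 224 px image with 16 px patches gives a 14 x 14 grid,
    # so offsets run from -13 to 13 along each axis.
    coords = relative_log_coords(14, 14)
    print(len(coords), coords[(13, -13)])
```

The practical appeal of the log spacing is that large offsets are compressed, so a bias learned at one resolution has less distance to extrapolate over when evaluated on a larger patch grid, which would be consistent with the higher 320 px scores listed for the `vit_relpos_*` models.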
