This repository was archived by the owner on Apr 28, 2023. It is now read-only.
Commit 1e5ad91
committed
promoteToRegistersBelow: sort tensor reference groups
Follow the same strategy as with shared memory promotion: first, sort
tensors in decreasing order of the total number of references; then, for
each tensor, sort groups based on the number of references in this
group. Tensor groups with more references are expected to benefit more
from promotion as more global memory accesses may be avoided thanks to
explicit caching in faster layers of the memory hierarchy. Note that
since there is no limit on the number of registers to use, all groups
that can be promoted into registers are promoted, and the sorting has no
effect on the outcome. Such limit will be introduced next.1 parent 4aa377d commit 1e5ad91
1 file changed
+5
-5
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
656 | 656 | | |
657 | 657 | | |
658 | 658 | | |
659 | | - | |
660 | | - | |
| 659 | + | |
| 660 | + | |
661 | 661 | | |
662 | 662 | | |
663 | 663 | | |
| |||
673 | 673 | | |
674 | 674 | | |
675 | 675 | | |
676 | | - | |
| 676 | + | |
677 | 677 | | |
678 | | - | |
679 | | - | |
| 678 | + | |
| 679 | + | |
680 | 680 | | |
681 | 681 | | |
682 | 682 | | |
| |||
0 commit comments