This repository was archived by the owner on Apr 28, 2023. It is now read-only.
Commit c6730e3 (1 parent: 9155a07)

promoteToRegistersAtDepth: limit the total number of elements promoted

The limit applies per thread and is cumulated for all subtrees where
promotion is performed. By default, it is set to SIZE_MAX, which
ensures backwards-compatible behavior for all sensible cases (if
something had required more than SIZE_MAX registers, it would have been
spilled to global memory and still would not have fit). This limit will
be exposed as a mapping option in an upcoming commit.
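
The accounting described in the commit message can be illustrated with a minimal sketch. This is not the code from the commit: `Candidate` and `promoteWithinBudget` are hypothetical names, and skipping (rather than stopping at) a candidate that does not fit is an assumption. It only shows how a single per-thread budget, shared by reference across subtrees, behaves with a finite limit and with the SIZE_MAX default.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for one promotion candidate: the number of
// elements each thread would keep in registers if it were promoted.
struct Candidate {
  std::size_t elementsPerThread;
};

// Promote candidates of one subtree while the shared per-thread budget
// allows it. `remaining` is taken by reference so that successive calls
// (one per subtree) draw from the same cumulative limit.
// Assumption: a candidate that does not fit is skipped, not fatal.
std::size_t promoteWithinBudget(
    const std::vector<Candidate>& candidates,
    std::size_t& remaining) {
  std::size_t promoted = 0;
  for (const auto& c : candidates) {
    if (c.elementsPerThread > remaining) {
      continue; // would exceed the limit; leave this one unpromoted
    }
    remaining -= c.elementsPerThread;
    ++promoted;
  }
  return promoted;
}
```

With the budget initialized to SIZE_MAX, the comparison never rejects a realistic candidate, which is why that default leaves the earlier behavior unchanged.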
2 files changed in tc/core/polyhedral/cuda (+10, -4 lines). Diff content not shown; hunk positions only:

| File | Original lines | New lines | Removed | Added |
|---|---|---|---|---|
| First file | 748-756 | 748-759 | 2 | 5 |
| First file | 784-790 | 787-793 | 1 | 1 |
| Second file | 46-52 | 46-55 | 1 | 4 |