Commit 0287a5c
[OpenMP] Remove 'minncta' attributes from NVPTX kernels (#88398)
Summary:
Currently we treat this attribute as a minimum for the number of
blocks scheduled for the kernel. However, the documentation states that
it applies to CTAs mapped onto a *single* SM. Currently we just set
it to the total number of blocks, which will almost always result in a
warning that the value is out of range and will be ignored. We don't
have a good way to automatically know how many CTAs can be put on a
single SM, nor whether we should do this, so we should probably leave
it up to users to add manually.
https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#performance-tuning-directives-minnctapersm
1 parent 071ac0a commit 0287a5c
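The range problem described in the summary can be illustrated with a small sketch. This is not code from the commit: `fitsOnOneSM` is a hypothetical helper, and the per-SM limit of 32 resident CTAs is an assumed, device-specific number chosen only for illustration. It models the directive as a per-SM count, which is why a grid-wide block count almost never fits.

```cpp
#include <cstdint>

// Hypothetical helper (not part of LLVM, Clang, or CUDA): a simplified model
// of whether a requested minimum-CTAs-per-SM value can be honored, given how
// many CTAs one SM can hold at once. The limit is supplied by the caller
// because, as the commit message notes, the compiler has no good way to know it.
constexpr bool fitsOnOneSM(std::uint32_t minCtasPerSM,
                           std::uint32_t maxResidentCtasPerSM) {
  return minCtasPerSM <= maxResidentCtasPerSM;
}

// Illustrative numbers only; both are assumptions, not values from the commit.
constexpr std::uint32_t kAssumedMaxResidentCtasPerSM = 32;
constexpr std::uint32_t kTotalBlocksInGrid = 4096; // e.g. a lower bound on teams

// Passing the total block count, as the removed code did, exceeds the
// per-SM range in this model.
static_assert(!fitsOnOneSM(kTotalBlocksInGrid, kAssumedMaxResidentCtasPerSM),
              "a grid-wide block count does not fit on one SM");

// A small per-SM value chosen by hand is in range.
static_assert(fitsOnOneSM(2, kAssumedMaxResidentCtasPerSM),
              "a small per-SM request is in range");
```

Under these assumed numbers, 4096 is rejected while 2 is accepted. A user who knows the target device can pick a sensible per-SM value; the frontend cannot derive one from the team count alone.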
2 files changed: +2 -5 lines changed
- clang/test/OpenMP
- llvm/lib/Frontend/OpenMP

| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 36 | 36 | |
| 37 | 37 | |
| 38 | 38 | |
| 39 | | - |
| 40 | | - |
| | 39 | + |
| 41 | 40 | |

| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 4786 | 4786 | |
| 4787 | 4787 | |
| 4788 | 4788 | |
| 4789 | | - |
| | 4789 | + |
| 4790 | 4790 | |
| 4791 | 4791 | |
| 4792 | | - |
| 4793 | | - |
| 4794 | 4792 | |
| 4795 | 4793 | |
| 4796 | 4794 | |