This repository was archived by the owner on Apr 28, 2023. It is now read-only.
Commit 900cb42
committed
promoteToSharedAtDepth: ignore scopes below thread mapping
Band nodes at the given depth may be above or below the thread mapping.
In the latter case, promoteToSharedBelow would throw an IncorrectScope
exception because promotion to shared below a thread mapping node is
forbidden and is generally nonsensical. In a branching tree, there is
no guarantee that nodes at the given schedule depth are all above or
below the thread mapping. Simply ignore the subtrees that have a
thread-mapping ancestor node when performing shared memory promotion at
depth.
Leaving promoteToSharedBelow as is (throwing an exception) because it
may end up being reused by different callers. Arguably,
promoteToSharedAtDepth is a higher-level API and should not throw if
only one subtree is problematic.1 parent b0b73cb commit 900cb42
File tree
2 files changed
+18
-8
lines changed- tc/core/polyhedral/cuda
- test
2 files changed
+18
-8
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
565 | 565 | | |
566 | 566 | | |
567 | 567 | | |
568 | | - | |
| 568 | + | |
| 569 | + | |
569 | 570 | | |
570 | 571 | | |
571 | 572 | | |
| |||
600 | 601 | | |
601 | 602 | | |
602 | 603 | | |
603 | | - | |
| 604 | + | |
| 605 | + | |
604 | 606 | | |
605 | 607 | | |
| 608 | + | |
| 609 | + | |
| 610 | + | |
| 611 | + | |
| 612 | + | |
| 613 | + | |
606 | 614 | | |
607 | 615 | | |
608 | 616 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
437 | 437 | | |
438 | 438 | | |
439 | 439 | | |
440 | | - | |
441 | | - | |
442 | | - | |
443 | | - | |
444 | | - | |
445 | | - | |
| 440 | + | |
| 441 | + | |
| 442 | + | |
| 443 | + | |
| 444 | + | |
| 445 | + | |
| 446 | + | |
| 447 | + | |
446 | 448 | | |
447 | 449 | | |
448 | 450 | | |
| |||
0 commit comments