-
Notifications
You must be signed in to change notification settings - Fork 15.5k
[AArch64] Optimize more floating-point round+convert combinations into fcvt instructions #170018
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
valadaptive
wants to merge
17
commits into
llvm:main
Choose a base branch
from
valadaptive:aarch64-round-convert
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
+2,745
−169
Open
Changes from all commits
Commits
Show all changes
17 commits
Select commit
Hold shift + click to select a range
dc4777e
[AArch64] Add tests for roundeven+conversion fusion
valadaptive c4763b2
[AArch64] Add tests for vector round+conversion fusion
valadaptive f6c8b78
[AArch64] Add float-to-int codegen pattern for roundeven+fptoi
valadaptive d5b88fc
[AArch64] Use vector rounding conversion instructions
valadaptive e56d806
[AArch64] Add saturating vector float->int tests; remove intrinsic de…
valadaptive ea85543
[AArch64] Actually optimize all the vector float->int patterns
valadaptive 9507254
[AArch64] Add more roundeven+conversion tests
valadaptive 1eaf0d4
[AArch64] Regen roundeven tests
valadaptive 0886610
[AArch64] Add global-isel to new tests
valadaptive 9ca61d1
[AArch64] Use roundeven intrinsic for tests
valadaptive f62a112
[AArch64] Add non-saturating roundeven+fpto[su]i tests
valadaptive dd5b29a
[AArch64] Use intrinsics for all the rounding tests
valadaptive bc5441b
[AArch64] Add GlobalISel to fpto[su]i+round tests
valadaptive fb752c5
[AArch64][GlobalISel] Add patterns for scalar float-to-int conversions
valadaptive 7f64305
Merge remote-tracking branch 'upstream/main' into aarch64-round-convert
valadaptive 4ac8662
[AArch64] Remove `nounwind readnone` from some tests
valadaptive 9aa714c
Update llvm/test/CodeGen/AArch64/round-fptosi-sat-scalar.ll
valadaptive File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should also add tests for non-bitcast patterns. You can existing tests conversion llvm/test/CodeGen/AArch64/round-conv.ll and /home/marluk01/llvm-project/cvt-round/llvm/test/CodeGen/AArch64/round-fptosi-sat-scalar.ll if it helps.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That was a good catch; I actually discovered that the
roundeven/roundevenf/roundevenllibcalls are missing fromSelectionDAGBuilder::visitCalland can't get lowered into LLVM intrinsics! Since that's a target-agnostic issue and this PR only touches AArch64, I'm going to fix that in a separate PR.For now, I've added tests to round-fptosi-sat-scalar.ll and round-fptoui-sat-scalar.ll. The generated code isn't where we want it to be yet, but that's because of the missing SelectionDAG lowering. The round-conv.ll tests have manually-written test lines, so I haven't added anything there for now.