Skip to content

Conversation

@ChSonnabend
Copy link
Collaborator

@ChSonnabend ChSonnabend commented Jun 30, 2025

Improve filling kernel speed by 20x using for-loop fill on CPU

@ChSonnabend ChSonnabend requested a review from davidrohr as a code owner June 30, 2025 20:54
@github-actions
Copy link
Contributor

REQUEST FOR PRODUCTION RELEASES:
To request your PR to be included in production software, please add the corresponding labels called "async-" to your PR. Add the labels directly (if you have the permissions) or add a comment of the form (note that labels are separated by a ",")

+async-label <label1>, <label2>, !<label3> ...

This will add <label1> and <label2> and removes <label3>.

The following labels are available
async-2023-pbpb-apass4
async-2023-pp-apass4
async-2024-pp-apass1
async-2022-pp-apass7
async-2024-pp-cpass0
async-2024-PbPb-apass1
async-2024-ppRef-apass1
async-2024-PbPb-apass2
async-2023-PbPb-apass5

@ChSonnabend ChSonnabend changed the title Improve filling kernel speed by 20x using for-loop fill on CPU NN clusterizer improve kernel speed Jun 30, 2025
@ChSonnabend
Copy link
Collaborator Author

Can be merged from my side

@davidrohr davidrohr merged commit 11329f6 into AliceO2Group:dev Jul 1, 2025
12 checks passed
@ChSonnabend ChSonnabend deleted the onnx_cpu_fill_kernel branch July 1, 2025 07:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Development

Successfully merging this pull request may close these issues.

2 participants