
Conversation

Contributor

@BramOtte BramOtte commented Dec 6, 2025

This puts the forward links at the end of Node; when the forward links don't fit in the Node, extra nodes are simply inserted, which are skipped over when iterating.
As a result, looking up the forward links no longer has to go to a separate area of memory, and when there are fewer than 6 forward links no extra cache line needs to be loaded at all.
The disadvantage is that iterating becomes a bit more complicated.
Additionally, this requires unsafe and is a bit error-prone; I tried to minimize the potential for errors by avoiding magic numbers and adding asserts.
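Roughly, the layout described above looks like the sketch below (a minimal illustration only; the names, the value of LINKS_IN_NODE, and the overflow mechanism are assumptions, not the actual MCHPRS definitions):

const LINKS_IN_NODE: usize = 6; // forward links that fit inline (assumed value)

struct ForwardLink(u32); // placeholder for the real link type
type LinkBuffer = [ForwardLink; LINKS_IN_NODE];

struct Node {
    // ... the regular per-node state ...
    fwd_link_len: u16,     // total number of forward links for this node
    fwd_links: LinkBuffer, // inline links; the PR asserts this buffer sits at the end of Node
}

// Links that don't fit in `fwd_links` spill into extra Node-sized slots in the
// node array, and iteration skips over those slots, so the common case needs no
// separate allocation or pointer chase.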

Benchmarks:
rtps for tickn(50_000) with 1000 samples for the iris mandelbrot benchmark (run twice in a row):
From this we can see roughly a 5.6% speedup.

master (https://github.com/MCHPR/MCHPRS/tree/a179ab1be44dd44fa3f20ba67195a7f147ab06f2)

avg= 3259104 dev=   31748 q0= 3100066 q1= 3241976 q2= 3246659 q3= 3291493 q4= 3302341
avg= 3252497 dev=   26361 q0= 3098181 q1= 3228507 q2= 3248133 q3= 3278679 q4= 3304033

old (https://github.com/BramOtte/MCHPRS/tree/refactor-forward-links-unsafe):
(this seems to be within the margin of error of master)

avg= 3274858 dev=   29929 q0= 2998197 q1= 3274732 q2= 3281709 q3= 3284870 q4= 3352242
avg= 3280807 dev=   27363 q0= 3156344 q1= 3279137 q2= 3283034 q3= 3286403 q4= 3349798

This PR:

avg= 3459374 dev=   15655 q0= 3297415 q1= 3455197 q2= 3459280 q3= 3463341 q4= 3483251
avg= 3465641 dev=   20777 q0= 3198885 q1= 3457385 q2= 3473685 q3= 3478063 q4= 3482424

@BramOtte BramOtte force-pushed the interleaved-forward-links branch from 500b45a to 37f7db9 on December 7, 2025 at 10:45
assert!(offset_of!(Node, fwd_links) + size_of::<LinkBuffer>() == size_of::<Node>())
}

(fwd_link_len + BLOCK_SIZE - 1 - LINKS_IN_NODE) / BLOCK_SIZE
Contributor

Suggested change
(fwd_link_len + BLOCK_SIZE - 1 - LINKS_IN_NODE) / BLOCK_SIZE
(fwd_link_len - LINKS_IN_NODE).div_ceil(BLOCK_SIZE)

Contributor Author

Wouldn't that result in undefined behaviour when LINKS_IN_NODE is larger than fwd_link_len?
I guess combined with saturating_sub it would work out.

Contributor

Sorry, I missed that. Yeah, adding saturating_sub will work. The branch is fine as this section isn't performance-critical.

Contributor

@KonaeAkira KonaeAkira Dec 7, 2025

I must point out that integer overflow/underflow is not UB in Rust. Even in C and C++, only signed integer overflow is UB. Rust integer overflow is well-defined: it panics in debug builds and wraps around in release builds. A core design goal of Rust is to make UB impossible unless you're using unsafe.
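For what it's worth, a small standalone illustration of that point (the values are made up for the example):

fn main() {
    let fwd_link_len: usize = 2;
    let links_in_node: usize = 6;

    // Plain `fwd_link_len - links_in_node` would panic in a debug build and
    // wrap around in a release build; neither is undefined behaviour.
    let wrapped = fwd_link_len.wrapping_sub(links_in_node);   // huge value, but well-defined
    let clamped = fwd_link_len.saturating_sub(links_in_node); // 0
    let checked = fwd_link_len.checked_sub(links_in_node);    // None

    println!("{wrapped} {clamped} {checked:?}");
}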

Contributor Author

@BramOtte BramOtte Dec 7, 2025

Uhm, ok, interesting, but overflow is still not desired here, and like you said saturating_sub is fine because it's not performance-critical, so I used that approach.
Even without it, it would likely work out, since it would likely compile to basically the same code I had before, but someone reading the code can't really see that, since they only see the div_ceil and not what's inside.
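For concreteness, a sketch of the form adopted here next to the original expression (the helper name and constant values are assumptions for illustration):

const LINKS_IN_NODE: usize = 6; // assumed value
const BLOCK_SIZE: usize = 8;    // assumed value

// Hypothetical helper: how many extra overflow blocks a node needs.
fn extra_blocks(fwd_link_len: usize) -> usize {
    // Original form, manual round-up division; with these constants it cannot
    // underflow because BLOCK_SIZE - 1 >= LINKS_IN_NODE:
    //     (fwd_link_len + BLOCK_SIZE - 1 - LINKS_IN_NODE) / BLOCK_SIZE

    // Form used after this discussion: clamp at zero, then round up. It reads
    // clearly and cannot underflow, though the codegen differs from the manual
    // version (see the godbolt link in the next comment).
    fwd_link_len.saturating_sub(LINKS_IN_NODE).div_ceil(BLOCK_SIZE)
}

fn main() {
    assert_eq!(extra_blocks(4), 0);  // everything fits inline
    assert_eq!(extra_blocks(7), 1);  // one link spills into one overflow block
    assert_eq!(extra_blocks(15), 2); // nine spilled links need two blocks of eight
}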

Contributor Author

For performance-critical code, however, we should use the code I had before: even without the saturating_sub, the div_ceil does in fact not compile to the same code.
https://godbolt.org/z/he4GE693z

Contributor Author

BramOtte commented Dec 8, 2025

Doing some more benchmarks, this change seems to pretty significantly decrease the performance of flush, from ~2.5 ms to ~4 ms.
So this should probably only get merged if #206 gets merged too, as that should nullify this cost.

@KonaeAkira
Contributor

Doing some more benchmarks, this change seems to pretty significantly decrease the performance of flush, from ~2.5 ms to ~4 ms.

Would storing the valid node indices instead of computing them on the fly on every flush solve this regression?

Contributor Author

BramOtte commented Dec 8, 2025

Doing some more benchmarks, this change seems to pretty significantly decrease the performance of flush, from ~2.5 ms to ~4 ms.

Would storing the valid node indices instead of computing them on the fly on every flush solve this regression?

Doing this reduces the overhead; it now takes ~3 ms instead, which is a more acceptable amount.
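A rough sketch of what that could look like (hypothetical type and field names; the real backend keeps far more state than shown):

struct Node { /* node state, inline forward links, ... */ }

struct Backend {
    nodes: Vec<Node>,
    // Indices of real nodes, skipping the overflow slots; filled once when the
    // graph is built instead of being recomputed on every flush.
    valid_node_indices: Vec<u32>,
}

impl Backend {
    fn flush(&mut self) {
        let Self { nodes, valid_node_indices } = self;
        for &idx in valid_node_indices.iter() {
            let node = &mut nodes[idx as usize];
            // ... push this node's pending block updates out to the world ...
            let _ = node;
        }
    }
}

Maintaining the index list is a bit of extra bookkeeping at graph-build time, but flush runs every tick, so trading that for a straight iteration seems consistent with the ~3 ms figure above.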
