add parsing variant that doesn't allocate #83

petrosagg · 2025-04-23T18:31:48Z

This PR improves the routines that convert f64 values into Decimal<N> values by avoiding allocating large strings. The performance comes from:

Serializing the float using the scientific notation, which uses at most 24 characters as opposed to ~800 characters of the only decimal notation.
Avoiding all roundtrips to the allocator by writing into a small stack allocated buffer

I have included a benchmark for the from_f64 method which shows a 4x improvement:

parse_decimal           time:   [658.76 ns 661.73 ns 665.05 ns]                           
                        change: [-78.751% -78.645% -78.534%] (p = 0.00 < 0.05)
                        Performance has improved.

Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

ParkMyCar

Nice!

ParkMyCar · 2025-04-29T13:30:37Z

dec/src/decimal.rs

+        // * 3 bytes for the largest possible exponent (308)
+        // An example of such maximal float value is f64::from_bits(0x8008000000000000) whose
+        // decimal representation is '-1.1125369292536007e-308'
+        const MAX_LEN: usize = 24;


This makes sense to me! Just double checked and ryu also sizes their buffer with 24 bytes so these seems quite safe

Nice to see it validated elsewhere!

ParkMyCar · 2025-04-29T13:31:48Z

dec/src/decimal.rs

+
+        let mut buf = [0u8; MAX_LEN + 1];
+        let mut unwritten = &mut buf[..MAX_LEN];
+        write!(unwritten, "{:e}", n).unwrap();


IIRC the write! macro has a decent amount of machinery internally that it might be nice to bypass, not a huge priority though.

Talked on slack, doesn't seem to be much we do here so leaving it as-is

ParkMyCar · 2025-04-29T13:32:05Z

dec/src/decimal.rs

+        // decimal representation is '-1.1125369292536007e-308'
+        const MAX_LEN: usize = 24;
+
+        let mut buf = [0u8; MAX_LEN + 1];


It seems like this +1 is to ensure the last byte is a null character for the C String? mind documenting that?

antiguru

The f32 conversion is not correct.

fn main() {
    println!("{}", 4.8f32 as f64);
    println!("{}", 4.8f64 as f64);
}

prints

4.800000190734863
4.8

petrosagg · 2025-04-29T15:46:27Z

@antiguru good catch! Pushed a change to fix this. I added a generic method for both float sizes that calls into the appropriate stringify method

antiguru

Thanks for addressing the f32 issue!

antiguru · 2025-04-29T18:01:56Z

dec/src/decimal.rs

+        self.from_float(n)
+    }
+
+    /// Converts an `f64` to a `Decimal<N>`.


The description is now incorrect. Also, we should ensure that we only pass f32|f64 here, and not accidentally f128 in the future.

Indeed, I'll fix the description. I'm not super concerned about ensuring statically that T is either f32 or f64 since you can't cause UB by doing that but I will document it for the future engineer

Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

This PR fixes a regression introduced in #83 which started serializing floats using their scientific notation, which is much shorter than the decimal notation and therefore faster. It turns out that conversion from string to decimal is syntactic instead of semantic. This means that the strings "12" and "1.2E2", which are semantically the same number, parse into a different Decimal instance. This PR fixes the regression by serializing to the decimal representation. The overall improvement from 0.4.9 is much more mild as a result. ``` parse_decimal time: [2.4036 µs 2.4180 µs 2.4389 µs] change: [-22.670% -21.979% -21.336%] (p = 0.00 < 0.05) Performance has improved. ``` Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

petrosagg requested a review from antiguru April 23, 2025 18:31

petrosagg added 2 commits April 29, 2025 16:17

add decimal parse benchmark

1889166

add parsing variant that doesn't allocate

c1498b0

Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

petrosagg force-pushed the non-allocating-parsing branch from e81561c to 0d76ca8 Compare April 29, 2025 13:23

ParkMyCar approved these changes Apr 29, 2025

View reviewed changes

petrosagg force-pushed the non-allocating-parsing branch from 0d76ca8 to 132d7ae Compare April 29, 2025 14:24

antiguru requested changes Apr 29, 2025

View reviewed changes

petrosagg force-pushed the non-allocating-parsing branch from 132d7ae to b906875 Compare April 29, 2025 15:44

antiguru approved these changes Apr 29, 2025

View reviewed changes

optimize float to decimal conversion

2af10a8

Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

petrosagg force-pushed the non-allocating-parsing branch from b906875 to 2af10a8 Compare April 29, 2025 18:12

petrosagg enabled auto-merge (rebase) April 29, 2025 18:12

petrosagg disabled auto-merge April 29, 2025 18:29

petrosagg enabled auto-merge (rebase) April 29, 2025 18:31

update ctest

c7db9da

Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

petrosagg force-pushed the non-allocating-parsing branch from 4d012b1 to c7db9da Compare April 30, 2025 08:22

petrosagg merged commit 37c0632 into master May 1, 2025
6 checks passed

petrosagg deleted the non-allocating-parsing branch May 1, 2025 11:27

petrosagg mentioned this pull request May 7, 2025

dec: use decimal representation when converting to Decimal #86

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add parsing variant that doesn't allocate #83

add parsing variant that doesn't allocate #83

Uh oh!

petrosagg commented Apr 23, 2025 •

edited

Loading

Uh oh!

ParkMyCar left a comment

Uh oh!

ParkMyCar Apr 29, 2025

Uh oh!

petrosagg Apr 29, 2025

Uh oh!

ParkMyCar Apr 29, 2025

Uh oh!

petrosagg Apr 29, 2025

Uh oh!

ParkMyCar Apr 29, 2025

Uh oh!

antiguru left a comment •

edited

Loading

Uh oh!

petrosagg commented Apr 29, 2025

Uh oh!

antiguru left a comment

Uh oh!

antiguru Apr 29, 2025

Uh oh!

petrosagg Apr 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

add parsing variant that doesn't allocate #83

add parsing variant that doesn't allocate #83

Uh oh!

Conversation

petrosagg commented Apr 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ParkMyCar left a comment

Choose a reason for hiding this comment

Uh oh!

ParkMyCar Apr 29, 2025

Choose a reason for hiding this comment

Uh oh!

petrosagg Apr 29, 2025

Choose a reason for hiding this comment

Uh oh!

ParkMyCar Apr 29, 2025

Choose a reason for hiding this comment

Uh oh!

petrosagg Apr 29, 2025

Choose a reason for hiding this comment

Uh oh!

ParkMyCar Apr 29, 2025

Choose a reason for hiding this comment

Uh oh!

antiguru left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

petrosagg commented Apr 29, 2025

Uh oh!

antiguru left a comment

Choose a reason for hiding this comment

Uh oh!

antiguru Apr 29, 2025

Choose a reason for hiding this comment

Uh oh!

petrosagg Apr 29, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

petrosagg commented Apr 23, 2025 •

edited

Loading

antiguru left a comment •

edited

Loading