@XorSum XorSum commented Dec 5, 2025

Which issue does this PR close?

Closes #1693.

Rationale for this change

As discussed previously in #1693 and #1694, the join operation should check the batch size and trigger flushing in a timely manner, to prevent extremely large batch sizes.

What changes are included in this PR?

Are there any user-facing changes?

How was this patch tested?


Copilot AI left a comment


Pull request overview

This PR addresses issue #1693 by refactoring the sort-merge join (SMJ) implementation to check batch size and flush more frequently when processing joins with duplicate keys. The goal is to prevent excessive memory usage by ensuring timely flushing of output batches during cartesian product generation.

Key Changes

  • Simplified the duplicate key handling algorithm by collecting all duplicate indices from both sides before generating the cartesian product
  • Added flush checks inside the cartesian product generation loop (instead of only after collecting all duplicates)
  • Removed the complex streaming logic that separately handled left and right side streaming


Comment on lines 163 to 164
if self.should_flush() {
self.as_mut().flush(cur1, cur2).await?;

Copilot AI Dec 5, 2025


The flush condition is incomplete and missing cleanup calls that are critical for memory management during joins with many duplicate keys:

  1. Missing check for cur1.num_buffered_batches() > 1 || cur2.num_buffered_batches() > 1 (present at lines 99-102)
  2. Missing calls to clean_out_dated_batches() after flushing (used consistently elsewhere at lines 104-105, 174-175, 186, 196)

Without these, memory will accumulate unbounded during cartesian product generation when there are many duplicate keys, defeating the purpose of this PR.

Should be:

if self.should_flush() || cur1.num_buffered_batches() > 1 || cur2.num_buffered_batches() > 1 {
    self.as_mut().flush(cur1, cur2).await?;
    cur1.clean_out_dated_batches();
    cur2.clean_out_dated_batches();
}
@XorSum (author) replied:

We should not call cur1.clean_out_dated_batches(), because doing so would break the index correlation between cur1 and equal_lindices.

@xumingming

This change seems like too much to me. I was thinking just splitting the loop here into small batches would be enough?

for (&lidx, &ridx) in equal_lindices.iter().cartesian_product(&equal_rindices) {
    self.lindices.push(lidx);
    self.rindices.push(ridx);
}
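The splitting suggested above can be sketched in isolation. The following is a hypothetical, self-contained model rather than the PR's actual code: `equal_lindices`/`equal_rindices` stand for the duplicate-key index runs, the returned `Vec<Vec<...>>` stands in for the flushed output batches, and the key point is that the size check runs inside the product loop instead of only after it.

```rust
// Hypothetical sketch: emit the cartesian product of two duplicate-key index
// runs in flush-sized chunks, so no single output batch exceeds `batch_size`.
// Each inner Vec stands in for one flushed output batch of (lidx, ridx) pairs.
fn joined_in_chunks(
    equal_lindices: &[usize],
    equal_rindices: &[usize],
    batch_size: usize,
) -> Vec<Vec<(usize, usize)>> {
    let mut batches = Vec::new();
    let mut buf: Vec<(usize, usize)> = Vec::new();
    for &lidx in equal_lindices {
        for &ridx in equal_rindices {
            buf.push((lidx, ridx));
            // Check the batch size inside the product loop, not after it,
            // so a large duplicate-key run cannot accumulate unbounded.
            if buf.len() >= batch_size {
                batches.push(std::mem::take(&mut buf)); // stands in for flush()
            }
        }
    }
    if !buf.is_empty() {
        batches.push(buf); // final partial batch
    }
    batches
}

fn main() {
    // 3 x 4 = 12 pairs with batch_size 5 -> batches of sizes 5, 5, 2.
    let batches = joined_in_chunks(&[0, 1, 2], &[0, 1, 2, 3], 5);
    assert_eq!(batches.len(), 3);
    assert!(batches.iter().all(|b| b.len() <= 5));
    assert_eq!(batches.iter().map(|b| b.len()).sum::<usize>(), 12);
}
```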


XorSum commented Dec 6, 2025

The change seems too much for me. I was thinking just split the loop here into small batches is enough?

done

@xumingming

@XorSum I think a test that can reproduce your issue should be added, to confirm it is fixed by the new code.

- for (&lidx, &ridx) in equal_lindices.iter().cartesian_product(&equal_rindices) {
-     self.lindices.push(lidx);
-     self.rindices.push(ridx);
+ for &lidx in &equal_lindices {
@xumingming xumingming Dec 6, 2025


How about:

// Choose a better name.
fn has_enough_room(&self, new_size: usize) -> bool {
    self.lindices.len() + new_size < self.join_params.batch_size
}

let new_size = equal_lindices.len() * equal_rindices.len();
if self.has_enough_room(new_size) {
    // old cartesian_product way
} else {
    // do a more aggressive flush
}

And don't forget to clear the outdated batches.
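The capacity check proposed above can be modeled with free functions. This is an illustrative sketch under assumed names (`pending_len`, `use_fast_path` are not the PR's identifiers): take the plain cartesian-product path only when the pending output plus the new pairs still fits in one batch, and otherwise fall back to the aggressive-flush path.

```rust
// Hypothetical model of the has_enough_room() check: the new pairs produced by
// a duplicate-key run are the full cartesian product of the two runs, and the
// fast path is taken only when they fit alongside the already-buffered output.
fn has_enough_room(pending_len: usize, new_size: usize, batch_size: usize) -> bool {
    pending_len + new_size < batch_size
}

fn use_fast_path(
    pending_len: usize,
    n_equal_left: usize,
    n_equal_right: usize,
    batch_size: usize,
) -> bool {
    // new_size is the full cartesian product of the duplicate-key runs
    let new_size = n_equal_left * n_equal_right;
    has_enough_room(pending_len, new_size, batch_size)
}

fn main() {
    // 2 x 3 = 6 new pairs on top of 10 pending fits in a batch of 100.
    assert!(use_fast_path(10, 2, 3, 100));
    // 50 x 50 = 2500 new pairs does not; take the aggressive-flush path.
    assert!(!use_fast_path(10, 50, 50, 100));
}
```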

@XorSum (author) replied:

I have added this logic and a unit test. But we should not clear the outdated batches here.

Copilot AI left a comment


Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

Comments suppressed due to low confidence (1)

native-engine/datafusion-ext-plans/src/joins/smj/full_join.rs:201

  • Inconsistent flushing strategy: The aggressive flushing logic added for the cartesian product (lines 177-186) is not applied here in the "stream right side" path. When equal_lindices is large, this loop can add many elements before checking should_flush() at line 197, potentially creating batches much larger than batch_size.

Consider adding a flush check inside the inner loop (after line 194), similar to the approach used in lines 177-186, to prevent extreme batch sizes when streaming with duplicated keys.

if r_equal {
    // stream right side
    while !cur2.finished() && cur2.cur_key() == cur1.key(l_key_idx) {
        for &lidx in &equal_lindices {
            self.lindices.push(lidx);
            self.rindices.push(cur2.cur_idx());
        }
        cur_forward!(cur2);
        if self.should_flush() || cur2.num_buffered_batches() > 1 {
            self.as_mut().flush(cur1, cur2).await?;
            cur2.clean_out_dated_batches();
        }
    }
}
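The effect of the per-row flush check in the streaming loop above can be sketched with a simplified model. This is not the SMJ implementation itself: `stream_right` uses plain integer counters in place of the cursors and index buffers, and only demonstrates why checking after every right row keeps the pending output near `batch_size`.

```rust
// Hypothetical model of the "stream right side" path: for each right row with
// the matching key, one pair per buffered left index is appended, and a flush
// check runs after every right row. Returns (flush_count, max_pending_pairs).
fn stream_right(
    equal_lindices: &[usize],
    right_rows: &[usize],
    batch_size: usize,
) -> (usize, usize) {
    let mut pending = 0usize; // pairs currently in lindices/rindices
    let mut flushes = 0usize;
    let mut max_pending = 0usize;
    for _ridx in right_rows {
        pending += equal_lindices.len(); // one pair per buffered left index
        max_pending = max_pending.max(pending);
        if pending >= batch_size {
            // stands in for flush(cur1, cur2) + clean_out_dated_batches()
            pending = 0;
            flushes += 1;
        }
    }
    (flushes, max_pending)
}

fn main() {
    // 4 left duplicates x 100 right rows, batch_size 10: the pending buffer
    // never overshoots the limit by more than one right-row step (4 pairs).
    let rows: Vec<usize> = (0..100).collect();
    let (flushes, max_pending) = stream_right(&[0, 1, 2, 3], &rows, 10);
    assert!(max_pending < 10 + 4);
    assert!(flushes > 0);
}
```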


XorSum and others added 2 commits December 8, 2025 11:20
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
@cxzl25 cxzl25 requested a review from richox December 10, 2025 04:00

Successfully merging this pull request may close these issues.

Arrow Vector OversizedAllocationException when processing large batches
