@rdettai-sk
Collaborator

Description

When a search needs to process too many splits relative to the cluster capacity, we know upfront that the query is very unlikely to succeed. Instead of tying up resources for the whole duration of the configured timeout, this PR adds a node config to cancel these requests early (after planning but before sending the jobs to the leaves).
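
As a rough illustration of the early-cancel check described above, here is a minimal sketch. Only the `max_splits_per_search` field name comes from this PR's diff; `SearcherLimits`, `PlanError`, and `check_split_budget` are hypothetical names made up for the example and are not the actual types or functions in the codebase.

```rust
/// Hypothetical, simplified stand-in for the node's searcher settings.
/// Only the `max_splits_per_search` field name appears in this PR.
#[derive(Debug, Clone, Default)]
pub struct SearcherLimits {
    /// `None` means no limit is enforced (assumed to be the default).
    pub max_splits_per_search: Option<usize>,
}

/// Hypothetical error returned when the plan exceeds the configured budget.
#[derive(Debug, PartialEq, Eq)]
pub enum PlanError {
    TooManySplits { num_splits: usize, limit: usize },
}

/// Meant to run once planning has resolved the list of splits,
/// and before any leaf request is dispatched.
pub fn check_split_budget(
    num_splits: usize,
    limits: &SearcherLimits,
) -> Result<(), PlanError> {
    match limits.max_splits_per_search {
        // Fail fast: the query touches more splits than the node allows.
        Some(limit) if num_splits > limit => {
            Err(PlanError::TooManySplits { num_splits, limit })
        }
        // Either no limit is configured or the plan fits within it.
        _ => Ok(()),
    }
}
```

With the limit left at `None`, every plan passes the check, so existing deployments would keep their current behavior under this sketch. Whether a plan with exactly `limit` splits should pass is a design choice; the sketch lets it through.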

How was this PR tested?

Unit test

split_footer_cache_capacity: ByteSize::mb(500),
partial_request_cache_capacity: ByteSize::mb(64),
max_num_concurrent_split_searches: 100,
max_splits_per_search: None,
Collaborator

I'm surprised the serialization tests are passing without having to update the JSON files. Have we broken something?

Collaborator

Ah no, we don't have tests like that for node_config. That's OK then.

@fulmicoton fulmicoton merged commit 29422a9 into main Dec 17, 2025
8 checks passed
@fulmicoton fulmicoton deleted the fail-too-many-splits branch December 17, 2025 16:30