Skip to content

Conversation

@ShreyeshArangath
Copy link
Contributor

@ShreyeshArangath ShreyeshArangath commented Nov 30, 2025

Which issue does this PR close?

Note: This is a WIP PR, I'm still working on testing the support and also only allowing reads from COW tables.

Closes #1676

Rationale for this change

Auron currently does not support native execution for Iceberg DSv2 reads backed by BatchScanExec, so all Iceberg scans fall back to Spark execution. This change enables native execution for Iceberg COW table scans to unlock native file scan performance for Parquet/ORC Iceberg workloads.

What changes are included in this PR?

  • Adds an Iceberg convert provider to detect and convert Iceberg-backed BatchScanExec.
  • Introduces NativeIcebergBatchScanExec for native execution of Iceberg scans.
  • Wires the scan into the existing native file scan (Parquet/ORC) execution path.
  • Guards the feature behind a config flag.

Are there any user-facing changes?

Yes. When enabled via configuration, Iceberg COW table scans can now run through native execution. No behavior change when the feature flag is disabled.

How was this patch tested?

  • [WIP] Unit Tests and verification

@@ -0,0 +1,68 @@
#
# Licensed to the Apache Software Foundation (ASF) under one or more
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Borrowed from #1541 as its not merged in yet.

yew1eb

This comment was marked as duplicate.

@github-actions github-actions bot added the spark label Dec 1, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Enable native BatchScanExec for Iceberg COW tables

2 participants