Fix race condition in concurrent artifact additions #483
Conversation
✅ A new PR has been created in buildah to vendor these changes: containers/buildah#6526

Packit jobs failed. @containers/packit-build please check.
mtrmac left a comment:
Thanks!
The diagnosis + fix is correct. Are we fine with paying the price? Earlier related discussion: containers/podman#26524. Cc: @mheon @baude @Luap99.
Alternatively, it should be possible for oci/layout's newImageDestination to defer reading the index.json until the start of the "write manifests" phase; that would avoid the need to hold the lock for the duration of blob copies, with the fairly major downside that this would (again) give pkg/libartifact a very remote dependency on internal implementation choices of `oci/layout`.
```go
as.lock.Lock()
locked = true
// Lock is still held from the beginning of the function
```
This comment does not make much sense stand-alone for future readers of the code, without seeing it in this diff.
Yes, ref containers/podman#27574. It would be much better to fix this properly, so that we reload index.json after the relock. That means redoing the conflict check again.
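The reload-and-recheck being asked for can be sketched as a toy Go model (this is illustrative, not the actual libartifact code: `indexStore`, its in-memory map, and `errArtifactAlreadyExists` stand in for the on-disk index.json and `libartTypes.ErrArtifactAlreadyExists`):

```go
package main

import (
	"errors"
	"fmt"
	"sync"
)

// Stand-in for libartTypes.ErrArtifactAlreadyExists.
var errArtifactAlreadyExists = errors.New("artifact already exists")

// indexStore is a toy model of the artifact store: the real store
// persists an OCI index.json to disk; a map is enough to show the
// reload-and-recheck step after the lock is reacquired.
type indexStore struct {
	lock  sync.Mutex
	index map[string]string // artifact name -> manifest digest (illustrative)
}

func (s *indexStore) Add(name, digest string, copyBlobs func()) error {
	s.lock.Lock()
	locked := true
	defer func() {
		if locked {
			s.lock.Unlock()
		}
	}()

	// First check, under the lock, before the slow blob copy.
	if _, ok := s.index[name]; ok {
		return fmt.Errorf("%s: %w", name, errArtifactAlreadyExists)
	}

	locked = false
	s.lock.Unlock() // allow concurrent additions while blobs copy

	copyBlobs()

	s.lock.Lock()
	locked = true
	// Reload and re-check: another Add may have committed an entry for
	// the same name while the lock was released. Error out instead of
	// silently overwriting the concurrent change.
	if _, ok := s.index[name]; ok {
		return fmt.Errorf("%s: %w", name, errArtifactAlreadyExists)
	}
	s.index[name] = digest // commit
	return nil
}
```

Without the second check, the final `s.index[name] = digest` would clobber whatever a concurrent Add committed during the copy window, which is exactly the lost-artifact symptom described in this PR.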
```go
locked = false
// Release the lock during blob copying to allow concurrent operations.
// We'll reacquire it later and check for conflicts before committing.
as.lock.Unlock()
```
If you're keeping this, you absolutely need to keep the `locked` variable; otherwise an error return while the lock is released risks a double unlock when the defer fires before the lock is reacquired.
This fixes a race condition where concurrent 'podman artifact add' commands for different artifacts would result in only one artifact being created, without any error messages.

The root cause was in the artifact store's Add() method. When the lock was released during blob copying (for performance), concurrent additions could overwrite each other's index entries:

- Process A: Lock → Read index → Unlock → Copy blobs → Lock → Commit
- Process B: Lock → Read index (missing A) → Unlock → Copy blobs → Lock → Commit
- Result: Process B's commit overwrites Process A's entry

The fix adds conflict detection after reacquiring the lock. Before committing, we reload the artifact index and verify the artifact name hasn't been added by another process. If a conflict is detected, we return an error rather than silently overwriting the concurrent change.

A 'locked' boolean variable tracks lock state to prevent double-unlock if an error occurs during blob copying (when the lock is released). The deferred unlock only fires if 'locked' is true.

This preserves the performance benefits of concurrent blob copying while preventing the race condition that caused artifacts to be lost.

Fixes: containers/podman#27569
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
Generated-with: Cursor AI
```go
	return nil, fmt.Errorf("%s: %w", dest, libartTypes.ErrArtifactAlreadyExists)
	}
}
```
The original imageDest, as identified in the original PR description, continues to exist, will be used below, and contains the cached data. So this no longer fixes the race at all.
(*grumble* I think ferrying data between an AI and review comments with little human review is effectively equivalent to asking the reviewers to do the work themselves with the help of an AI. In that sense, it takes advantage of the PR as a signal “I care about this so much to do the work, so you should care about this more than about just a filed issue”, and risks breaking that signal. In the worst case, PRs will stay in the same queue as issues, with no reason to look at them with any extra priority.)