Skip to content

Conversation

@azrael417
Copy link
Collaborator

This MR ensures the correct reading of gradient accumulation parameters for all RL algorithms.

@azrael417 azrael417 requested a review from romerojosh November 11, 2025 06:14
@azrael417 azrael417 self-assigned this Nov 11, 2025
Signed-off-by: Thorsten Kurth <tkurth@nvidia.com>
@azrael417 azrael417 force-pushed the tkurth/grad-acc-fixes branch from 5563b8a to a3f43f1 Compare November 11, 2025 15:03
@romerojosh
Copy link
Collaborator

/build_and_test

@github-actions
Copy link

🚀 Build workflow triggered! View run

@github-actions
Copy link

✅ Build workflow passed! View run

Copy link
Collaborator

@romerojosh romerojosh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Thanks for fixing this feature for RL.

@romerojosh romerojosh merged commit e79b873 into NVIDIA:master Nov 12, 2025
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants