Add support for gradient clipping. #94
Conversation
/build_and_test
🚀 Build workflow triggered! View run
✅ Build workflow passed! View run
romerojosh left a comment:
Thanks for adding this feature!
Left a small comment about SAC grad clipping. There are also some merge conflicts that need to be resolved after merging in #93.
if (alpha_lr_scheduler) {
  alpha_lr_scheduler->step();
  // I think we do not need grad clipping here
  // the model is just a scalar and clipping the grad for that might hurt
I noted this in the previous version of this MR. What does stable-baselines do in this case (if they support grad clipping)? We should match that as a reference if we can.
Yes, stable-baselines does not support gradient accumulation or clipping, except for PPO.
However, I think gradient accumulation makes sense for torchfort in general because of the online nature of training: it lets us use larger effective batch sizes without a replay buffer.
Concerning grad clipping, it's optional, and I do not see why it should hurt when the support is there. We can add it here as well.
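For reference, a minimal libtorch sketch of what optional clipping of the scalar entropy-coefficient gradient could look like; identifiers such as log_alpha and the learning rate are illustrative assumptions, not torchfort's actual API:

```cpp
// Hypothetical sketch: clipping the gradient of a scalar entropy
// coefficient (log_alpha) in a SAC-style update. Names are
// illustrative, not torchfort's actual identifiers.
#include <torch/torch.h>

#include <vector>

int main() {
  // Scalar learnable parameter, standing in for SAC's entropy temperature.
  torch::Tensor log_alpha = torch::zeros({1}, torch::requires_grad());
  torch::optim::Adam alpha_optimizer({log_alpha},
                                     torch::optim::AdamOptions(3e-4));

  // Dummy loss standing in for the alpha loss.
  auto alpha_loss = -(log_alpha * 2.0).mean();

  alpha_optimizer.zero_grad();
  alpha_loss.backward();

  // Optional clipping: clip_grad_norm_ only rescales when the gradient
  // norm exceeds max_norm, so it is a no-op otherwise.
  const double max_norm = 1.0;
  torch::nn::utils::clip_grad_norm_(std::vector<torch::Tensor>{log_alpha},
                                    max_norm);

  alpha_optimizer.step();
  return 0;
}
```

Since clip_grad_norm_ only rescales when the norm exceeds max_norm, enabling it for a scalar parameter should be harmless in typical runs.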
Signed-off-by: Thorsten Kurth <tkurth@nvidia.com>
Force-pushed from f510a76 to cc498c8
/build_and_test
🚀 Build workflow triggered! View run
✅ Build workflow passed! View run
This MR adds gradient clipping to all training procedures.
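As a rough illustration of the pattern described in this PR (clip just before the optimizer step, optionally after accumulating gradients over several backward calls, as discussed above), here is a hedged libtorch sketch; the model, loss, and step counts are placeholders, not torchfort code:

```cpp
// Minimal sketch of clip-before-step combined with gradient
// accumulation. All names here are illustrative assumptions.
#include <torch/torch.h>

int main() {
  torch::nn::Linear model(4, 1);
  torch::optim::SGD optimizer(model->parameters(), /*lr=*/1e-2);

  const int accumulation_steps = 4;
  const double max_norm = 1.0;

  optimizer.zero_grad();
  for (int step = 0; step < accumulation_steps; ++step) {
    auto input = torch::randn({8, 4});
    auto target = torch::randn({8, 1});
    // Scale the loss so the accumulated gradients match one large batch.
    auto loss =
        torch::mse_loss(model->forward(input), target) / accumulation_steps;
    loss.backward();
  }

  // Clip once over the accumulated gradients, just before the update.
  torch::nn::utils::clip_grad_norm_(model->parameters(), max_norm);
  optimizer.step();
  optimizer.zero_grad();
  return 0;
}
```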