Conversation

@romerojosh
Collaborator

This PR fixes an issue in the handling of the optional extra_loss_args argument to the TorchFort supervised training function. A call to the reset routine was missing; this call is needed to preserve references to the user data in cases where the TorchFort backend has to migrate the user data from CPU to GPU (or GPU to CPU) for a training step. Without this reset, any changes the user makes to the arrays passed to torchfort_tensor_list_add_tensor for the extra loss arguments list would not propagate after the first training step. Workloads where the model and the extra loss args data already reside on the same device are not affected by this issue.

I've adjusted the supervised training tests to better cover this scenario.
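To illustrate the failure mode described above, here is a minimal, hypothetical sketch (in Python, not the actual TorchFort C++ implementation): a backend that migrates user data to another device caches a copy of the user's buffer, and unless it "resets" (re-reads) from the user's reference before each step, updates made between steps are silently dropped. The class and method names are invented for illustration only.

```python
import numpy as np

class ExtraArgCache:
    """Illustrative stand-in for a backend that must migrate user data
    to another device before each training step. Hypothetical; not the
    actual TorchFort implementation."""

    def __init__(self, user_array, reset_each_step):
        self.user_array = user_array          # reference to the user's buffer
        self.reset_each_step = reset_each_step
        self.device_copy = user_array.copy()  # initial "migration" to device

    def train_step(self):
        if self.reset_each_step:
            # The fix: re-read from the user's buffer so that changes
            # made between steps propagate into the migrated copy.
            self.device_copy = self.user_array.copy()
        return self.device_copy.sum()

user = np.array([1.0, 2.0, 3.0])
buggy = ExtraArgCache(user, reset_each_step=False)
fixed = ExtraArgCache(user, reset_each_step=True)

buggy.train_step()
fixed.train_step()                 # first step: both see [1, 2, 3]
user[:] = [10.0, 20.0, 30.0]       # user updates the extra loss args
print(buggy.train_step())          # 6.0  -> stale copy, update lost
print(fixed.train_step())          # 60.0 -> reset picks up new values
```

After the first step, only the variant that resets each step observes the user's updated values, which mirrors the behavior fixed in this PR.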

@romerojosh
Collaborator Author

/build_and_test

@github-actions

🚀 Build workflow triggered! View run

@github-actions

❌ Build workflow failed! View run

…e to user data.

Signed-off-by: Josh Romero <joshr@nvidia.com>
@romerojosh
Collaborator Author

/build_and_test

@github-actions

🚀 Build workflow triggered! View run

@github-actions

✅ Build workflow passed! View run

@romerojosh romerojosh merged commit 35c26d9 into master Dec 2, 2025
4 checks passed
@romerojosh romerojosh deleted the extra_loss_arg_reset branch December 5, 2025 00:20