Skip to content

parthh01/rl_stuff

Repository files navigation

#let's play with an implementation of grpo with self play on long episodic tasks

About

reinforcement learning experiments

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages