@galatolofederico
Contributor

Sorry if I am opening so many PRs 😅
There was a bug in the td.discount function when using it in a vectorized environment. I fixed it and wrote a test that computes the discounted rewards of a replay coming from a non-vectorized environment and the discounted rewards of a replay coming from a vectorized environment, and checks that they are the same.
This PR is needed to make #26 work because my A2C implementation uses a vectorized environment.
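
To illustrate the kind of check described above, here is a minimal sketch in plain Python. It is not the code from this PR: `discount` and `discount_vectorized` are hypothetical helper names, and nested lists (time steps by environments) stand in for the tensors a real replay would hold. The idea is only that discounting each environment's column independently should match the single-environment result.

```python
def discount(gamma, rewards, dones, bootstrap=0.0):
    """Discounted returns for one environment (hypothetical helper).

    rewards and dones are lists of length T.
    """
    returns = [0.0] * len(rewards)
    running = bootstrap
    for t in reversed(range(len(rewards))):
        # A terminal step cuts the return off from whatever follows it.
        running = rewards[t] + gamma * running * (1.0 - float(dones[t]))
        returns[t] = running
    return returns


def discount_vectorized(gamma, rewards, dones, bootstrap=None):
    """Same recursion applied per environment (hypothetical helper).

    rewards and dones are T x N nested lists, one column per environment.
    """
    num_envs = len(rewards[0])
    running = list(bootstrap) if bootstrap is not None else [0.0] * num_envs
    returns = [[0.0] * num_envs for _ in rewards]
    for t in reversed(range(len(rewards))):
        for i in range(num_envs):
            # Each environment keeps its own running return and done flag.
            not_done = 1.0 - float(dones[t][i])
            running[i] = rewards[t][i] + gamma * running[i] * not_done
            returns[t][i] = running[i]
    return returns


# Two environments, three steps each; the second one terminates at step 0.
rewards = [[1.0, 0.0], [1.0, 2.0], [1.0, 3.0]]
dones = [[False, True], [False, False], [True, True]]
vectorized = discount_vectorized(0.5, rewards, dones)

# Each column of the vectorized result must equal the single-env result.
for i in range(2):
    single = discount(0.5, [r[i] for r in rewards], [d[i] for d in dones])
    assert [row[i] for row in vectorized] == single
```

Keeping a separate running return per environment is what prevents a terminal step in one environment from resetting the return of another.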

@seba-1511
Member

This looks great @galatolofederico, thanks a lot for catching this bug!

I'll ask for a small change: could you add one line about this PR in CHANGELOG.md explaining the fix and ending with your GitHub handle? This makes it easy to keep track of what changed and when.

Thanks again!

@seba-1511 merged commit d44cfe1 into learnables:master on Feb 27, 2020