-
Notifications
You must be signed in to change notification settings - Fork 20
Open
Description
Hi Tom,
I read carefully your code, since I want to implement a network very similar to CWRNN. Your code is really clear, thanks!
However, I found it seems that you didn't correctly implement CWRNN, there is a "little" bug. If you take the hidden_W variable value, for instance I take num_hidden=4, periods = [1, 2], and print the hidden_W change, I got:
...
[[ 1.19209290e-07 -1.71363354e-05 0.00000000e+00 0.00000000e+00]
[ 0.00000000e+00 -4.02897596e-04 0.00000000e+00 0.00000000e+00]
[ 0.00000000e+00 0.00000000e+00 0.00000000e+00 0.00000000e+00]
[ 0.00000000e+00 0.00000000e+00 0.00000000e+00 0.00000000e+00]]
...
that's to say your code just use 2 units of hidden_W. I think you forget to set the group_size of each period.
I think it's better to use tf.where rather than group_index, the group_index may cause lots of problem. Check https://github.com/braingineer/ikelos/blob/master/ikelos/layers/cwrnn.py, or later I will give a tensorflow implement.
Metadata
Metadata
Assignees
Labels
No labels