Hi @Animadversio!
First, I'd like to commend you on your work; it's truly impressive! I've been closely examining the sample_X_and_score experiment in the ReverseSDE_Diffusion.ipynb notebook and have a few points I'd like to clarify.
In your approach of approximating the analytical score with a neural network, I noticed that no normalization seems to be applied to the ground-truth score when the data/ground-truth pairs are constructed. However, in the training loop below, the squared error is weighted by std_vec (the marked line), which looks like a form of normalization. This has left me a bit puzzled: y_pred is normalized, while y_train (which serves as the ground truth) is not normalized by the timestep, giving the impression that the two might be on different scales.
-------- code ----------
for ep in pbar:
    y_pred = score_model_analy(X_train, T_train)
    loss = torch.mean(torch.sum((y_pred - y_train) ** 2 * std_vec[:, None], dim=1))  # <======= here
    # (optimizer zero_grad / backward / step omitted in this excerpt)
-------- code ----------
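For reference, here is a rough sketch of how I imagine the data/ground-truth pairs could be built (this is purely my own reconstruction for discussion, assuming a VE-SDE-style Gaussian perturbation; the function name sample_X_and_score_sketch and the use of marginal_prob_std_f are my assumptions, not code from the notebook):
-------- code ----------
import torch

def sample_X_and_score_sketch(x0, marginal_prob_std_f, eps=1e-3):
    """Sketch: perturb clean data x0 and return the unnormalized ground-truth score."""
    t = torch.rand(x0.shape[0]) * (1.0 - eps) + eps     # random timesteps in (eps, 1]
    std = marginal_prob_std_f(t)                        # marginal std sigma(t)
    noise = torch.randn_like(x0)
    x_t = x0 + std[:, None] * noise                     # perturbed samples (VE-SDE assumption)
    score_gt = -(x_t - x0) / std[:, None] ** 2          # conditional score; magnitude ~ 1 / sigma(t)
    return x_t, t, score_gt, std
-------- code ----------
Under a construction like this, y_train would carry a timestep-dependent 1/sigma(t) scale even without any explicit normalization step, which is what I was referring to above.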
Moreover, in the forward method, the output is normalized:
-------- code ----------
def forward(self, x, t):
    t_embed = self.embed(t)
    pred = self.net(torch.cat((x, t_embed), dim=1))
    pred = pred / self.marginal_prob_std_f(t)[:, None]  # <======= here
    return pred
-------- code ----------
Given this, I'm curious how y_pred and y_train can meaningfully be subtracted and compared in the loss if they are potentially on different scales.
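To make the question concrete, this is the kind of magnitude check I have in mind (a hypothetical snippet that just reuses the variable names from the training-loop excerpt above and assumes the model exposes marginal_prob_std_f as in its forward method):
-------- code ----------
with torch.no_grad():
    y_pred = score_model_analy(X_train, T_train)               # output already divided by sigma(t) in forward
    std_vec = score_model_analy.marginal_prob_std_f(T_train)   # sigma(t) for each training timestep
    print("mean |y_pred| :", y_pred.abs().mean().item())
    print("mean |y_train|:", y_train.abs().mean().item())
    print("mean sigma(t) :", std_vec.mean().item())
-------- code ----------
If the two magnitudes were to drift apart systematically with t, that would be exactly the scale mismatch I'm asking about.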
I've also taken a look at the Colab notebook you provided (link) and attached an image for reference:

I'd greatly appreciate your insights on this matter. Thank you for your time and patience.
Warm regards,