In my opinion, authors define L_{vlb} = L_0 + ... + L_T, not L_t. #147

unl1002 · 2024-11-23T14:21:09Z

In my opinion, the authors define L_{vlb} = L_0 + ... + L_T, not L_t.

Thus, they may compute the VLB loss with a scale factor of T (self.num_timestep).
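
To illustrate why a factor of T would make sense here, the sketch below checks numerically that sampling t uniformly and scaling L_t by T gives an unbiased estimate of the full sum. It is only a sketch under assumptions: the per-timestep loss values are made up, the function names `expected_scaled_loss` and `vlb_estimate` are hypothetical and not from the repository, and the terms are assumed to be indexed t = 0, ..., T-1 (T terms in total).

```python
import random


def expected_scaled_loss(per_step_losses):
    """Exact expectation of T * L_t when t is drawn uniformly from {0, ..., T-1}."""
    T = len(per_step_losses)
    # Each timestep has probability 1/T, so the factor T cancels and the sum remains.
    return sum((1.0 / T) * (T * loss) for loss in per_step_losses)


def vlb_estimate(per_step_losses, num_samples, seed=0):
    """Monte Carlo estimate of sum(L_t) using one uniformly sampled t per draw."""
    rng = random.Random(seed)
    T = len(per_step_losses)
    total = 0.0
    for _ in range(num_samples):
        t = rng.randrange(T)             # t ~ Uniform{0, ..., T-1}
        total += T * per_step_losses[t]  # scale the single term by T
    return total / num_samples


# Hypothetical per-timestep losses, for illustration only.
losses = [0.5, 0.2, 0.1, 0.05]

exact_sum = sum(losses)
print(exact_sum, expected_scaled_loss(losses), vlb_estimate(losses, 100_000))
```

Under these assumptions, a single T * L_t is a noisy but unbiased stand-in for the whole sum, and averaging over many training steps recovers L_{vlb}.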

Originally posted by @yhy258 in #114 (comment)


unl1002 · 2024-11-23T14:26:41Z

So does this mean we use L_t * T (self.num_timestep) to approximate L_{vlb}?

unl1002 · 2024-11-27T09:15:30Z

unl1002 closed this as completed Nov 23, 2024

unl1002 reopened this Nov 23, 2024
