You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, I'm trying to understand Activation-aware Reorder. I know it groups channels with similar salience, but I'm unsure why this makes them easier to quantize. Could you help to elaborate more on the benefit of this grouping? Thank you.
The text was updated successfully, but these errors were encountered:
Hi. We empirically found that group channels with similar salience can result in better quantization accuracy. We hypothesize that it might be related to the weight-activation scaling process. Specifically, we group the weight corresponding to larger activations to the same group. Before the quantization, we scale between weight and activation, which may lead to weight in the same group has similar magnitudes. That's why it may help to reduce the quantization error.
Hi, I'm trying to understand Activation-aware Reorder. I know it groups channels with similar salience, but I'm unsure why this makes them easier to quantize. Could you help to elaborate more on the benefit of this grouping? Thank you.
The text was updated successfully, but these errors were encountered: