[Misc] Add transpose optimization in the linear layer #280

rjg-lyh · 2025-03-09T08:50:08Z

What this PR does / why we need it?

In order to improve performance, extract the internal transpose operation and optimize it by transposing the Linear layer's weights after the model weights are loaded, when performing the forward inference of the Linear layer using the default non-quantized method.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Comprehensive unit tests have been performed in another PR.

Signed-off-by: rjg-lyh <[email protected]>

[v0.7.3][Misc] Add transpose optimization in the linear layer

7da8529

Signed-off-by: rjg-lyh <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Misc] Add transpose optimization in the linear layer #280

[Misc] Add transpose optimization in the linear layer #280

rjg-lyh commented Mar 9, 2025

[Misc] Add transpose optimization in the linear layer #280

Are you sure you want to change the base?

[Misc] Add transpose optimization in the linear layer #280

Conversation

rjg-lyh commented Mar 9, 2025

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?