Does torchao support FP8 Grouped GEMM? #1928

zigzagcai · 2025-03-20T03:28:37Z

Grouped GEMM kernels (https://github.com/fanshiqing/grouped_gemm) are used in many MoE models.

I just wander does torchao support FP8 kernels for Grouped GEMM, such like the three commonly used ops:

grouped_gemm.backend.gmm
grouped_gemm.ops.unpermute
grouped_gemm.ops.permute

The text was updated successfully, but these errors were encountered:

vkuzo · 2025-03-20T12:55:35Z

hi @zigzagcai , we recently landed a grouped gemm API into core which includes fp8: pytorch/pytorch#148531 . We plan to provide wrappers in torchao, although we do not have them just yet. cc @drisspg

zigzagcai · 2025-03-20T14:59:52Z

hi @zigzagcai , we recently landed a grouped gemm API into core which includes fp8: pytorch/pytorch#148531 . We plan to provide wrappers in torchao, although we do not have them just yet. cc @drisspg

Thank you @vkuzo !
I just wander how can I use this aten newly needed grouped gemm ops?

supriyar · 2025-03-20T18:23:39Z

cc @HDCharles who has been looking into MoE quantization and grouped gemm recently

HDCharles · 2025-03-20T22:05:39Z

Hey,

I'm working to enable our existing quantization kernels to compose with group gemm its still in progress at the moment. As far as the core kernel, you can look at: https://github.com/pytorch/pytorch/pull/148531/files#diff-3f31c52b48cfddf8f4617d809f7695b2e4a1c78656f8c4b5143a4b45d01fcf23R1178

...for an example

jeromeku · 2025-03-22T12:32:14Z

@HDCharles @vkuzo

Interested in this as well and potentially helping tune the kernel.

There is a link mentioned in the grouped gemm PR describing the design of the grouped GEMM. How can I view the doc (access seems to be gated)?

drisspg added the float8 label Mar 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Does torchao support FP8 Grouped GEMM? #1928

Does torchao support FP8 Grouped GEMM? #1928

zigzagcai commented Mar 20, 2025

vkuzo commented Mar 20, 2025

zigzagcai commented Mar 20, 2025

supriyar commented Mar 20, 2025

HDCharles commented Mar 20, 2025

jeromeku commented Mar 22, 2025

Does torchao support FP8 Grouped GEMM? #1928

Does torchao support FP8 Grouped GEMM? #1928

Comments

zigzagcai commented Mar 20, 2025

vkuzo commented Mar 20, 2025

zigzagcai commented Mar 20, 2025

supriyar commented Mar 20, 2025

HDCharles commented Mar 20, 2025

jeromeku commented Mar 22, 2025