Skip to content

Commit 063443e

Browse files
committed
fix typo
Signed-off-by: tanpinsiang <[email protected]>
1 parent 652f6e0 commit 063443e

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

_posts/2025-02-24-ptpc-fp8-rocm.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -64,7 +64,7 @@ The illustration shows two quantization approaches:
6464

6565
**Scaling Factors:**
6666
- **Top (Per-Tensor)**: Single scalars ΔX[1] and ΔW[1] for entire tensors
67-
- **Bottom (PTPC)**: Vector ΔX[T×1] with one scale per token and ΔW[1×Co] with one scale per output channel
67+
- **Bottom (PTPC)**: Vector ΔX[T×1] with one scale per token and ΔW[1×Co] with one scale per input channel
6868

6969
This granular scaling approach allows PTPC-FP8 to achieve accuracy close to BF16 while maintaining the speed and memory benefits of 8-bit computation.
7070

0 commit comments

Comments
 (0)