
"Missing or incorrect absmax2 handling in q_galore_adamw8bit.py #10

Open

LLMresearcher opened this issue Jan 13, 2025 · 0 comments

@LLMresearcher
When using the 8-bit AdamW optimizer in q_galore_adamw8bit.py, the call to optimizer_update_8bit_blockwise() fails because the required absmax2 argument is never passed. It looks like absmax2 is expected to be a tensor.

[rank0]: File "/root/Q-GaLore/q_galore_torch/q_galore_adamw8bit.py", line 200, in update_step
[rank0]: F.optimizer_update_8bit_blockwise(
[rank0]: TypeError: optimizer_update_8bit_blockwise() missing 1 required positional argument: 'absmax2'
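
For anyone debugging this, below is a minimal sketch of what the call in update_step presumably needs, modelled on bitsandbytes' Optimizer2State. All state keys (state1, state2, qmap1, qmap2, absmax1, absmax2), the blocksize, and the helper function name are assumptions for illustration, not taken from the Q-GaLore source, and the exact positional argument order depends on the installed bitsandbytes version (newer releases changed this signature):

```python
# Hypothetical sketch only; field names are assumed, not from q_galore_adamw8bit.py.
import torch
import bitsandbytes.functional as F  # the "F" alias seen in the traceback


def _update_step_sketch(p, grad, state, beta1, beta2, eps, lr, weight_decay):
    blocksize = 256
    n_blocks = (p.numel() + blocksize - 1) // blocksize

    # Each quantized moment (state1 / state2) needs its own per-block absmax
    # buffer for dequantization; the traceback suggests the buffer for the
    # second moment (absmax2) is never created or never passed.
    if "absmax2" not in state:
        state["absmax2"] = torch.zeros(n_blocks, dtype=torch.float32, device=p.device)

    F.optimizer_update_8bit_blockwise(
        "adam",
        grad, p,
        state["state1"], state["state2"],    # quantized 1st / 2nd moments (uint8)
        beta1, beta2, eps,
        state["step"], lr,
        state["qmap1"], state["qmap2"],      # quantization maps
        state["absmax1"], state["absmax2"],  # per-block absmax for each moment
        weight_decay,
        gnorm_scale=1.0,
    )
```

If the signatures line up, the practical fix is likely either allocating and passing this second absmax tensor, or pinning bitsandbytes to the version the repo was written against.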
