[warning] loading model fails! #18

Godlovecui · 2024-11-08T03:51:30Z

Hello，
The quantized file suffix is ".safetensor", however, when I execute ./bin/llama_example, it output the below warning info，

where can I get these ".bin" files?
Thank you!

lswzjuer · 2025-02-05T03:47:30Z

Hello, where did you get the W8A16 model weights? We open source the simulated quantized ckpts, which is only used for accuracy verification. If actual inference is performed, packing preprocessing is required.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[warning] loading model fails! #18

[warning] loading model fails! #18

Godlovecui commented Nov 8, 2024 •

edited

Loading

lswzjuer commented Feb 5, 2025

[warning] loading model fails! #18

[warning] loading model fails! #18

Comments

Godlovecui commented Nov 8, 2024 • edited Loading

lswzjuer commented Feb 5, 2025

Godlovecui commented Nov 8, 2024 •

edited

Loading