
Fix vLLM max tokens #570

Open · wants to merge 1 commit into base: main
Conversation

@Datta0 Datta0 commented Feb 18, 2025

vLLM declares max_tokens as Optional[int], but it is defaulted to 16 here.
That means whenever sampling_params is created, max_tokens takes the value 16, so sampling_params.max_tokens always ends up equal to 16.
The lighteval benchmark then goes on to warn that the output is not in the Gold Format ...
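For illustration, a minimal sketch of the behaviour being described (assuming a stock vLLM install; the constructor arguments are only for demonstration):

```python
from vllm import SamplingParams

# vLLM annotates max_tokens as Optional[int], but the default value is 16,
# not None, so an "unset" value still caps generation length.
params = SamplingParams(temperature=0.0)
print(params.max_tokens)  # -> 16

# If the caller never overrides max_tokens explicitly, every generation is
# silently truncated to 16 tokens, long before a full answer can be produced.
```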

@NathanHB NathanHB (Member) left a comment


Hi! The behaviour we want is for the max_new_tokens defined in the task config to be used by default, except when it is set in the model's generation config.
Given that, the fix would be to set the sampling_params.max_tokens default value to None.
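If helpful, a rough sketch of that priority order (the names `build_sampling_params`, `generation_config`, and `task_max_new_tokens` are placeholders for illustration, not lighteval's actual API):

```python
from typing import Optional
from vllm import SamplingParams

def build_sampling_params(generation_config: dict, task_max_new_tokens: int) -> SamplingParams:
    # Default to None instead of relying on vLLM's implicit 16 ...
    max_tokens: Optional[int] = generation_config.get("max_new_tokens")
    if max_tokens is None:
        # ... so the task config's max_new_tokens wins whenever the model's
        # generation config does not set an explicit limit.
        max_tokens = task_max_new_tokens
    return SamplingParams(max_tokens=max_tokens)
```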

@Datta0 Datta0 (Author) commented Feb 18, 2025

Is there a way to control sampling params when invoking lighteval from the CLI? I couldn't find it in the documentation, unfortunately...
