[FT] Propagate batch size control for vLLM backend #573

alvin319 · 2025-02-18T20:54:44Z

Issue encountered

With the vLLM backend, there is currently no way to control the batch size defined here, and the vLLM model config offers no way to set a specific batch size. However, vLLM itself lets us cap the maximum number of sequences (its notion of batch size) directly, as in examples such as this.

Solution/Feature

Propagate the max_num_seqs parameter into the initialization of the vLLM model.
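A minimal sketch of what the propagation could look like, assuming the model config gains an optional `max_num_seqs` field. `VLLMModelConfigSketch`, `build_llm_kwargs`, and the model name are hypothetical and only illustrate the idea; `max_num_seqs` itself is an existing vLLM engine argument that `vllm.LLM` accepts as a keyword.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass
class VLLMModelConfigSketch:
    """Hypothetical stand-in for the vLLM model config (not the real class)."""

    pretrained: str
    dtype: str = "auto"
    # None means "do not override": vLLM keeps its own default.
    max_num_seqs: Optional[int] = None


def build_llm_kwargs(config: VLLMModelConfigSketch) -> Dict[str, Any]:
    """Collect the keyword arguments that would be passed to vllm.LLM."""
    kwargs: Dict[str, Any] = {"model": config.pretrained, "dtype": config.dtype}
    if config.max_num_seqs is not None:
        if config.max_num_seqs < 1:
            raise ValueError("max_num_seqs must be a positive integer")
        # Only forwarded when the user set it explicitly.
        kwargs["max_num_seqs"] = config.max_num_seqs
    return kwargs


# "my-org/my-model" is a placeholder model name.
default_kwargs = build_llm_kwargs(VLLMModelConfigSketch(pretrained="my-org/my-model"))
capped_kwargs = build_llm_kwargs(
    VLLMModelConfigSketch(pretrained="my-org/my-model", max_num_seqs=16)
)

# With vLLM installed, the model would then be created roughly as:
#   from vllm import LLM
#   llm = LLM(**capped_kwargs)
```

Leaving the field as `None` by default keeps current behaviour unchanged for anyone who does not set it.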

Possible alternatives

The alternative is to implement batching ourselves, which would be overkill since the vLLM backend already supports it.


alvin319 · 2025-02-18T20:57:42Z

I'm happy to take this on, but I figured it warrants a discussion since vLLM leverages continuous batching, which behaves differently than the common understanding of a "fixed" batch size in the LightEval world.
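To make the difference concrete, here is a toy comparison, not vLLM's actual scheduler. It assumes each request needs a known number of decode steps and ignores prefill, memory limits, and preemption. `fixed_batch_steps` and `continuous_batch_steps` are hypothetical helper names.

```python
import heapq
from typing import List


def fixed_batch_steps(lengths: List[int], batch_size: int) -> int:
    """Fixed batching: each batch runs until its longest request finishes."""
    total = 0
    for i in range(0, len(lengths), batch_size):
        total += max(lengths[i : i + batch_size])
    return total


def continuous_batch_steps(lengths: List[int], max_num_seqs: int) -> int:
    """Continuous batching: a waiting request starts as soon as a slot frees up."""
    slots: List[int] = []  # min-heap of the step at which each busy slot frees up
    finish = 0
    for n in lengths:
        # Start at step 0 while slots are free, otherwise when the earliest one ends.
        start = 0 if len(slots) < max_num_seqs else heapq.heappop(slots)
        end = start + n
        heapq.heappush(slots, end)
        finish = max(finish, end)
    return finish


lengths = [5, 1, 1, 1]  # decode steps per request (made-up numbers)
fixed = fixed_batch_steps(lengths, batch_size=2)
continuous = continuous_batch_steps(lengths, max_num_seqs=2)
```

In this toy setting the fixed scheme takes 6 steps and the continuous one takes 5, because the short requests fill the slot freed next to the long one. Under continuous batching, `max_num_seqs` therefore acts as a cap on concurrency rather than a fixed batch size.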

alvin319 added the feature request New feature/request label Feb 18, 2025
