Commit d0cfec7

[bugfix] fix inductor cache on max_position_embeddings (#15436)
Signed-off-by: youkaichao <[email protected]>
1 parent a608160 commit d0cfec7

1 file changed, +3 -0 lines changed

vllm/config.py (+3)
@@ -221,6 +221,9 @@ def compute_hash(self) -> str:
         factors.append(self.trust_remote_code)
         factors.append(self.rope_scaling)
         factors.append(self.rope_theta)
+        # rope cos/sin cache depends on the max_position_embeddings
+        factors.append(
+            getattr(self.hf_config, "max_position_embeddings", "None"))
         return hashlib.sha256(str(factors).encode()).hexdigest()
 
     def __init__(
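
The effect of the added factor can be illustrated with a minimal standalone sketch. This is not vLLM's actual `ModelConfig`: the free function `compute_hash`, its `include_max_pos` flag, and the `SimpleNamespace` stand-ins for `hf_config` are hypothetical, introduced only to compare hashing with and without the new factor. It assumes, as the commit comment states, that the rope cos/sin cache depends on `max_position_embeddings`, so two configs that differ only in that value should not share a cache key.

```python
import hashlib
from types import SimpleNamespace


def compute_hash(hf_config, rope_scaling=None, rope_theta=10000.0,
                 include_max_pos=True):
    """Hypothetical stand-in for the hashing logic shown in the diff."""
    factors = []
    factors.append(rope_scaling)
    factors.append(rope_theta)
    if include_max_pos:
        # Mirrors the added lines: fall back to the string "None" when
        # the attribute is missing from the config object.
        factors.append(
            getattr(hf_config, "max_position_embeddings", "None"))
    return hashlib.sha256(str(factors).encode()).hexdigest()


# Two illustrative configs that differ only in max_position_embeddings.
short_ctx = SimpleNamespace(max_position_embeddings=4096)
long_ctx = SimpleNamespace(max_position_embeddings=32768)

# Without the factor, both configs map to the same hash.
old_collides = (compute_hash(short_ctx, include_max_pos=False)
                == compute_hash(long_ctx, include_max_pos=False))

# With the factor, the hashes differ.
new_differs = compute_hash(short_ctx) != compute_hash(long_ctx)

print(old_collides, new_differs)
```

Under these assumptions, the pre-fix hash is identical for both configs, while the post-fix hash separates them. A config without the attribute still hashes successfully because of the `"None"` fallback.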
