Skip to content

Commit b1cc4df

Browse files
authored
[VLM] Support loading InternVideo2.5 models as original InternVLChatModel (vllm-project#14738)
Signed-off-by: Isotr0py <[email protected]>
1 parent 3824039 commit b1cc4df

File tree

2 files changed

+10
-3
lines changed

2 files changed

+10
-3
lines changed

docs/source/models/supported_models.md

+2-2
Original file line numberDiff line numberDiff line change
@@ -786,9 +786,9 @@ See [this page](#generative-models) for more information on how to use generativ
786786
*
787787
* ✅︎
788788
- * `InternVLChatModel`
789-
* InternVL 2.5, Mono-InternVL, InternVL 2.0
789+
* InternVideo 2.5, InternVL 2.5, Mono-InternVL, InternVL 2.0
790790
* T + I<sup>E+</sup>
791-
* `OpenGVLab/InternVL2_5-4B`, `OpenGVLab/Mono-InternVL-2B`, `OpenGVLab/InternVL2-4B`, etc.
791+
* `OpenGVLab/InternVideo2_5_Chat_8B`, `OpenGVLab/InternVL2_5-4B`, `OpenGVLab/Mono-InternVL-2B`, `OpenGVLab/InternVL2-4B`, etc.
792792
*
793793
* ✅︎
794794
* ✅︎

vllm/model_executor/models/internvl.py

+8-1
Original file line numberDiff line numberDiff line change
@@ -981,5 +981,12 @@ def sample(
981981

982982
def load_weights(self, weights: Iterable[Tuple[str,
983983
torch.Tensor]]) -> Set[str]:
984-
loader = AutoWeightsLoader(self)
984+
# unused modules appear in OpenGVLab/InternVideo2_5_Chat_8B
985+
skip_prefixes = [
986+
"action_embed", "temporal_embed", "track_embed",
987+
"track_embed_decoder", "box_token", "cg_criterion", "cg_model",
988+
"loc_encoder", "loc_decoder", "sam", "temporal_token",
989+
"track_token"
990+
]
991+
loader = AutoWeightsLoader(self, skip_prefixes=skip_prefixes)
985992
return loader.load_weights(weights)

0 commit comments

Comments
 (0)