You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
/scripts/fetch-model.py can't handle these split models by itself, using the model parameter.
Would it be possible to add a split syntax, so that a pod can be started with this environment variable, download the big model, stitch it together, delete the parts? E.g.
Your larger models are split into 50GB chunks due to huggingface file upload limits. Example: https://huggingface.co/TheBloke/goliath-120b-GGUF/tree/main
/scripts/fetch-model.py can't handle these split models by itself, using the model parameter.
Would it be possible to add a split syntax, so that a pod can be started with this environment variable, download the big model, stitch it together, delete the parts? E.g.
MODEL=https://huggingface.co/TheBloke/goliath-120b-GGUF/tree/main/goliath-120b.Q4_K_M.gguf-split-a,https://huggingface.co/TheBloke/goliath-120b-GGUF/tree/main/goliath-120b.Q4_K_M.gguf-split-b
The text was updated successfully, but these errors were encountered: