The number of work-items in each dimension of a work-group cannot exceed {512, 512, 512} for this device
Exception caught at file:/home/runner/_work/llm.cpp/llm.cpp/ollama-llama-cpp/ggml/src/ggml-sycl/ggml-sycl.cpp, line:4463
I'm using this container, running on NixOS: https://github.com/mattcurf/ollama-intel-gpu
podman build -t "ollama-intel-gpu" .
podman run --rm -p 127.0.0.1:11434:11434 -v /home/stereomato/models:/mnt -v ollama-volume:/root/.ollama -e OLLAMA_NUM_PARALLEL=1 -e OLLAMA_MAX_LOADED_MODELS=1 -e OLLAMA_FLASH_ATTENTION=1 -e OLLAMA_NUM_GPU=999 -e DEVICE=iGPU --device /dev/dri --name=ollama-intel-gpu
podman exec -it ollama-intel-gpu bash
./ollama pull deepseek-v2:16b (the q4_k_m 16b variant exhibits the same issue)
./ollama run deepseek-v2 "hello deepseek"
Then, I get the error in the title/first two lines of this bug report.
HW:
Intel Core i5-12500H
Intel Xe Graphics (Alder Lake)
24GB of RAM
up to date NixOS
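
For anyone triaging: the message says a kernel was launched with a work-group dimension larger than the 512 this iGPU allows. Below is a minimal sketch of the kind of per-dimension clamp that would avoid the exception. It is not taken from ggml-sycl.cpp; `clamp_work_group` is a hypothetical helper, and I'm assuming the limit would come from the device query (in SYCL 2020, `sycl::info::device::max_work_item_sizes<3>`).

```cpp
#include <algorithm>
#include <array>
#include <cstddef>

// Hypothetical helper, not part of ggml or SYCL: clamp each requested
// work-group dimension to the per-dimension limit reported by the device.
std::array<std::size_t, 3> clamp_work_group(
    const std::array<std::size_t, 3>& requested,
    const std::array<std::size_t, 3>& device_max) {
    std::array<std::size_t, 3> clamped{};
    for (std::size_t i = 0; i < 3; ++i) {
        clamped[i] = std::min(requested[i], device_max[i]);
    }
    return clamped;
}
```

With the limits from the error message, a request of {1, 1, 1024} would become {1, 1, 512}. Clamping alone is only safe if the kernel does not assume a fixed work-group size, so a real fix might also need the kernel to loop over the remaining elements.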