

ollama + deepseek v2: The number of work-items in each dimension of a work-group cannot exceed {512, 512, 512} for this device #12839

Open · stereomato opened this issue Feb 17, 2025 · 2 comments

@stereomato

```
The number of work-items in each dimension of a work-group cannot exceed {512, 512, 512} for this device
Exception caught at file:/home/runner/_work/llm.cpp/llm.cpp/ollama-llama-cpp/ggml/src/ggml-sycl/ggml-sycl.cpp, line:4463
```

I am using this container, running on NixOS: https://github.com/mattcurf/ollama-intel-gpu

```bash
podman build -t "ollama-intel-gpu" .

podman run --rm -p 127.0.0.1:11434:11434 -v /home/stereomato/models:/mnt -v ollama-volume:/root/.ollama -e OLLAMA_NUM_PARALLEL=1 -e OLLAMA_MAX_LOADED_MODELS=1 -e OLLAMA_FLASH_ATTENTION=1 -e OLLAMA_NUM_GPU=999 -e DEVICE=iGPU --device /dev/dri --name=ollama-intel-gpu ollama-intel-gpu

podman exec -it ollama-intel-gpu bash

./ollama pull deepseek-v2:16b
./ollama run deepseek-v2 "hello deepseek"
```

The q4_k_m 16b variant also exhibits the same issue.

Then I get the error shown in the title and the first two lines of this report.

HW:
- Intel Core i5-12500H
- Intel Xe Graphics (Alder Lake)
- 24 GB of RAM
- up-to-date NixOS
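
For reference, the `{512, 512, 512}` in the error is the device's maximum work-item sizes per work-group dimension. A quick way to check what the iGPU actually reports (a sketch, assuming the image ships the oneAPI `sycl-ls` tool; `clinfo` prints similar fields) is:

```bash
# Inside the container: print verbose device properties and
# filter for the work-group / work-item limits.
sycl-ls --verbose | grep -i work
```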

@stereomato (Author)

Never mind, this seems to be a memory limitation. Is there a way to work around it?

@qiuxin2012 (Contributor)

You can try lowering OLLAMA_NUM_GPU from 999, for example OLLAMA_NUM_GPU=18. That puts 18 layers on the GPU and runs the remaining layers on the CPU; see the sketch below.
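
A minimal sketch of this workaround, reusing the podman invocation from the report with only the layer count changed (the value 18 is just a starting point; tune it up or down until the model fits in memory):

```bash
# Offload 18 layers to the iGPU; the remaining layers run on the CPU.
podman run --rm -p 127.0.0.1:11434:11434 \
  -v /home/stereomato/models:/mnt -v ollama-volume:/root/.ollama \
  -e OLLAMA_NUM_PARALLEL=1 -e OLLAMA_MAX_LOADED_MODELS=1 \
  -e OLLAMA_FLASH_ATTENTION=1 -e OLLAMA_NUM_GPU=18 \
  -e DEVICE=iGPU --device /dev/dri \
  --name=ollama-intel-gpu ollama-intel-gpu
```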
