Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

vulkan: matmul gcn tuning ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#13016 opened Apr 18, 2025 by netrunnereve Loading…
CUDA: noncont MMVQ + batched bs1 MUL_MAT_ID ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#13014 opened Apr 18, 2025 by JohannesGaessler Loading…
Update OneAPI base toolkit to Latest Version for Windows SYCL Backend devops improvements to build systems and github actions SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13007 opened Apr 18, 2025 by kotauchisunsun Loading…
Nix portability improvements devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning nix Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
#13005 opened Apr 18, 2025 by hacker1024 Loading…
[SYCL][OPT] Fix reorder optimization for Q4_0 ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13003 opened Apr 18, 2025 by NeoZhangJianyu Loading…
make memset range dynamic ggml changes relating to the ggml tensor library for machine learning
#13002 opened Apr 18, 2025 by pockers21 Loading…
threading: support for GGML_SCHED_PRIO_LOW, update thread info on Windows to avoid throttling examples ggml changes relating to the ggml tensor library for machine learning
#12995 opened Apr 17, 2025 by max-krasnyansky Loading…
[CANN] Add the n_graph_splits performance metric to llama-bench. Ascend NPU issues specific to Ascend NPUs examples
#12994 opened Apr 17, 2025 by bachelor-dou Loading…
SYCL: Add non-contiguous support in ROPE ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12993 opened Apr 17, 2025 by qnixsynapse Loading…
Fix convert script for non-hf GLM4 checkpoints python python script changes
#12992 opened Apr 17, 2025 by Tianyue-Zhao Loading…
2 of 4 tasks
sycl: use DNN in the first part of ggml_sycl_mul_mat_batched_sycl ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12972 opened Apr 16, 2025 by lslusarczyk Draft
Vulkan: Fix Deepseek V2 inference by making ggml_vk_op_supports_incontiguous(GGML_OP_RMS_NORM) return true ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12960 opened Apr 15, 2025 by stduhpf Draft
Resolved half rope,multi-EOS issues in convert_hf_togguf.py for GLM4Z Model python python script changes
#12957 opened Apr 15, 2025 by piDack Loading…
rpc : do not wait for response when sending RPC_CMD_SET_TENSOR ggml changes relating to the ggml tensor library for machine learning
#12943 opened Apr 14, 2025 by rgerganov Draft
llama-bench : Add --override-tensors arg examples
#12922 opened Apr 12, 2025 by 4onen Loading…
Get CPU model in ggml_backend_cpu_device_context on FreeBSD ggml changes relating to the ggml tensor library for machine learning
#12902 opened Apr 11, 2025 by yurivict Loading…
cuda: fix compilation error (#12893) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#12894 opened Apr 11, 2025 by lizhenneng Loading…
llama-tts : input from stdin examples
#12890 opened Apr 11, 2025 by marcoStocchi Loading…
ggml : add SSE 4.2 and x64 base variant for CPUs without AVX ggml changes relating to the ggml tensor library for machine learning
#12871 opened Apr 10, 2025 by slaren Loading…
sycl : implementation of reordered Q4_0 MMVQ for Intel GPUs ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12858 opened Apr 10, 2025 by Alcpz Loading…
2 of 3 tasks
ProTip! Adding no:label will show everything without a label.