Pull requests: ggml-org/llama.cpp
Pull requests list
UMA buffers get host-visible memory at allocation time
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#22930
opened May 11, 2026 by
winstonma
server: fix checkpoints creation
examples
server
testing
Everything test related
#22929
opened May 11, 2026 by
jacekpoplawski
Contributor
Draft
kv-cache: use -t threads for IQ4 packing from ggml code
ggml
changes relating to the ggml tensor library for machine learning
#22928
opened May 11, 2026 by
shikaku2
common: improve --fit host-memory accounting for CPU and iGPU
ggml
changes relating to the ggml tensor library for machine learning
vendor : update cpp-httplib to 0.44.0
merge ready
A maintainer can use this label to indicate that they consider the changes final and ready to merge.
python
python script changes
script
Script related
#22919
opened May 10, 2026 by
cabelo
Contributor
Ggml/cuda snake fusion hardening
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#22912
opened May 10, 2026 by
ServeurpersoCom
Contributor
webui: preserve system message on edit cancel
examples
server/webui
server
#22911
opened May 10, 2026 by
ServeurpersoCom
Contributor
ggml-webgpu: Enables running gpt-oss-20b
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
WebGPU
#22906
opened May 10, 2026 by
yomaytk
Contributor
[SYCL] Add OP im2col_3d
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
merge ready
A maintainer can use this label to indicate that they consider the changes final and ready to merge.
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#22903
opened May 10, 2026 by
arthw
Contributor
webui: fix theme from --webui-config-file not applied on first load (fresh localStorage)
examples
server/webui
server
#22902
opened May 10, 2026 by
ServeurpersoCom
Contributor
fix(quantize): add NVFP4 default type mapping and scale tensors
examples
#22897
opened May 10, 2026 by
t-timms
[ggml] Fix Vulkan-Hpp handle usage on 32-bit targets.
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#22892
opened May 10, 2026 by
miyanyan
Contributor
vulkan: Switch MUL_MAT_VEC to 4 K per iteration for F16/32
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#22887
opened May 9, 2026 by
TheBlueMatt
Contributor
feat: add MiMo v2.5 vision
examples
python
python script changes
#22883
opened May 9, 2026 by
AesSedai
Contributor
HIP: RDNA3 mma FA, faster AMD transpose, tune AMD
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#22880
opened May 9, 2026 by
JohannesGaessler
Contributor
docs: fix metrics endpoint description in server README
examples
server
#22879
opened May 9, 2026 by
willjoha
Optimise memory usage by evicting weights after processing each layer
examples
#22877
opened May 9, 2026 by
EAddario
Contributor
opencl: fix crash when warming up MoE on Adreno
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
ggml-cpu: Add IME2 Instruction Support for the SpacemiT Backend
build
Compilation issues
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
#22863
opened May 9, 2026 by
alex-spacemit
Collaborator
ggml-cpu: scope KleidiAI compile flags per-target via OBJECT library
ggml
changes relating to the ggml tensor library for machine learning
#22861
opened May 9, 2026 by
shreyanshp
security: fix critical integer overflow (CWE-190) in tensor allocation
ggml
changes relating to the ggml tensor library for machine learning
#22857
opened May 8, 2026 by
programacionlogicT900r1000