Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add support for R-4B multimodal model examples python python script changes
#17840 opened Dec 7, 2025 by infil00p Draft
[SYCL] fix softmax for iGPU ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#17838 opened Dec 7, 2025 by NeoZhangJianyu Loading…
debug:Adding CPU-side visual trace for hexagon ggml changes relating to the ggml tensor library for machine learning script Script related
#17837 opened Dec 7, 2025 by Ethan-a2 Loading…
[SYCL] Support gpt-oss by OPs add-id, mul_mat for mxfp4, swiglu_oai documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#17826 opened Dec 6, 2025 by NeoZhangJianyu Loading…
cann : fix ops broken by circular padding guard Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#17825 opened Dec 6, 2025 by CISC Loading…
cli: new CLI experience devops improvements to build systems and github actions examples script Script related server testing Everything test related
#17824 opened Dec 6, 2025 by ngxson Draft
4 of 6 tasks
llama : add token matching support to llama-grammar testing Everything test related
#17816 opened Dec 6, 2025 by aldehir Loading…
3 tasks done
CANN: support gated linear attn Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#17814 opened Dec 6, 2025 by YushengZhao Loading…
vulkan: faster q6_k matmul ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#17813 opened Dec 6, 2025 by netrunnereve Loading…
model: support Rnj-1 model Model specific python python script changes
#17811 opened Dec 6, 2025 by philip-essential Loading…
[DRAFT] CUDA: Improve performance via less synchronizations between token ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#17795 opened Dec 5, 2025 by aendk Draft
Make graph_max_nodes vary by ubatch size
#17794 opened Dec 5, 2025 by pwilkin Loading…
SOLVE_TRI extension to more dimensions examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs server testing Everything test related
#17793 opened Dec 5, 2025 by pwilkin Loading…
ggml-cpu: add repack GEMM and GEMV for floating-point ggml changes relating to the ggml tensor library for machine learning
#17791 opened Dec 5, 2025 by taimur-10x Draft
ggml-cpu: add ggml_thread_cpu_relax with Zihintpause support ggml changes relating to the ggml tensor library for machine learning
#17784 opened Dec 5, 2025 by ixgbe Loading…
CANN : Optimize mul_mat_id quantization Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#17782 opened Dec 5, 2025 by jjjxp03 Loading…
Add link to AshkanYarmoradi/go-llama.cpp
#17776 opened Dec 5, 2025 by AshkanYarmoradi Loading…
ProTip! Exclude everything labeled bug with -label:bug.