Pull requests: ggml-org/llama.cpp

Pull requests list

ggml-cpu: enable IBM NNPA Vector Intrinsics
Labels: documentation, ggml
#14303 opened Jun 20, 2025 by taronaeo Draft
metal : fix thread-safety
Labels: Apple Metal, ggml
#14300 opened Jun 20, 2025 by ggerganov
memory : rename interface to llama_memory_context_i
#14296 opened Jun 20, 2025 by ggerganov
Fix Windows Null Pointer Bug and Enhance Memory Operations in ggml-sycl
Labels: ggml, SYCL
#14290 opened Jun 20, 2025 by MengAiDev
kv-cache : use ggml_set_rows
Labels: ggml
#14285 opened Jun 19, 2025 by ggerganov Draft
ggml : add ggml_set_rows
Labels: ggml
#14274 opened Jun 19, 2025 by rgerganov Draft
CUDA: mul_mat_v support for batch sizes > 1
Labels: ggml, Nvidia GPU
#14262 opened Jun 18, 2025 by JohannesGaessler
opencl: ref count ggml_backend_opencl_context and refactor profiling
Labels: ggml
#14254 opened Jun 18, 2025 by lhez
server : add pidfile option
Labels: examples, server
#14242 opened Jun 17, 2025 by ericcurtin
Add SmolLM3
Labels: documentation, python
#14240 opened Jun 17, 2025 by Vaibhavs10 Draft
MODEL: Falcon-H1 support
Labels: Apple Metal, ggml, python, testing
#14238 opened Jun 17, 2025 by younesbelkada Draft
ggml: introduce GGML_NUMA_MIGRATE to optimize cross NUMA op computation
Labels: examples, ggml
#14232 opened Jun 17, 2025 by wenlujon
llama: fix compilation warning (#464)
#14209 opened Jun 16, 2025 by L33TSP34KER
ci: re-enable rocm linux build, reduce the built targets to the ones currently available in rocblas
Labels: devops
#14184 opened Jun 14, 2025 by IMbackK
ggml : implement op fusion, starting with REGLU/GEGLU/SWIGLU
Labels: Apple Metal, ggml, help wanted, Nvidia GPU, SYCL, testing, Vulkan
#14158 opened Jun 12, 2025 by CISC Draft
tests : add test-model-random
Labels: help wanted, testing
#14139 opened Jun 12, 2025 by compilade Draft
4 of 16 tasks
ggml: aarch64: Implement SVE Kernels for Int 8 Quantization
Labels: ggml
#14117 opened Jun 11, 2025 by Vithulep
server: add model alias presets
Labels: examples, python, server
#14083 opened Jun 9, 2025 by am17an
llama: automatically set runtime parameters such as --n-gpu-layers to fit VRAM
Labels: ggml
#14067 opened Jun 8, 2025 by JohannesGaessler Draft