Skip to content

Pull requests: ggerganov/llama.cpp

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

PHI3-vision gguf conversion examples ggml changes relating to the ggml tensor library for machine learning python python script changes
#7705 opened Jun 3, 2024 by farris Loading…
add pkg-config spec file for llama.cpp build Compilation issues
#7702 opened Jun 2, 2024 by andy-tai Loading…
docs: Added initial PR template with directions for doc only changes and squash merges [no ci] devops improvements to build systems and github actions documentation Improvements or additions to documentation need feedback Testing and feedback with results are needed review complexity : low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7700 opened Jun 2, 2024 by nicolasperez19 Loading…
Improve hipBLAS support in CMake build Compilation issues
#7696 opened Jun 2, 2024 by daniandtheweb Loading…
convert-hf : set the model name based on cli arg, if present python python script changes
#7693 opened Jun 2, 2024 by sasha0552 Loading…
refine .gitignore android Issues specific to Android merging soon Will merge soon unless anyone objects review complexity : low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7688 opened Jun 2, 2024 by zhouwg Loading…
convert-hf : match model part name prefix and suffix bugfix fixes an issue or bug merging soon Will merge soon unless anyone objects python python script changes review complexity : low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7687 opened Jun 2, 2024 by compilade Loading…
Per token attributes python python script changes testing Everything test related
#7685 opened Jun 1, 2024 by jaime-m-p Loading…
CUDA: use tensor cores for MMQ ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs review complexity : high Generally require indepth knowledge of LLMs or GPUs
#7676 opened May 31, 2024 by JohannesGaessler Draft
common : refactor cli arg parsing examples review complexity : low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7675 opened May 31, 2024 by ggerganov Draft
3 tasks
docs: repeat-penalty 1.0 = disabled examples merging soon Will merge soon unless anyone objects review complexity : low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7669 opened May 31, 2024 by brandon-lockaby Loading…
MiniCPM Support lm_head python python script changes review complexity : low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7664 opened May 31, 2024 by zkh2016 Loading…
llama : avoid double token-to-piece cache review complexity : medium Generally require more time to grok but manageable by beginner to medium expertise level
#7654 opened May 30, 2024 by ggerganov Loading…
Merging #7568 with #7430(Implementing LLaMA 3 torch to gguf conversion) examples python python script changes review complexity : low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7651 opened May 30, 2024 by Manaball123 Loading…
fix: change first msg check bugfix fixes an issue or bug examples review complexity : low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix server
#7649 opened May 30, 2024 by ryan1117001 Loading…
Only use FIM middle token if it exists examples merging soon Will merge soon unless anyone objects review complexity : low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix server
#7648 opened May 30, 2024 by CISC Loading…
llama_supports_rpc() function merging soon Will merge soon unless anyone objects review complexity : low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7647 opened May 30, 2024 by martindevans Loading…
More checks before assuming FIM tokens for Llama arch bugfix fixes an issue or bug medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable) merging soon Will merge soon unless anyone objects review complexity : low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7644 opened May 30, 2024 by CISC Loading…
Catch exceptions correctly in server.cpp bugfix fixes an issue or bug examples high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow) merging soon Will merge soon unless anyone objects review complexity : low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix server
#7642 opened May 30, 2024 by 0wwafa Loading…
llama : offload to RPC in addition to other backends ggml changes relating to the ggml tensor library for machine learning review complexity : medium Generally require more time to grok but manageable by beginner to medium expertise level
#7640 opened May 30, 2024 by rgerganov Loading…
ggml : unify rope norm/neox examples ggml changes relating to the ggml tensor library for machine learning Kompute https://github.com/KomputeProject/kompute/ Nvidia GPU Issues specific to Nvidia GPUs python python script changes refactoring Refactoring review complexity : high Generally require indepth knowledge of LLMs or GPUs SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#7634 opened May 30, 2024 by ggerganov Loading…
8 of 11 tasks
Vulkan Mixture of Experts (MoE) support python python script changes review complexity : high Generally require indepth knowledge of LLMs or GPUs Vulkan Issues specific to the Vulkan backend
#7628 opened May 29, 2024 by 0cc4m Loading…
Readme: add HyperMink/inferenceable to HTTP server review complexity : low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7607 opened May 29, 2024 by sameercharles Loading…
ggml: Support OpenMP for multi-thread processing build Compilation issues devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning review complexity : medium Generally require more time to grok but manageable by beginner to medium expertise level
#7606 opened May 29, 2024 by msy-kato Loading…
ProTip! Filter pull requests by the default branch with base:master.