-
Notifications
You must be signed in to change notification settings - Fork 8.5k
Pull requests: ggerganov/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
PHI3-vision gguf conversion
examples
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
#7705
opened Jun 3, 2024 by
farris
Loading…
add pkg-config spec file for llama.cpp
build
Compilation issues
#7702
opened Jun 2, 2024 by
andy-tai
Loading…
docs: Added initial PR template with directions for doc only changes and squash merges [no ci]
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
need feedback
Testing and feedback with results are needed
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7700
opened Jun 2, 2024 by
nicolasperez19
Loading…
Improve hipBLAS support in CMake
build
Compilation issues
#7696
opened Jun 2, 2024 by
daniandtheweb
Loading…
convert-hf : set the model name based on cli arg, if present
python
python script changes
#7693
opened Jun 2, 2024 by
sasha0552
Loading…
refine .gitignore
android
Issues specific to Android
merging soon
Will merge soon unless anyone objects
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7688
opened Jun 2, 2024 by
zhouwg
Loading…
convert-hf : match model part name prefix and suffix
bugfix
fixes an issue or bug
merging soon
Will merge soon unless anyone objects
python
python script changes
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7687
opened Jun 2, 2024 by
compilade
Loading…
Per token attributes
python
python script changes
testing
Everything test related
#7685
opened Jun 1, 2024 by
jaime-m-p
Loading…
CUDA: use tensor cores for MMQ
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
review complexity : high
Generally require indepth knowledge of LLMs or GPUs
#7676
opened May 31, 2024 by
JohannesGaessler
•
Draft
common : refactor cli arg parsing
examples
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
docs: repeat-penalty 1.0 = disabled
examples
merging soon
Will merge soon unless anyone objects
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7669
opened May 31, 2024 by
brandon-lockaby
Loading…
MiniCPM Support lm_head
python
python script changes
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7664
opened May 31, 2024 by
zkh2016
Loading…
llama : avoid double token-to-piece cache
review complexity : medium
Generally require more time to grok but manageable by beginner to medium expertise level
#7654
opened May 30, 2024 by
ggerganov
Loading…
Merging #7568 with #7430(Implementing LLaMA 3 torch to gguf conversion)
examples
python
python script changes
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7651
opened May 30, 2024 by
Manaball123
Loading…
fix: change first msg check
bugfix
fixes an issue or bug
examples
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
server
#7649
opened May 30, 2024 by
ryan1117001
Loading…
Only use FIM middle token if it exists
examples
merging soon
Will merge soon unless anyone objects
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
server
#7648
opened May 30, 2024 by
CISC
Loading…
llama_supports_rpc()
function
merging soon
#7647
opened May 30, 2024 by
martindevans
Loading…
More checks before assuming FIM tokens for Llama arch
bugfix
fixes an issue or bug
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
merging soon
Will merge soon unless anyone objects
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7644
opened May 30, 2024 by
CISC
Loading…
Catch exceptions correctly in server.cpp
bugfix
fixes an issue or bug
examples
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
merging soon
Will merge soon unless anyone objects
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
server
#7642
opened May 30, 2024 by
0wwafa
Loading…
llama : offload to RPC in addition to other backends
ggml
changes relating to the ggml tensor library for machine learning
review complexity : medium
Generally require more time to grok but manageable by beginner to medium expertise level
#7640
opened May 30, 2024 by
rgerganov
Loading…
ggml : unify rope norm/neox
examples
ggml
changes relating to the ggml tensor library for machine learning
Kompute
https://github.com/KomputeProject/kompute/
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
refactoring
Refactoring
review complexity : high
Generally require indepth knowledge of LLMs or GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#7634
opened May 30, 2024 by
ggerganov
Loading…
8 of 11 tasks
Vulkan Mixture of Experts (MoE) support
python
python script changes
review complexity : high
Generally require indepth knowledge of LLMs or GPUs
Vulkan
Issues specific to the Vulkan backend
#7628
opened May 29, 2024 by
0cc4m
Loading…
Readme: add HyperMink/inferenceable to HTTP server
review complexity : low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7607
opened May 29, 2024 by
sameercharles
Loading…
ggml: Support OpenMP for multi-thread processing
build
Compilation issues
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
review complexity : medium
Generally require more time to grok but manageable by beginner to medium expertise level
#7606
opened May 29, 2024 by
msy-kato
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:master.