llama.cpp
ggml-org/llama.cpp
8.0
Inference Engines
★ 114.4k◇ 19.1kC++MITtoday
vLLM
vllm-project/vllm
8.6
Inference Engines
★ 81.8k◇ 17.6kPythonApache-2.0today
gpt4all
nomic-ai/gpt4all
7.1
Inference Engines
★ 77.4k◇ 8.3kC++MIT1y ago
ray
ray-project/ray
8.6
Inference Engines
★ 42.8k◇ 7.6kPythonApache-2.0today
gitleaks
gitleaks/gitleaks
8.2
Inference Engines
★ 27.5k◇ 2.1kGoMIT1d ago
llm-action
liguodongiot/llm-action
6.8
Inference Engines
★ 24.4k◇ 2.8kHTMLApache-2.09d ago
litgpt
Lightning-AI/litgpt
7.8
Inference Engines
★ 13.4k◇ 1.4kPythonApache-2.01d ago
OpenLLM
bentoml/OpenLLM
7.4
Inference Engines
★ 12.3k◇ 811PythonApache-2.01d ago
mistral-inference
mistralai/mistral-inference
6.9
Inference Engines
★ 10.8k◇ 1.1kJupyter NotebookApache-2.01mo ago
openvino
openvinotoolkit/openvino
8.2
Inference Engines
★ 10.3k◇ 3.2kC++Apache-2.0today
PowerInfer
Tiiny-AI/PowerInfer
7.0
Inference Engines
★ 9.5k◇ 579C++MIT23d ago
BentoML
bentoml/BentoML
8.0
Inference Engines
★ 8.7k◇ 968PythonApache-2.0today
lmdeploy
InternLM/lmdeploy
7.5
Inference Engines
★ 7.9k◇ 701PythonApache-2.0today
plano
katanemo/plano
7.4
Inference Engines
★ 6.6k◇ 427RustApache-2.01d ago
openevolve
algorithmicsuperintelligence/openevolve
6.7
Inference Engines
★ 6.5k◇ 1.0kPythonApache-2.02mo ago
flashinfer
flashinfer-ai/flashinfer
7.4
Inference Engines
★ 5.7k◇ 1.0kPythonApache-2.0today
kserve
kserve/kserve
7.7
Inference Engines
★ 5.5k◇ 1.5kGoApache-2.0today
shimmy
Michael-A-Kuykendall/shimmy
6.3
Inference Engines
★ 5.3k◇ 503RustApache-2.01d ago
Awesome-LLM-Inference
xlite-dev/Awesome-LLM-Inference
6.5
Inference Engines
★ 5.3k◇ 381PythonGPL-3.01mo ago
gpustack
gpustack/gpustack
7.0
Inference Engines
★ 5.1k◇ 540PythonApache-2.0today
eko
FellouAI/eko
7.0
Inference Engines
★ 4.9k◇ 439TypeScriptMIT3mo ago
lemonade
lemonade-sdk/lemonade
7.1
Inference Engines
★ 4.2k◇ 330C++Apache-2.0today
ruvector
ruvnet/ruvector
7.0
Inference Engines
★ 4.2k◇ 544RustMITtoday
RuVector
ruvnet/RuVector
7.0
Inference Engines
★ 4.2k◇ 544RustMITtoday
optillm
algorithmicsuperintelligence/optillm
6.6
Inference Engines
★ 4.1k◇ 355PythonApache-2.027d ago
lorax
predibase/lorax
6.9
Inference Engines
★ 3.8k◇ 316PythonApache-2.05d ago
deepsparse
neuralmagic/deepsparse
5.9
Inference Engines
★ 3.2k◇ 192PythonNOASSERTION1y ago
spiceai
spiceai/spiceai
7.0
Inference Engines
★ 2.9k◇ 197RustApache-2.0today
distributed-llama
b4rtaz/distributed-llama
6.2
Inference Engines
★ 2.9k◇ 232C++MIT1mo ago
Medusa
FasterDecoding/Medusa
5.4
Inference Engines
★ 2.7k◇ 201Jupyter NotebookApache-2.01y ago
kvcached
ovg-project/kvcached
5.8
Inference Engines
★ 1.1k◇ 118PythonApache-2.0today
nobodywho
nobodywho-ooo/nobodywho
6.2
Inference Engines
★ 944◇ 66RustEUPL-1.2today
ZhiLight
zhihu/ZhiLight
5.3
Inference Engines
★ 905◇ 102C++Apache-2.02mo ago
mlxstudio
jjang-ai/mlxstudio
5.3
Inference Engines
★ 763◇ 49today
yalm
andrewkchan/yalm
3.7
Inference Engines
★ 584◇ 62C++8mo ago
KuiperLLama
zjhellofss/KuiperLLama
4.0
Inference Engines
★ 548◇ 142C++7mo ago
swiftLLM
interestingLSY/swiftLLM
3.8
Inference Engines
★ 329◇ 38PythonApache-2.011mo ago