llama.cpp
ggml-org/llama.cpp
8.0
Inference Engines
★ 103.7k◇ 16.8kC++MITtoday
gpt4all
nomic-ai/gpt4all
7.2
Inference Engines
★ 77.3k◇ 8.3kC++MIT10mo ago
vLLM
vllm-project/vllm
8.6
Inference Engines
★ 76.6k◇ 15.6kPythonApache-2.0today
ray
ray-project/ray
8.6
Inference Engines
★ 42.1k◇ 7.4kPythonApache-2.0today
gitleaks
gitleaks/gitleaks
8.2
Inference Engines
★ 25.9k◇ 2.0kGoMIT20d ago
llm-action
liguodongiot/llm-action
6.7
Inference Engines
★ 24.0k◇ 2.8kHTMLApache-2.01mo ago
litgpt
Lightning-AI/litgpt
7.9
Inference Engines
★ 13.3k◇ 1.4kPythonApache-2.04d ago
OpenLLM
bentoml/OpenLLM
7.4
Inference Engines
★ 12.3k◇ 805PythonApache-2.01d ago
mistral-inference
mistralai/mistral-inference
6.9
Inference Engines
★ 10.8k◇ 1.0kJupyter NotebookApache-2.01mo ago
openvino
openvinotoolkit/openvino
8.2
Inference Engines
★ 10.1k◇ 3.2kC++Apache-2.0today
PowerInfer
Tiiny-AI/PowerInfer
6.8
Inference Engines
★ 9.3k◇ 561C++MIT2mo ago
BentoML
bentoml/BentoML
8.0
Inference Engines
★ 8.6k◇ 950PythonApache-2.01d ago
lmdeploy
InternLM/lmdeploy
7.5
Inference Engines
★ 7.8k◇ 684PythonApache-2.0today
plano
katanemo/plano
7.4
Inference Engines
★ 6.3k◇ 399RustApache-2.0today
openevolve
algorithmicsuperintelligence/openevolve
6.8
Inference Engines
★ 6.0k◇ 949PythonApache-2.027d ago
flashinfer
flashinfer-ai/flashinfer
7.5
Inference Engines
★ 5.4k◇ 896PythonApache-2.0today
kserve
kserve/kserve
7.7
Inference Engines
★ 5.3k◇ 1.4kGoApache-2.01d ago
Awesome-LLM-Inference
xlite-dev/Awesome-LLM-Inference
6.6
Inference Engines
★ 5.1k◇ 360PythonGPL-3.05d ago
eko
FellouAI/eko
7.3
Inference Engines
★ 4.9k◇ 436TypeScriptMIT1mo ago
gpustack
gpustack/gpustack
7.0
Inference Engines
★ 4.8k◇ 497PythonApache-2.0today
shimmy
Michael-A-Kuykendall/shimmy
6.2
Inference Engines
★ 4.0k◇ 343RustApache-2.019d ago
RuVector
ruvnet/RuVector
6.7
Inference Engines
★ 3.8k◇ 463RustMITtoday
ruvector
ruvnet/ruvector
6.7
Inference Engines
★ 3.8k◇ 463RustMITtoday
lorax
predibase/lorax
6.1
Inference Engines
★ 3.7k◇ 312PythonApache-2.010mo ago
lemonade
lemonade-sdk/lemonade
7.0
Inference Engines
★ 3.5k◇ 261C++Apache-2.0today
optillm
algorithmicsuperintelligence/optillm
6.5
Inference Engines
★ 3.4k◇ 268PythonApache-2.026d ago
deepsparse
neuralmagic/deepsparse
6.1
Inference Engines
★ 3.2k◇ 190PythonNOASSERTION10mo ago
distributed-llama
b4rtaz/distributed-llama
6.3
Inference Engines
★ 2.9k◇ 225C++MITtoday
spiceai
spiceai/spiceai
6.9
Inference Engines
★ 2.9k◇ 185RustApache-2.0today
Medusa
FasterDecoding/Medusa
5.4
Inference Engines
★ 2.7k◇ 197Jupyter NotebookApache-2.01y ago
ZhiLight
zhihu/ZhiLight
5.5
Inference Engines
★ 904◇ 102C++Apache-2.027d ago
kvcached
ovg-project/kvcached
5.6
Inference Engines
★ 852◇ 98PythonApache-2.07d ago
nobodywho
nobodywho-ooo/nobodywho
6.2
Inference Engines
★ 790◇ 55RustEUPL-1.2today
yalm
andrewkchan/yalm
3.8
Inference Engines
★ 570◇ 59C++7mo ago
KuiperLLama
zjhellofss/KuiperLLama
4.1
Inference Engines
★ 527◇ 137C++5mo ago
mlxstudio
jjang-ai/mlxstudio
4.8
Inference Engines
★ 477◇ 32today
swiftLLM
interestingLSY/swiftLLM
3.9
Inference Engines
★ 323◇ 37PythonApache-2.010mo ago