omlx
jundot/omlx
7.8
Model Serving
★ 15.7k · ◇ 1.3k · Python · Apache-2.0 · today
TensorRT-LLM
NVIDIA/TensorRT-LLM
7.3
Model Serving
★ 13.8k · ◇ 2.4k · Python · NOASSERTION · today
vllm-omni
vllm-project/vllm-omni
7.3
Model Serving
★ 4.9k · ◇ 1.0k · Python · Apache-2.0 · today
Olares
beclab/Olares
7.0
Model Serving
★ 4.6k · ◇ 266 · Go · AGPL-3.0 · today
Deep-Learning-in-Production
ahkarami/Deep-Learning-in-Production
4.5
Model Serving
★ 4.4k · ◇ 687 · 1y ago
LightLLM
ModelTC/LightLLM
6.5
Model Serving
★ 4.1k · ◇ 332 · Python · Apache-2.0 · today
AI-Infra-from-Zero-to-Hero
HuaizhengZhang/AI-Infra-from-Zero-to-Hero
6.2
Model Serving
★ 4.1k · ◇ 393 · MIT · 10mo ago
chitu
thu-pacman/chitu
6.8
Model Serving
★ 3.1k · ◇ 266 · Python · Apache-2.0 · today
ramalama
containers/ramalama
7.5
Model Serving
★ 2.9k · ◇ 340 · Python · MIT · 1d ago
inference
roboflow/inference
7.3
Model Serving
★ 2.3k · ◇ 269 · Python · NOASSERTION · today
envd
tensorchord/envd
6.9
Model Serving
★ 2.2k · ◇ 168 · Go · Apache-2.0 · 12d ago
vllm-ascend
vllm-project/vllm-ascend
7.2
Model Serving
★ 2.2k · ◇ 1.3k · C++ · Apache-2.0 · today
aici
microsoft/aici
4.9
Model Serving
★ 2.1k · ◇ 84 · Rust · MIT · 1y ago
sie
superlinked/sie
6.6
Model Serving
★ 2.0k · ◇ 177 · Python · Apache-2.0 · 4d ago
mlrun
mlrun/mlrun
7.2
Model Serving
★ 1.7k · ◇ 305 · Python · Apache-2.0 · today
kitops
kitops-ml/kitops
6.9
Model Serving
★ 1.3k · ◇ 170 · Go · Apache-2.0 · today
hopsworks
logicalclocks/hopsworks
5.8
Model Serving
★ 1.3k · ◇ 158 · Java · AGPL-3.0 · 1y ago
rtp-llm
alibaba/rtp-llm
6.0
Model Serving
★ 1.2k · ◇ 204 · Cuda · Apache-2.0 · today
truss
basetenlabs/truss
6.7
Model Serving
★ 1.2k · ◇ 107 · Python · MIT · today
Nanoflow
efeslab/Nanoflow
4.7
Model Serving
★ 962 · ◇ 49 · Jupyter Notebook · 2mo ago
mosec
mosecorg/mosec
6.5
Model Serving
★ 900 · ◇ 72 · Python · Apache-2.0 · 2d ago
model_server
openvinotoolkit/model_server
6.5
Model Serving
★ 880 · ◇ 253 · C++ · Apache-2.0 · today
pipeless
pipeless-ai/pipeless
4.9
Model Serving
★ 850 · ◇ 52 · Rust · Apache-2.0 · 2y ago
Yatai
bentoml/Yatai
6.2
Model Serving
★ 845 · ◇ 76 · TypeScript · NOASSERTION · 4d ago
ServerlessLLM
ServerlessLLM/ServerlessLLM
5.9
Model Serving
★ 685 · ◇ 73 · Python · Apache-2.0 · 29d ago
timber
kossisoroyce/timber
5.5
Model Serving
★ 682 · ◇ 23 · Python · NOASSERTION · 1mo ago
fastapi-ml-skeleton
eightBEC/fastapi-ml-skeleton
4.6
Model Serving
★ 601 · ◇ 91 · Python · Apache-2.0 · 4mo ago
pinferencia
underneathall/pinferencia
4.7
Model Serving
★ 544 · ◇ 83 · Python · Apache-2.0 · 3y ago
ome
ome-projects/ome
6.0
Model Serving
★ 461 · ◇ 81 · Go · Apache-2.0 · today
JetStream
AI-Hypercomputer/JetStream
4.9
Model Serving
★ 442 · ◇ 65 · Python · Apache-2.0 · 4mo ago
xFasterTransformer
intel/xFasterTransformer
4.4
Model Serving
★ 435 · ◇ 75 · C++ · Apache-2.0 · 8mo ago
gpu-rest-engine
NVIDIA/gpu-rest-engine
3.7
Model Serving
★ 423 · ◇ 93 · C++ · BSD-3-Clause · 7y ago
stable-diffusion-deploy
Lightning-Universe/stable-diffusion-deploy
4.6
Model Serving
★ 391 · ◇ 39 · Python · Apache-2.0 · 2y ago
pmetal
Epistates/pmetal
5.1
Model Serving
★ 293 · ◇ 20 · Rust · NOASSERTION · 26d ago
podman-desktop-extension-ai-lab
containers/podman-desktop-extension-ai-lab
5.9
Model Serving
★ 291 · ◇ 82 · TypeScript · Apache-2.0 · today
TurboOCR
aiptimizer/TurboOCR
5.1
Model Serving
★ 284 · ◇ 35 · C++ · MIT · 9d ago
BMW-YOLOv4-Inference-API-GPU
BMW-InnovationLab/BMW-YOLOv4-Inference-API-GPU
4.1
Model Serving
★ 279 · ◇ 67 · Python · BSD-3-Clause · 3y ago
BMW-YOLOv4-Inference-API-CPU
BMW-InnovationLab/BMW-YOLOv4-Inference-API-CPU
3.9
Model Serving
★ 219 · ◇ 58 · Python · NOASSERTION · 3y ago