TensorRT-LLM
NVIDIA/TensorRT-LLM
7.3
Model Serving
★ 13.4k◇ 2.3kPythonNOASSERTIONtoday
omlx
jundot/omlx
7.6
Model Serving
★ 9.9k◇ 857PythonApache-2.0today
Deep-Learning-in-Production
ahkarami/Deep-Learning-in-Production
4.5
Model Serving
★ 4.4k◇ 6921y ago
Olares
beclab/Olares
7.0
Model Serving
★ 4.3k◇ 244GoAGPL-3.0today
vllm-omni
vllm-project/vllm-omni
7.2
Model Serving
★ 4.3k◇ 753PythonApache-2.0today
LightLLM
ModelTC/LightLLM
6.5
Model Serving
★ 4.0k◇ 319PythonApache-2.0today
AI-Infra-from-Zero-to-Hero
HuaizhengZhang/AI-Infra-from-Zero-to-Hero
6.3
Model Serving
★ 3.9k◇ 377MIT8mo ago
chitu
thu-pacman/chitu
6.9
Model Serving
★ 3.4k◇ 354PythonApache-2.0today
ramalama
containers/ramalama
7.4
Model Serving
★ 2.7k◇ 330PythonMITtoday
inference
roboflow/inference
7.2
Model Serving
★ 2.3k◇ 252PythonNOASSERTIONtoday
envd
tensorchord/envd
7.0
Model Serving
★ 2.2k◇ 167GoApache-2.04d ago
aici
microsoft/aici
4.9
Model Serving
★ 2.1k◇ 83RustMIT1y ago
vllm-ascend
vllm-project/vllm-ascend
7.2
Model Serving
★ 1.9k◇ 1.1kPythonApache-2.0today
mlrun
mlrun/mlrun
7.2
Model Serving
★ 1.7k◇ 301PythonApache-2.0today
kitops
kitops-ml/kitops
6.9
Model Serving
★ 1.3k◇ 173GoApache-2.01d ago
hopsworks
logicalclocks/hopsworks
5.8
Model Serving
★ 1.3k◇ 156JavaAGPL-3.01y ago
truss
basetenlabs/truss
6.7
Model Serving
★ 1.1k◇ 98PythonMITtoday
rtp-llm
alibaba/rtp-llm
6.1
Model Serving
★ 1.1k◇ 168CudaApache-2.0today
Nanoflow
efeslab/Nanoflow
4.9
Model Serving
★ 952◇ 48Jupyter Notebook16d ago
mosec
mosecorg/mosec
6.4
Model Serving
★ 898◇ 72PythonApache-2.0today
model_server
openvinotoolkit/model_server
6.5
Model Serving
★ 856◇ 248C++Apache-2.0today
pipeless
pipeless-ai/pipeless
4.9
Model Serving
★ 850◇ 53RustApache-2.01y ago
Yatai
bentoml/Yatai
5.3
Model Serving
★ 838◇ 77TypeScriptNOASSERTION1y ago
ServerlessLLM
ServerlessLLM/ServerlessLLM
5.9
Model Serving
★ 674◇ 70PythonApache-2.01mo ago
timber
kossisoroyce/timber
5.5
Model Serving
★ 667◇ 20PythonNOASSERTION1mo ago
fastapi-ml-skeleton
eightBEC/fastapi-ml-skeleton
4.7
Model Serving
★ 603◇ 93PythonApache-2.03mo ago
pinferencia
underneathall/pinferencia
4.7
Model Serving
★ 545◇ 82PythonApache-2.03y ago
xFasterTransformer
intel/xFasterTransformer
4.5
Model Serving
★ 436◇ 74C++Apache-2.06mo ago
JetStream
AI-Hypercomputer/JetStream
5.0
Model Serving
★ 424◇ 63PythonApache-2.03mo ago
gpu-rest-engine
NVIDIA/gpu-rest-engine
3.9
Model Serving
★ 423◇ 93C++BSD-3-Clause7y ago
stable-diffusion-deploy
Lightning-Universe/stable-diffusion-deploy
4.7
Model Serving
★ 391◇ 39PythonApache-2.02y ago
podman-desktop-extension-ai-lab
containers/podman-desktop-extension-ai-lab
5.9
Model Serving
★ 291◇ 80TypeScriptApache-2.0today
BMW-YOLOv4-Inference-API-GPU
BMW-InnovationLab/BMW-YOLOv4-Inference-API-GPU
4.1
Model Serving
★ 278◇ 67PythonBSD-3-Clause3y ago
BMW-YOLOv4-Inference-API-CPU
BMW-InnovationLab/BMW-YOLOv4-Inference-API-CPU
3.9
Model Serving
★ 218◇ 58PythonNOASSERTION3y ago