Model Serving
Platforms for deploying and serving ML/AI models at scale
TensorRT-LLM
7.3
★ 13.4k◇ 2.3kPython
jundot/omlx
7.6
★ 10.0k◇ 862Python
ahkarami/Deep-Learning-in-Production
4.5
★ 4.4k◇ 692
beclab/Olares
7.0
★ 4.3k◇ 244Go
vllm-project/vllm-omni
7.2
★ 4.3k◇ 757Python
ModelTC/LightLLM
6.5
★ 4.0k◇ 319Python
HuaizhengZhang/AI-Infra-from-Zero-to-Hero
6.3
★ 3.9k◇ 377
thu-pacman/chitu
6.9
★ 3.4k◇ 354Python
containers/ramalama
7.4
★ 2.7k◇ 330Python
roboflow/inference
7.2
★ 2.3k◇ 252Python
tensorchord/envd
7.0
★ 2.2k◇ 167Go
microsoft/aici
4.9
★ 2.1k◇ 83Rust
vllm-project/vllm-ascend
7.2
★ 1.9k◇ 1.1kPython
mlrun/mlrun
7.2
★ 1.7k◇ 301Python
kitops-ml/kitops
6.9
★ 1.3k◇ 173Go
logicalclocks/hopsworks
5.8
★ 1.3k◇ 156Java
basetenlabs/truss
6.7
★ 1.1k◇ 98Python
alibaba/rtp-llm
6.1
★ 1.1k◇ 169Cuda
efeslab/Nanoflow
4.9
★ 952◇ 48Jupyter Notebook
mosecorg/mosec
6.4
★ 898◇ 72Python
openvinotoolkit/model_server
6.5
★ 856◇ 248C++
pipeless-ai/pipeless
4.9
★ 850◇ 53Rust
bentoml/Yatai
5.3
★ 838◇ 77TypeScript
ServerlessLLM/ServerlessLLM
5.9
★ 674◇ 70Python
1 / 2next →