Model Serving
Platforms for deploying and serving ML/AI models at scale
jundot/omlx
7.8
★ 15.7k ◇ 1.3k Python
TensorRT-LLM
7.3
★ 13.8k ◇ 2.4k Python
vllm-project/vllm-omni
7.3
★ 4.9k ◇ 1.0k Python
beclab/Olares
7.0
★ 4.6k ◇ 266 Go
ahkarami/Deep-Learning-in-Production
4.5
★ 4.4k ◇ 687
ModelTC/LightLLM
6.5
★ 4.1k ◇ 332 Python
HuaizhengZhang/AI-Infra-from-Zero-to-Hero
6.2
★ 4.1k ◇ 393
thu-pacman/chitu
6.8
★ 3.1k ◇ 266 Python
containers/ramalama
7.5
★ 2.9k ◇ 340 Python
roboflow/inference
7.3
★ 2.3k ◇ 269 Python
tensorchord/envd
6.9
★ 2.2k ◇ 168 Go
vllm-project/vllm-ascend
7.2
★ 2.2k ◇ 1.3k C++
microsoft/aici
4.9
★ 2.1k ◇ 84 Rust
superlinked/sie
6.6
★ 2.0k ◇ 177 Python
mlrun/mlrun
7.2
★ 1.7k ◇ 305 Python
kitops-ml/kitops
6.9
★ 1.3k ◇ 170 Go
logicalclocks/hopsworks
5.8
★ 1.3k ◇ 158 Java
alibaba/rtp-llm
6.0
★ 1.2k ◇ 204 Cuda
basetenlabs/truss
6.7
★ 1.2k ◇ 107 Python
efeslab/Nanoflow
4.7
★ 962 ◇ 49 Jupyter Notebook
mosecorg/mosec
6.5
★ 900 ◇ 72 Python
openvinotoolkit/model_server
6.5
★ 880 ◇ 253 C++
pipeless-ai/pipeless
4.9
★ 850 ◇ 52 Rust
bentoml/Yatai
6.2
★ 845 ◇ 76 TypeScript