STACKQUADRANT

alibaba/rtp-llm

Model Serving

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

GitHub Metrics
Stars
1.1k
Forks
154
Open Issues
100
Watchers
18
Contributors
87
Weekly Commits
0
Language
Cuda
License
Apache-2.0
Last Commit
Mar 2, 2026
Created
Dec 27, 2023
Latest Release
v0.2.0
Release Date
Oct 31, 2025
Synced: Mar 3, 2026
Quality Scores
Documentation Qualityw: 20%
0.0
Community Healthw: 20%
0.0
Maintenance Velocityw: 15%
0.0
API Design & DXw: 20%
0.0
Production Readinessw: 15%
0.0
Ecosystem Integrationw: 10%
0.0
Tags
gptinferencellamallmllm-servingllmopsmodel-serving
Radar
No scores yet