STACKQUADRANT

InternLM/lmdeploy

Inference Engines

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

GitHub Metrics
Stars
7.6k
Forks
658
Open Issues
562
Watchers
55
Contributors
135
Weekly Commits
0
Language
Python
License
Apache-2.0
Last Commit
Mar 2, 2026
Created
Jun 15, 2023
Latest Release
v0.12.1
Release Date
Feb 13, 2026
Synced: Mar 3, 2026
Quality Scores
Documentation Qualityw: 20%
0.0
Community Healthw: 20%
0.0
Maintenance Velocityw: 15%
0.0
API Design & DXw: 20%
0.0
Production Readinessw: 15%
0.0
Ecosystem Integrationw: 10%
0.0
Tags
codellamacuda-kernelsdeepspeedfastertransformerinternlmllamallama2llama3llmllm-inference
Radar
No scores yet