STACKQUADRANT

gpustack/gpustack

Inference Engines

Performance-optimized AI inference on your GPUs. Unlock superior throughput by selecting and tuning engines like vLLM or SGLang.

Overall score: 7.0
GitHub Metrics
Stars: 5.1k
Forks: 540
Open Issues: 574
Watchers: 39
Contributors: 41
Weekly Commits: 0
Language: Python
License: Apache-2.0
Last Commit: Jun 3, 2026
Created: May 11, 2024
Latest Release: v2.1.2
Release Date: Apr 21, 2026
Synced: Jun 3, 2026
Quality Scores
Documentation Quality (weight: 20%)
7.3

Has docs site (https://gpustack.ai). Description: 128 chars. Stars signal: 5,094. Contributors: 41. Score: 7.3/10

Community Health (weight: 20%)
6.2

Stars: 5,094. Contributors: 41. Watchers: 39. Forks: 540. Issue ratio: 11.3%. Score: 6.2/10

Maintenance Velocity (weight: 15%)
7.5

Last commit: 0d ago. Weekly commits: 0. Latest release: v2.1.2. Maturity bonus: 2.1y old. Score: 7.5/10
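
The "0d ago" and "2.1y old" figures can be reproduced from the dates listed under GitHub Metrics. This is a minimal sketch, assuming the site measures both against the sync date and divides by an average year length of 365.25 days; neither assumption is confirmed by the page, and the variable and function names are illustrative.

```python
from datetime import date

# Dates taken from the GitHub Metrics block on this page.
created = date(2024, 5, 11)
last_commit = date(2026, 6, 3)
synced = date(2026, 6, 3)

def age_in_years(start: date, end: date) -> float:
    """Elapsed time in years, assuming an average year of 365.25 days."""
    return (end - start).days / 365.25

# Days between the last commit and the sync date.
days_since_commit = (synced - last_commit).days

# Repository age at sync time, rounded to one decimal as shown on the page.
repo_age = round(age_in_years(created, synced), 1)

print(f"Last commit: {days_since_commit}d ago")
print(f"Age: {repo_age}y")
```

Under these assumptions the result matches the page: 0 days since the last commit and an age of 2.1 years.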

API Design & DX (weight: 20%)
6.4

Stars/issues ratio: 9. Dynamic language: Python. Has documentation site. Permissive license: Apache-2.0. Popularity signal: 5,094 stars. Score: 6.4/10
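
The "Issue ratio: 11.3%" and "Stars/issues ratio: 9" figures appear to be two views of the same pair of numbers. The sketch below assumes the issue ratio is open issues divided by stars and that the stars/issues ratio is its rounded inverse; the page does not state either formula, so treat both as a plausible reconstruction.

```python
# Raw counts reported on this page.
stars = 5094
open_issues = 574

# Assumed definition: open issues as a percentage of stars.
issue_ratio_pct = round(100 * open_issues / stars, 1)

# Assumed definition: stars per open issue, rounded to a whole number.
stars_per_issue = round(stars / open_issues)

print(f"Issue ratio: {issue_ratio_pct}%")
print(f"Stars/issues ratio: {stars_per_issue}")
```

Both values agree with the page under these definitions, which suggests the two signals are not independent inputs to the scores.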

Production Readiness (weight: 15%)
7.3

Battle-tested: 5,094 stars. Peer review: 41 contributors. Versioned: v2.1.2. Licensed: Apache-2.0. Age: 2.1 years. Maintenance: last commit 0d ago. Score: 7.3/10

Ecosystem Integration (weight: 10%)
7.9

Fork interest: 540. Major ecosystem: Python. Integration-friendly: Apache-2.0. Adoption: 5,094 stars. Has web presence. Score: 7.9/10
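
The six category weights sum to 100%, so the headline 7.0 near the top of the page is consistent with a weighted mean of the category scores. The sketch below assumes that is how the headline figure is derived; the page does not state the formula, and the dictionary layout is illustrative.

```python
# (score, weight in percent) for each category, as listed on this page.
categories = {
    "Documentation Quality": (7.3, 20),
    "Community Health": (6.2, 20),
    "Maintenance Velocity": (7.5, 15),
    "API Design & DX": (6.4, 20),
    "Production Readiness": (7.3, 15),
    "Ecosystem Integration": (7.9, 10),
}

# Total weight; dividing by it keeps the result a true mean
# even if the weights did not sum to exactly 100.
total_weight = sum(weight for _, weight in categories.values())

# Weighted mean of the category scores (assumed aggregation rule).
weighted_sum = sum(score * weight for score, weight in categories.values())
overall = weighted_sum / total_weight

print(f"Total weight: {total_weight}%")
print(f"Overall score: {overall:.1f}")
```

The unrounded mean is about 6.99, which rounds to the 7.0 shown on the page.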

Tags
ascend, cuda, deepseek, distributed-inference, genai, high-performance-inference, inference, llama, llm, llm-inference
Radar chart (axes: Documentation Quality, Community Health, Maintenance Velocity, API Design & DX, Production Readiness, Ecosystem Integration)