STACKQUADRANT

flashinfer-ai/flashinfer

Inference Engines

FlashInfer: Kernel Library for LLM Serving

GitHub Metrics
Stars: 5.1k
Forks: 751
Open Issues: 465
Watchers: 46
Contributors: 229
Weekly Commits: 0
Language: Python
License: Apache-2.0
Last Commit: Mar 2, 2026
Created: Jul 22, 2023
Latest Release: v0.6.4
Release Date: Feb 19, 2026
Synced: Mar 3, 2026
Quality Scores
Documentation Quality (weight 20%): 0.0
Community Health (weight 20%): 0.0
Maintenance Velocity (weight 15%): 0.0
API Design & DX (weight 20%): 0.0
Production Readiness (weight 15%): 0.0
Ecosystem Integration (weight 10%): 0.0
Tags
attention, cuda, distributed-inference, gpu, jit, large-large-models, llm-inference, moe, nvidia, pytorch
Radar
No scores yet