STACKQUADRANT

Inference Engines

High-performance model inference and serving runtimes

2 repos

llama.cpp

8.3

llama.cpp — a leading open-source project in the AI/LLM ecosystem.

96.2k15.1kC++

vLLM

8.7

vLLM — a leading open-source project in the AI/LLM ecosystem.

71.5k13.8kPython