STACKQUADRANT

andrewkchan/yalm

Inference Engines

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

3.7
GitHub Metrics
Stars
584
Forks
62
Open Issues
3
Watchers
10
Contributors
1
Weekly Commits
0
Language
C++
License
Last Commit
Sep 13, 2025
Created
Oct 2, 2024
Latest Release
Release Date
Synced: Jun 3, 2026
Quality Scores
Documentation Qualityw: 20%
3.8

No dedicated docs site. Description: 82 chars. Stars signal: 584. Contributors: 1. Score: 3.8/10

Community Healthw: 20%
3.5

Stars: 584. Contributors: 1. Watchers: 10. Forks: 62. Issue ratio: 0.5%. Score: 3.5/10

Maintenance Velocityw: 15%
2.3

Last commit: 263d ago. Weekly commits: 0. No releases published. Maturity bonus: 1.7y old. Score: 2.3/10

API Design & DXw: 20%
5.9

Stars/issues ratio: 195. No dedicated API docs. No license specified. Popularity signal: 584 stars. Score: 5.9/10

Production Readinessw: 15%
2.3

Battle-tested: 584 stars. Peer review: 1 contributors. No versioned releases. No license (risky for production). Age: 1.7 years. Maintenance: last commit 263d ago. Score: 2.3/10

Ecosystem Integrationw: 10%
3.9

Fork interest: 62. Ecosystem: C++. No license (integration risk). Adoption: 584 stars. Score: 3.9/10

Tags
cppcudainference-enginellamallamacppllmllm-inferencemachine-learningmistral
Radar
Documentation Quality
Community Health
Maintenance Velocity
API Design & DX
Production Readiness
Ecosystem Integration