STACKQUADRANT

andrewkchan/yalm

Inference Engines

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

GitHub Metrics
Stars
553
Forks
55
Open Issues
3
Watchers
9
Contributors
1
Weekly Commits
0
Language
C++
License
Last Commit
Sep 13, 2025
Created
Oct 2, 2024
Latest Release
Release Date
Synced: Mar 3, 2026
Quality Scores
Documentation Qualityw: 20%
0.0
Community Healthw: 20%
0.0
Maintenance Velocityw: 15%
0.0
API Design & DXw: 20%
0.0
Production Readinessw: 15%
0.0
Ecosystem Integrationw: 10%
0.0
Tags
cppcudainference-enginellamallamacppllmllm-inferencemachine-learningmistral
Radar
No scores yet