🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑‍💻 Video Tutorials.

★ 4.2k ◇ 404

ModelTC/LightLLM

6.6

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

★ 4.2k ◇ 344 Python

thu-pacman/chitu

7.1

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

★ 3.1k ◇ 269 Python

containers/ramalama

7.7

RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of containers.

★ 3.0k ◇ 348 Python

vllm-project/vllm-ascend

7.2

Community-maintained hardware plugin for vLLM on Ascend.

★ 2.4k ◇ 1.7k C++

roboflow/inference

7.3

Turn any computer or edge device into a command center for your computer vision projects.

★ 2.4k ◇ 286 Python

superlinked/sie

6.9

Superlinked Inference Engine is an open-source inference server and production cluster for embeddings, reranking, and extraction.

★ 2.3k ◇ 207 Python

tensorchord/envd

6.9

🏕️ Reproducible development environment for humans and agents

★ 2.2k ◇ 168 Go

microsoft/aici

4.9

AICI: Prompts as (Wasm) Programs

★ 2.1k ◇ 84 Rust

mlrun/mlrun

7.4

MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates the delivery of production data, ML pipelines, and online applications.

★ 1.7k ◇ 311 Python