STACKQUADRANT

RediSearch

RediSearch/RediSearch
7.2

A query and indexing engine for Redis, providing secondary indexing, full-text search, vector similarity search and aggregations.

Vector Databases
6.2k · 587 · C · NOASSERTION · today

aim

aimhubio/aim
7.0

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

Prompt Engineering
6.1k · 394 · Python · Apache-2.0 · today

chinese-llm-benchmark

jeinlee1991/chinese-llm-benchmark
6.3

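ReLE evaluation: a continuously updated capability benchmark for Chinese AI large language models. It currently covers 335 models, including commercial models such as chatgpt, gpt-5.2, o4-mini, Google gemini-3-pro, Claude-4.5, Wenxin ERNIE-X1.1, ERNIE-5.0-Thinking, qwen3-max, Baichuan, iFlytek Spark, and SenseTime senseChat, as well as open-source models such as kimi-k2, ernie4.5, minimax-M2, deepseek-v3.2, qwen3-2507, llama4, Zhipu GLM-4.6, gemma3, and mistral. It provides not only a leaderboard but also a library of more than 2 million LLM defect cases, so the community can study, analyze, and improve large models.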

Evaluation & Testing
6.1k · 247 · today

genkit

firebase/genkit
7.4

Open-source framework for building AI-powered apps in JavaScript, Go, and Python, built and used in production by Google

Vector Databases
6.1k · 756 · TypeScript · Apache-2.0 · today

genkit

genkit-ai/genkit
7.4

Open-source framework for building AI-powered apps in JavaScript, Go, and Python, built and used in production by Google

Vector Databases
6.1k · 756 · TypeScript · Apache-2.0 · today

zcf

UfoMiao/zcf
7.1

Zero-Config Code Flow for Claude Code & Codex

Agent Frameworks
6.0k · 423 · TypeScript · MIT · today

enchanted

gluonfield/enchanted
5.3

Enchanted is an iOS and macOS app for chatting with private, self-hosted language models such as Llama2, Mistral, or Vicuna using Ollama.

LLM Frameworks
6.0k · 420 · Swift · Apache-2.0 · 1y ago

atomic-agents

BrainBlend-AI/atomic-agents
6.9

Building AI agents, atomically

LLM Frameworks
6.0k · 511 · Python · MIT · today

Awesome-LLMOps

tensorchord/Awesome-LLMOps
6.6

An awesome & curated list of best LLMOps tools for developers

AI DevOps
5.8k · 806 · Shell · CC0-1.0 · 13d ago

helicone

Helicone/helicone
7.4

🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓

AI DevOps
5.8k · 593 · TypeScript · Apache-2.0 · 15d ago

flashinfer

flashinfer-ai/flashinfer
7.4

FlashInfer: Kernel Library for LLM Serving

Inference Engines
5.7k · 1.0k · Python · Apache-2.0 · today

pyspur

PySpur-Dev/pyspur
6.9

A visual playground for agentic workflows: Iterate over your agents 10x faster

LLM Frameworks
5.7k · 425 · TypeScript · Apache-2.0 · 8d ago

rllm

rllm-org/rllm
7.4

Democratizing Reinforcement Learning for LLMs

Agent Frameworks
5.6k · 573 · Python · Apache-2.0 · today

kserve

kserve/kserve
7.7

Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes

Inference Engines
5.5k · 1.5k · Go · Apache-2.0 · today

AgentGuide

adongwanai/AgentGuide
5.2

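https://adongwanai.github.io/AgentGuide | AI Agent development guide | LangGraph in practice | Advanced RAG | Switching careers to LLMs | LLM interviews | Algorithm engineer | Interview question bank | Reinforcement learning | Data synthesis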

Agent Frameworks
5.5k · 555 · HTML · 5d ago

holaOS

holaboss-ai/holaOS
6.4

An Open Agent Computer for ANY digital work.

Agent Frameworks
5.5k · 402 · TypeScript · NOASSERTION · 1d ago

coze-loop

coze-dev/coze-loop
6.9

Next-generation AI Agent Optimization Platform: Cozeloop addresses challenges in AI agent development by providing full-lifecycle management capabilities from development, debugging, and evaluation to monitoring.

AI DevOps
5.5k · 763 · Go · Apache-2.0 · today

bifrost

maximhq/bifrost
7.4

Fastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overhead at 5k RPS.

AI DevOps
5.4k · 682 · Go · Apache-2.0 · today

zenml

zenml-io/zenml
7.6

ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.

AI DevOps
5.4k · 621 · Python · Apache-2.0 · today

giskard-oss

Giskard-AI/giskard-oss
7.6

🐢 Open-Source Evaluation & Testing library for LLM Agents

AI DevOps
5.4k · 467 · Python · Apache-2.0 · today

awesome-ai-tools

mahseema/awesome-ai-tools
6.0

A curated list of Artificial Intelligence Top Tools

Agent Frameworks
5.4k · 1.6k · MIT · 5mo ago

TaskingAI

TaskingAI/TaskingAI
6.0

The open source platform for AI-native application development.

RAG Libraries
5.4k · 355 · Python · Apache-2.0 · 1y ago

MineContext

volcengine/MineContext
6.5

MineContext is your proactive context-aware AI partner (Context-Engineering + ChatGPT Pulse)

Vector Databases
5.4k · 401 · Python · Apache-2.0 · 27d ago

shimmy

Michael-A-Kuykendall/shimmy
6.3

⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.

Inference Engines
5.3k · 503 · Rust · Apache-2.0 · 1d ago

superduper

superduper-io/superduper
6.5

Superduper: End-to-end framework for building custom AI applications and agents.

Vector Databases
5.3k · 541 · Python · Apache-2.0 · 9mo ago

cactus

cactus-compute/cactus
7.1

Low-latency AI engine for mobile devices & wearables

LLM Frameworks
5.3k · 419 · C · NOASSERTION · today

Awesome-LLM-Inference

xlite-dev/Awesome-LLM-Inference
6.5

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Inference Engines
5.3k · 381 · Python · GPL-3.0 · 1mo ago

gpustack

gpustack/gpustack
7.0

Performance-optimized AI inference on your GPUs. Unlock superior throughput by selecting and tuning engines like vLLM or SGLang.

Inference Engines
5.1k · 540 · Python · Apache-2.0 · today

LLM-Engineers-Handbook

PacktPublishing/LLM-Engineers-Handbook
6.7

The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices

Evaluation & Testing
5.1k · 1.2k · Python · MIT · 1mo ago

SPTAG

microsoft/SPTAG
6.6

A distributed approximate nearest neighbor (ANN) search library that provides high-quality vector index build, search, and distributed online serving toolkits for large-scale vector search scenarios.

Vector Databases
5.0k · 618 · C++ · MIT · 2d ago

h2o-llmstudio

h2oai/h2o-llmstudio
7.6

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

Fine-tuning Tools
5.0k · 529 · Python · Apache-2.0 · 1d ago

eko

FellouAI/eko
7.0

Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai

Inference Engines
4.9k · 439 · TypeScript · MIT · 3mo ago

vllm-omni

vllm-project/vllm-omni
7.3

A framework for efficient model inference with omni-modality models

Model Serving
4.9k · 1.0k · Python · Apache-2.0 · today

mini-swe-agent

SWE-agent/mini-swe-agent
7.3

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!

Agent Frameworks
4.8k · 657 · Python · MIT · today

AutoRAG

Marker-Inc-Korea/AutoRAG
7.1

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

RAG Libraries
4.8k · 399 · Python · Apache-2.0 · today

TencentDB-Agent-Memory

Tencent/TencentDB-Agent-Memory
6.3

TencentDB Agent Memory delivers fully local long-term memory for AI Agents via a 4-tier progressive pipeline, with zero external API dependencies.

Vector Databases
4.7k · 393 · TypeScript · NOASSERTION · 1d ago

helix-db

HelixDB/helix-db
7.2

HelixDB is an open-source graph-vector database built from scratch in Rust.

Vector Databases
4.7k · 250 · Rust · AGPL-3.0 · today

awesome-vibe-coding

filipecalegario/awesome-vibe-coding
5.9

A curated list of vibe coding references, collaborating with AI to write code.

Agent Frameworks
4.6k · 541 · CC0-1.0 · 1mo ago

ag2

ag2ai/ag2
7.8

AG2 (formerly AutoGen): The Open-Source AgentOS. Join us at: https://discord.gg/sNGSwQME3x

Agent Frameworks
4.6k · 646 · Python · Apache-2.0 · today

Integuru

Integuru-AI/Integuru
6.4

The first AI agent that builds permissionless integrations through reverse engineering platforms' internal APIs.

Agent Frameworks
4.6k · 359 · Python · AGPL-3.0 · 8d ago

Olares

beclab/Olares
7.0

Olares: An Open-Source Personal Cloud to Reclaim Your Data

Model Serving
4.6k · 266 · Go · AGPL-3.0 · today

youtu-agent

TencentCloudADP/youtu-agent
6.5

A simple yet powerful agent framework that delivers with open-source models

Agent Frameworks
4.6k · 466 · Python · NOASSERTION · 2mo ago

infinity

infiniflow/infinity
7.3

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

Vector Databases
4.5k · 423 · C++ · Apache-2.0 · 10d ago

m_flow

FlowElement-ai/m_flow
6.7

A bio-inspired cognitive memory engine — a new paradigm for Graph RAG.

Vector Databases
4.5k · 260 · Python · Apache-2.0 · 1mo ago

cognita

truefoundry/cognita
6.8

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

LLM Frameworks
4.4k · 387 · Python · Apache-2.0 · 2mo ago

crate

crate/crate
7.8

CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.

Vector Databases
4.4k · 599 · Java · Apache-2.0 · today

Deep-Learning-in-Production

ahkarami/Deep-Learning-in-Production
4.5

In this repository, I will share some useful notes and references about deploying deep learning-based models in production.

Model Serving
4.4k · 687 · 1y ago

semantic-router

vllm-project/semantic-router
7.5

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Fine-tuning Tools
4.3k · 693 · Go · Apache-2.0 · today