Evaluation & Testing
Frameworks for evaluating, benchmarking, and testing AI systems
PacificAI/langtest
5.9
★ 559◇ 49Python
Pacific-AI-Corp/langtest
5.9
★ 559◇ 49Python
relari-ai/continuous-eval
4.7
★ 516◇ 38Python
ifixai-ai/iFixAi
5.9
★ 462◇ 87Python
JonathanChavezTamales/llm-leaderboard
4.7
★ 361◇ 40JavaScript
rhesis-ai/rhesis
5.5
★ 358◇ 24Python
palico-ai/palico-ai
4.5
★ 342◇ 28TypeScript
PetroIvaniuk/llms-tools
4.9
★ 317◇ 44
faiscadev/fakecloud
5.4
★ 311◇ 19Rust
athina-ai/athina-evals
4.0
★ 300◇ 22Python
ai-dashboad/flutter-skill
5.3
★ 278◇ 36Dart
PramodDutta/qaskills
4.2
★ 133◇ 11TypeScript
← prev2 / 2