Evaluation & Testing
Frameworks for evaluating, benchmarking, and testing AI systems
rhesis-ai/rhesis
5.4
★ 311 ◇ 23 Python
PetroIvaniuk/llms-tools
4.7
★ 306 ◇ 40
athina-ai/athina-evals
4.1
★ 299 ◇ 21 Python
ai-dashboad/flutter-skill
5.1
★ 190 ◇ 23 Dart
PramodDutta/qaskills
4.0
★ 102 ◇ 4 TypeScript