benchflow-ai/awesome-evals
Evaluation & TestingA curated, non-BS library of the best resources for building and evaluating AI agents — papers, blogs, talks, tools, benchmarks. Maintained by BenchFlow.
No dedicated docs site. Description: 153 chars. Stars signal: 558. Contributors: 4. Score: 3.8/10
Stars: 558. Contributors: 4. Watchers: 1. Forks: 41. Issue ratio: 0.2%. Score: 3.8/10
Last commit: 1d ago. Weekly commits: 2. No releases published. Score: 6.6/10
Stars/issues ratio: 558. No dedicated API docs. License: NOASSERTION. Popularity signal: 558 stars. Score: 6.4/10
Battle-tested: 558 stars. Peer review: 4 contributors. No versioned releases. Licensed: NOASSERTION. Age: 0.0 years. Maintenance: last commit 1d ago. Score: 3.9/10
Fork interest: 41. License: NOASSERTION. Adoption: 558 stars. Score: 3.2/10