lm-evaluation-harness vs simple-evals

Side-by-side comparison of two llm evaluation: tools.

Category
Llm Evaluation:
Llm Evaluation:
Overall Rank
#1017
#1019
Score
8
8
GitHub Stars
Free Tier
Yes
Yes
Starting Price
Sources
1
1

Score Comparison

lm-evaluation-harness

A framework for few-shot evaluation of language models.

View full profile

simple-evals

Eval tools by OpenAI.

View full profile