lm-evaluation-harness vs simple-evals

Side-by-side comparison of two LLM evaluation tools.

                 lm-evaluation-harness   simple-evals
Category         LLM Evaluation          LLM Evaluation
Overall Rank     #1017                   #1019
Score            8                       8
GitHub Stars     —                       —
Free Tier        Yes                     Yes
Starting Price   —                       —
Sources          1                       1

lm-evaluation-harness

A framework for few-shot evaluation of language models.

simple-evals

Eval tools by OpenAI.
