Migliori strumenti AI per Llm Evaluation:
8 strumenti classificati per segnali della comunità e dati.
1
lm-evaluation-harness
A framework for few-shot evaluation of language models.
Gratis8 pt
2
lighteval
a lightweight LLM evaluation suite that Hugging Face has been using internally.
Gratis8 pt
3
simple-evals
Eval tools by OpenAI.
Gratis8 pt
4
OLMO-eval
a repository for evaluating open language models.
Gratis8 pt
5
HELM
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models.
Gratis8 pt
6
instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out
Gratis8 pt
7
Giskard
Testing & evaluation library for LLM applications, in particular RAGs
Gratis8 pt
8
Ragas
a framework that helps you evaluate your Retrieval Augmented Generation (RAG) pipelines.
Gratis8 pt