Najlepsze narzędzia AI do Llm Evaluation:
8 narzędzi sklasyfikowanych według sygnałów społeczności i danych.
1
lm-evaluation-harness
A framework for few-shot evaluation of language models.
Za darmo8 pkt
2
lighteval
a lightweight LLM evaluation suite that Hugging Face has been using internally.
Za darmo8 pkt
3
simple-evals
Eval tools by OpenAI.
Za darmo8 pkt
4
OLMO-eval
a repository for evaluating open language models.
Za darmo8 pkt
5
HELM
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models.
Za darmo8 pkt
6
instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out
Za darmo8 pkt
7
Giskard
Testing & evaluation library for LLM applications, in particular RAGs
Za darmo8 pkt
8
Ragas
a framework that helps you evaluate your Retrieval Augmented Generation (RAG) pipelines.
Za darmo8 pkt