AdvancedVocabulary#ai#llm

LLM Evaluation

5 exercises on LLM evaluation vocabulary.

0 / 5 completed

1 / 5

In LLM evaluation, what is a "hallucination"?

2 / 5

What is an "eval set" (evaluation dataset) and why is it essential?

3 / 5

What is the "LLM-as-judge" evaluation technique?

4 / 5

In the RAGAS framework for evaluating RAG systems, what does "faithfulness" measure?

5 / 5

What is a "benchmark" like MMLU or HumanEval used for?