#evaluation
2 articles tagged #evaluation
All English for IT articles related to #evaluation.
- English for ML Model Evaluation Discussions
  Learn the vocabulary of machine learning model evaluation: precision/recall, AUC-ROC, BLEU/ROUGE, LLM-as-judge, RAGAS, hallucination rate, red-teaming, and benchmark saturation.
- English for LLM Evaluation: Vocabulary Every AI Engineer Needs
  Learn the English vocabulary for LLM evaluation: MMLU, HumanEval, BLEU, ROUGE, BERTScore, hallucination, ground truth, and judge LLMs for AI model assessment.