Model Evaluation Failure Vocabulary

Practice vocabulary for model evaluation failures: data leakage in evaluation, memorising test examples, benchmark gaming, fine-tuning on evaluation sets, and reliability failures.

0 / 5 completed
1 / 5
The post-mortem reveals ___ leakage in the evaluation: test examples appeared in the training set.