Intermediate Collocations #machine learning #evaluation #benchmarking #AI

ML Model Evaluation Language Collocations

ML evaluation has its own precise vocabulary. This quiz covers the standard collocations for evaluating performance, benchmarking models, running ablations, and reporting metrics.

1 / 5

Fill in: 'Before deploying to production, the team needed to ___ the model's performance on the held-out test set.'

2 / 5

Fill in: 'They decided to ___ the new transformer model against the previous LSTM baseline on all five tasks.'

3 / 5

Fill in: 'To understand which features mattered, the researchers decided to ___ ablations on each input component.'

4 / 5

Fill in: 'The paper includes a table that allows readers to ___ baselines across five public datasets.'

5 / 5

Fill in: 'At the end of each experiment, the team must ___ metrics using the standardised reporting template.'