Production Model Evaluation Vocabulary

Practice vocabulary for evaluating models in production: online evaluation, A/B testing models, shadow mode, comparing model versions, and interpreting live metrics.

0 / 5 completed

1 / 5

The MLOps team uses ___ evaluation to measure model performance on live traffic.

2 / 5

To compare two recommendation models the team runs an A/___ test in production.

3 / 5

Before full rollout the new model runs in ___ mode: it receives live requests and makes predictions, but results are not served to users.

4 / 5

The engineering report reads: 'We're comparing ___ and ___ in production.' What are v1 and v2?

5 / 5

The weekly report says: 'The new model shows 3% better ___ in shadow mode.' What metric is being measured?