AdvancedVocabulary#data-science-ml#backend#architecture

Training-Serving Skew Vocabulary

Learn the vocabulary of a feature computed differently between a training pipeline and a live serving pipeline.

0 / 5 completed

1 / 5

At standup, a dev mentions a model performing noticeably worse in production than its offline test metrics suggested, traced to a feature being computed differently by the live inference pipeline than it was during training. What is this problem called?

2 / 5

During a design review, the team wants the exact same code and logic to compute a feature for both the training pipeline and the live serving pipeline, rather than maintaining two separate implementations. Which capability supports this?

3 / 5

In a code review, a dev notices a bug where the training pipeline computes a feature named 'avg_purchase' as an average over the last 30 days, while the online serving pipeline computes the same-named feature as an average over the last 7 days. What does this represent?

4 / 5

An incident report shows a model's live accuracy dropped sharply right after launch despite strong offline test metrics, and the root cause was traced to the serving pipeline computing a key feature differently than the training pipeline had computed the same-named feature. What practice would prevent this?

5 / 5

During a PR review, a teammate asks why the team unifies feature computation behind a shared feature store instead of maintaining separate, independently optimized feature code for the training pipeline and the serving pipeline. What is the reasoning?