AdvancedVocabulary#data-science-ml#ai-llm#backend

Feature Engineering Pipeline Vocabulary

Practice the vocabulary of transforming raw data into a model's input features consistently.

0 / 5 completed

1 / 5

At standup, a dev mentions a repeatable, automated pipeline that transforms raw data into the exact input features a machine learning model expects, applied consistently for both training and live prediction. What is this pipeline called?

2 / 5

During a design review, the team wants the exact same feature-transformation logic used at training time to also run at live prediction time, rather than two separately maintained implementations that could drift apart. Which capability supports this?

3 / 5

In a code review, a dev notices the pipeline validates that a computed feature's distribution at prediction time still resembles what it looked like during training, flagging a meaningful shift. What does this represent?

4 / 5

An incident report shows a live prediction service computed a feature slightly differently than the training pipeline did, due to a subtle rounding difference between the two separately maintained implementations, silently hurting model accuracy. What practice would prevent this?

5 / 5

During a PR review, a teammate asks why the team invests in a shared, single-implementation feature pipeline instead of letting the training and serving paths each maintain their own separate feature logic. What is the reasoning?