AdvancedVocabulary#ai#devops#developer-tools

Edge Inference Vocabulary

Build fluency in the vocabulary of running a trained model close to where data originates.

0 / 5 completed

1 / 5

At standup, a dev mentions running a trained machine learning model directly on a nearby edge device or server, close to where the data originates, instead of sending every request to a distant centralized cloud model. What is this pattern called?

2 / 5

During a design review, the team wants to shrink a large trained model's size and computational cost so it fits within an edge device's limited memory and processing power. Which capability supports this?

3 / 5

In a code review, a dev notices the edge device is configured to fall back to a centralized cloud model whenever its local, compressed model's confidence score falls below a defined threshold. What does this represent?

4 / 5

An incident report shows an edge device's compressed model produced a noticeably less accurate prediction than the original full model, and no one had measured that accuracy gap before deploying it. What practice would prevent this?

5 / 5

During a PR review, a teammate asks why the team runs inference on the edge device instead of always sending the request to a centralized cloud model for every prediction. What is the reasoning?