AdvancedVocabulary#ai-llm#data-science-ml#backend

Embedding Drift Vocabulary

Practice the vocabulary of keeping stored vectors comparable across an embedding model upgrade.

0 / 5 completed

1 / 5

At standup, a dev mentions that vectors produced by an older embedding model version are no longer meaningfully comparable to vectors produced by a newer version, since the newer model encodes semantic meaning differently. What is this phenomenon called?

2 / 5

During a design review, the team wants every stored vector regenerated with the new embedding model whenever that model's version changes, rather than leaving old vectors untouched alongside new ones. Which capability supports this?

3 / 5

In a code review, a dev notices each stored vector carries a tag recording exactly which embedding model version produced it, letting the system detect a mismatch before comparing two vectors. What does this represent?

4 / 5

An incident report shows search relevance degraded sharply right after a new embedding model version was rolled out, because old vectors from the previous version were being compared directly against new query embeddings with no version check catching the mismatch. What practice would prevent this?

5 / 5

During a PR review, a teammate asks why the team tags and re-embeds stored vectors after an embedding model upgrade instead of just leaving the old vectors in place alongside newly generated ones. What is the reasoning?