AdvancedVocabulary#ai-llm#data-science-ml#backend

Text Embedding Models Vocabulary

Practice the vocabulary of converting text into a semantic vector for similarity search.

0 / 5 completed

1 / 5

At standup, a dev mentions converting a piece of text into a fixed-length numerical vector that captures its semantic meaning, so two texts with similar meaning end up with vectors close together. What produces this vector?

2 / 5

During a design review, the team wants to keep using the exact same embedding model version for every new piece of text added to their vector index, rather than mixing vectors produced by two different model versions. Which capability supports this?

3 / 5

In a code review, a dev notices the team re-embeds every existing document in the index whenever they upgrade to a new embedding model version, rather than only embedding new documents going forward with the new version. What does this represent?

4 / 5

An incident report shows a search feature's relevance quietly degraded after half the index was re-embedded with a new model version while the other half still held vectors from the old version. What practice would prevent this?

5 / 5

During a PR review, a teammate asks why the team requires re-embedding the entire index instead of just embedding new documents with the upgraded model going forward. What is the reasoning?