AdvancedVocabulary#ai#security#developer-tools

Prompt Injection Vocabulary

Build fluency in the vocabulary of defending a language model against malicious embedded instructions.

0 / 5 completed

1 / 5

At standup, a dev mentions a security concern where malicious text embedded in retrieved content could override the intended instructions given to a language model. What is this attack called?

2 / 5

During a design review, the team wants to clearly separate a system's trusted instructions from untrusted user or retrieved content within the prompt. Which capability supports this?

3 / 5

In a code review, a dev notices the system restricts what actions a model-driven agent is allowed to take, even if a prompt injection attempt tries to instruct it otherwise. What does this represent?

4 / 5

An incident report shows an AI agent processing a retrieved web page followed embedded hidden text instructing it to leak sensitive conversation history. What practice would reduce this risk?

5 / 5

During a PR review, a teammate asks why the team applies least-privilege action restrictions to an AI agent instead of relying solely on prompt-level instructions to prevent it from taking a harmful action. What is the reasoning?