AdvancedVocabulary#anthropic#claude#prompt-caching#llm#optimization

Anthropic Prompt Caching

Anthropic's prompt caching stores frequently reused prompt prefixes server-side, enabling cache reads at ~10% of normal input token cost. Learn the vocabulary for cache_control directives, TTL behavior, creation vs read token billing, and multi-turn conversation patterns.

0 / 5 completed

1 / 5

An engineer adds cache_control: { type: 'ephemeral' } to a 900-token system prompt. On the second identical call 10 minutes later, which token counts appear in the response usage?

2 / 5

What is the minimum number of tokens a prompt block must contain before Anthropic will store it in the prompt cache?

3 / 5

How long does an Anthropic prompt cache entry remain valid after its last use?

4 / 5

A developer places cache_control on the last user message in a multi-turn conversation. What does Anthropic cache in this case?

5 / 5

Which cost multiplier applies to cache_creation_input_tokens compared to standard input token pricing for Claude 3.5 Sonnet?