AdvancedVocabulary#ai-llm#backend#developer-tools

Prompt Caching Vocabulary

Practice the vocabulary of reusing a static prompt segment to reduce latency and cost.

0 / 5 completed
1 / 5
At standup, a dev mentions storing a long, unchanging portion of a prompt, like a system instruction, so the model doesn't have to reprocess it from scratch on every single call. What is this technique called?