Prompt Caching
API feature that lets you reuse a long, static prompt prefix across many requests at a fraction of the usual input-token cost. Anthropic, OpenAI, and Google all offer it, each with slightly different rules.
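To make "a fraction of the token cost" concrete, here is a minimal sketch of the arithmetic. It is an illustration only: the function name, the unit price, and both multipliers are hypothetical placeholders, not any provider's actual rates, and it assumes the cached prefix stays alive for every request after the first.

```python
def prompt_cost(prefix_tokens, suffix_tokens, requests,
                price_per_token, cached_read_multiplier,
                cache_write_multiplier=1.0):
    """Return (uncached_cost, cached_cost) for requests that share one prefix.

    All rates are hypothetical inputs; check your provider's pricing page.
    Assumes the cache entry never expires between requests.
    """
    # Without caching, every request pays full price for prefix + suffix.
    uncached = requests * (prefix_tokens + suffix_tokens) * price_per_token

    # With caching, the first request writes the prefix into the cache
    # (possibly at a surcharge) and pays full price for the suffix.
    first = (prefix_tokens * cache_write_multiplier + suffix_tokens) * price_per_token

    # Each later request reads the prefix at the discounted rate.
    later = (requests - 1) * (
        prefix_tokens * cached_read_multiplier + suffix_tokens
    ) * price_per_token

    return uncached, first + later


# Hypothetical numbers: 10,000-token prefix, 100-token suffix, 10 requests,
# unit price 1.0, cached reads at 0.25x, cache writes at 1.25x.
uncached, cached = prompt_cost(10_000, 100, 10, 1.0, 0.25, 1.25)
print(f"uncached={uncached}, cached={cached}")
```

The saving grows with the number of requests and with the share of each prompt taken up by the static prefix; with a single request, a write surcharge can make caching cost slightly more than not caching.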