Token Optimization
Practice of reducing the number of tokens sent to and received from an LLM without losing task quality. Includes prompt caching, message pruning, context window management, and structured output.
Practice of reducing the number of tokens sent to and received from an LLM without losing task quality. Includes prompt caching, message pruning, context window management, and structured output.