Posted inБлог
Pro tip to reduce Time-to-First-Token (TTFT) for long prompt
Pro tip to reduce Time-to-First-Token (TTFT) for long prompts via API: warm up the prompt cache. Send your system prompt ahead of the user prompt. Claude will cache it without…
