Agents and RAG apps re-send the same long prefix every turn. Prompt caching cuts input cost up to ~90% and speeds first tokens — the win most teams leave off.