Production LLM agents fail on cost, latency, and memory when the context window is filled like a bucket. Context engineering treats it as a budget allocated every turn.