Multi-tenant SaaS with LLM features hits a problem the rest of SaaS solved a decade ago: when one tenant's usage costs 50× another tenant's and the bill arrives as a single line item from OpenAI or Anthropic, unit economics quietly become fiction. This is the four-layer architecture that fixes it — gateway, audit log, versioned cost model, roll-up — implementation order week-by-week, the four shapes of noisy neighbour, three pricing models the architecture unlocks, 8 anti-patterns to retire, and the 5-stage maturity ladder. Globally portable across OpenAI, Anthropic, Bedrock, Vertex, and self-hosted vLLM.