Honest decision matrix for the 2026 Japanese LLM stack: when do ELYZA, Stockmark, PLaMo, and Sakana beat frontier APIs (GPT-4o, Claude, Gemini) — and when do they not? Tokenisation cost penalty, keigo handling, vertical vocabulary, sovereignty requirements, and the multi-LLM routing architecture that lets you have all three. Globally portable pattern: Japanese is the demanding instance, the routing layer that survives Japanese will survive Korean, Mandarin, Hindi, and every other language with similar characteristics.