AppScale Blog — AI, Cloud Architecture & System Design

Blog

Engineering Insights

In-depth analyses of AI systems, cloud architecture, distributed systems, and engineering leadership.

Hybrid Search and Re-ranking in Production RAG: BM25, Dense Vectors, Cross-encoders, and Everything In Between (2026)

The single biggest reason production RAG systems return confident wrong answers is not the LLM, the prompt, or the chunking: it is the retriever placing the wrong documents in the top-k. Dense-vector-only retrieval gives 70% recall on conceptual queries and 30% on exact-term queries, and a better embedding model does not fix it because the failure mode is structural. The architecture the field has converged on in 2026 runs a sparse retriever (BM25 or SPLADE) and a dense retriever (bi-encoder embeddings) in parallel, fuses their results via RRF (reciprocal rank fusion) or weighted-α blending, re-ranks the top-50 candidates with a cross-encoder, and diversifies the result with MMR (maximal marginal relevance), with an ACL/freshness pre-filter and query understanding in front. This article is a deep dive into what each primitive does, why each fails, the latency budget, eight anti-patterns, and the five-stage maturity ladder from single-retriever to calibrated-fusion-with-online-feedback.

May 19, 2026

microservices-patterns · 1 min read

Modules vs Vertical Slices: Macro vs Micro Architecture in the Modular Monolith (2026)

Framing the debate as "Clean Architecture vs Vertical Slice Architecture" is a category error: the two operate on different axes. A module is a macro-architectural decision about bounded contexts, public contracts, data ownership, and communication style. A vertical slice is a micro-architectural decision about feature folder organisation inside a module. The killer property of a real modular monolith is that the two axes are independent: heterogeneous internals (Clean Architecture in one module, vertical slices in another, transaction scripts in a third) live safely behind homogeneous module boundaries enforced by project references, ArchUnit rules, and schema grants. This article is the technical deep dive: the five enforceable module properties, the four slice properties, the cross-module communication spectrum from in-process method calls to outbox-backed event buses, the per-module internal-style decision matrix, multi-layered boundary enforcement, eight anti-patterns, and the five-stage maturity ladder from layered monolith to deliberate modular-monolith target state.

Engineering Insights

Hybrid Search and Re-ranking in Production RAG: BM25, Dense Vectors, Cross-encoders, and Everything In Between (2026)

Modules vs Vertical Slices: Macro vs Micro Architecture in the Modular Monolith (2026)

Agentic AI Debugging: When the Loop Doesn't Stop (2026)

Evaluation-Driven Development: Replacing TDD for LLM Systems (2026)

LLMjacking 2026: How Attackers Hijack Your Bedrock and OpenAI Quota — and the Seven-Layer Defence That Stops the $84,000 Weekend

AI Compliance Architecture: One Control Plane for EU AI Act, GDPR, DPDP, HIPAA, and APPI (2026)

Air-Gapped AI Architecture: Offline LLM Systems for Regulated and Classified Environments (2026)

Multi-Tenant RAG Isolation: The 7 Attack Vectors and the Architecture That Closes Them (2026)

Cost Engineering for LLM Features: From $100k to $1M Monthly Spend (2026)

Build a Multi-Agent AI System with LangGraph + MCP + A2A: Beginner-Friendly End-to-End Tutorial (2026)

Prompt Injection Defence in Depth (2026): Six Layers from Input Sanitisation to Output Firewall

Agritech AI Architecture: Pasture Vision, Livestock Behaviour Models, and Low-Bandwidth Edge (NZ Reference, 2026)

Game AI Architecture: Procedural Quest Systems and LLM-Driven NPC Dialogue (Budget Models, 2026)

Agentic AI for Mining and Resources: Shift Handover, Tool Use, and Fleet Coordination (2026)

AI Incident Response Runbook: RCA for LLM Failures (2026)

Agentic AI Meets Ringi: Decision Loop Architecture for Japanese Enterprise Approval Flows (2026)

Betriebsrat and AI Deployment: Co-Determination-Friendly Rollout Architecture (2026)

Data Sovereignty Architecture: Respecting Māori Data Principles in Tikanga-Aware ML Systems (2026)

AI Nearshoring Architecture: Poland as the EU AI Delivery Hub — Team Topology and Data Residency (2026)

Privacy-by-Design RAG Architecture for the Australian Privacy Act 2025 Reforms and the Statutory Tort (2026)
