RAG looks simple on a slide. Index the data, retrieve a few chunks, let the model speak. Then reality shows up. Queries are messy. Sources are noisy. Latency bites. Costs creep. You ship a demo, but users catch the weak spots in five minutes. The goa...