TL;DR: Building AI memory at 10M nodes taught us hard lessons about query variability, static weights, and latency. Here's what broke and how we're fixing it.
You've built the RAG pipeline. Embeddings are working. Retrieval is fast. Then a user asks: "What