RAG - just a poor engineering workaround?
RAG - just a poor engineering workaround?
My week kicked off nicely with some inspiring talks on an internal conference. In one of the talks Johannes Langer dived deep on how to build production-ready RAG systems. I answered his opening questions to the audience - โWhat is RAG?โ - with โ๐๐ผ๐ณ๐ณ๐ฒ๐ฟ๐ธ๐น๐ฎ๐๐๐๐ฟโ, which translates to ๐ผ๐ฝ๐ฒ๐ป ๐ฏ๐ผ๐ผ๐ธ ๐๐ฒ๐๐ in my head.
Thinking more about this analogy, I find it is helpful to approach the question if RAG is just a workaround to overcome limitations of our current foundation models or is here to stay, one a more conceptual level. The German wikipedia article on โ๐๐ผ๐ณ๐ณ๐ฒ๐ฟ๐ธ๐น๐ฎ๐๐๐๐ฟโ talks about some of the motivations for this kind of test: huge efforts for students on memorising independent facts are eliminated, the test scope can be wider and the test is focussing more on the ability to creatively think and find new solutions approaches. In other words this approach is frugal with students resources and incentives creation of new solutions.
From my perspective this is very analogous to foundation models, which we are using today in ๐๐ฒ๐ป๐ฒ๐ฟ๐ฎ๐๐ถ๐๐ฒ AI. Similar to students those need some knowledge to be able to generate answers, but there is no reason to limit those to just their internal knowledge if there are other, more efficient, system component to provide them with additional knowledge. I really think that RAG is enabling a reasonable logical decomposition into using, primarily, Foundation models to their strengths and knowledge retrieval systems as their support. The famous Swiss Army knife, while a very handy tool in some situations, is usually beaten by special purpose tools. Hence the focus in today’s system design to find the best fitting components for a use case instead of using just multi-purpose tools.
Therefore Iโm really excited to see ongoing innovations in retrieval systems to improve their accuracy. Recently Anthropic released their Contextual Retrieval, which improves RAG architecture by adding additional context information which can be used to retrieve better matches to the actual queries. If you havenโt seen it yet, I recommend to have a look!
Contextual Retrieval: https://lnkd.in/eWcqPW8m “Kofferklausur” on wikipedia: https://lnkd.in/eusUbAA2.
Cross-posted to LinkedIn