I just signed up for the Software Architecture Superstream: Architecture Patterns and Antipatterns f
I just signed up for the “Software Architecture Superstream: Architecture Patterns and Antipatterns for AI”[1] which is taking place at 12th August CEST late afternoon. The lineup of speakers and topics to be covered sound very interesting. Maybe something for you too?
Glad to listen again to my dear colleague Luca Mezzalira ๐ as one of the speakers!
#softwarearchitecture #patterns #genai
Diving into designing multi-agent systems and got lost with all the different implementation options
Diving into designing multi-agent systems and got lost with all the different implementation options? MCP (x)or A2A?! - Heikoโs and Dr. Sokratis Kartakis (any way to mention you in here just by your first name, mate?) nice article got your back. Highly recommended read! Congrats to both for being published there!
“UPTIME? We donโt care - weโre a subscription business.”
This sounds wrong in so many aspects. Still it has some truth to it. Let me start with a disclaimer: Itโs neither originated from a current or former customer, nor does it reflect how current or prior employers of mine are operating or thinking. Itโs just me exaggerating a brainstorming with some folks.
But letโs start with what feels wrong with it? Where to start and where to end? โShows disregard for customer experience and service reliabilityโ โUndermines customer trust and loyaltyโ โCould lead to increased customer churnโ โMay violate terms of serviceโ [..]
LLMs for the rescue?! Or are we actually building Compound AI Systems?
LLMs for the rescue?! Or are we actually building Compound AI Systems?
LLMs rule the world, right?! - Only thing what matters is using the most powerful LLM available and everything falls in place. Looking for numbers - just consult the latest LLM benchmark. Hmm - or do we need to build systems?!
I think it’s not just a matter of choosing an LLM, or any foundation model for that matter, and if you are following me, you already know that. E.g. in my medium post on “How do you choose the foundation model for your Generative AI App โ like your car?"[2], I already argued how 1/ LLMs are just one part of your Generative AI application, but the overall application requires so much more components and engineering excellence and 2/ capabilities of frontier models become commodity with a ever increasing pace.
๐ Building your own RAG system is like deciding to build your own email server in 2024. Sure, you c
๐ “Building your own RAG system is like deciding to build your own email server in 2024. Sure, you could do it. But why would you want to?” - Alden Do Rosario in his article “Dear IT Departments, Please Stop Trying To Build Your Own RAG” (https://lnkd.in/ep9ZNJzq) on medium. Love it. Highly recommended read.
๐ก Don’t reinvent the wheel! The trap of building something, which on the first glance looks so simple, but then we you get into it you discover layers of hidden complexity.
What happens in Las Vegas ... Nah - let's have a look. All things (Gen) AI.
What happens in Las Vegas … Nah - let’s have a look. All things (Gen) AI.
At the time of starting this article, the first day of AWS re:Invent is over. AWS re:invent is the cloud computing conference, hosted annually by Amazon Web Services (AWS) in Las Vegas, Nevada. The 13th annual event takes place December 2-6, 2024. This year I’m not on-site, but still curious what it is happening there. Plan for the week is to update this article on a daily basis with the new things announced there. Let’s see how this goes.
RAG - just a poor engineering workaround?
RAG - just a poor engineering workaround?
My week kicked off nicely with some inspiring talks on an internal conference. In one of the talks Johannes Langer dived deep on how to build production-ready RAG systems. I answered his opening questions to the audience - โWhat is RAG?โ - with โ๐๐ผ๐ณ๐ณ๐ฒ๐ฟ๐ธ๐น๐ฎ๐๐๐๐ฟโ, which translates to ๐ผ๐ฝ๐ฒ๐ป ๐ฏ๐ผ๐ผ๐ธ ๐๐ฒ๐๐ in my head.
Thinking more about this analogy, I find it is helpful to approach the question if RAG is just a workaround to overcome limitations of our current foundation models or is here to stay, one a more conceptual level. The German wikipedia article on โ๐๐ผ๐ณ๐ณ๐ฒ๐ฟ๐ธ๐น๐ฎ๐๐๐๐ฟโ talks about some of the motivations for this kind of test: huge efforts for students on memorising independent facts are eliminated, the test scope can be wider and the test is focussing more on the ability to creatively think and find new solutions approaches. In other words this approach is frugal with students resources and incentives creation of new solutions.
โ Successful Building a GenAI use cases just requires the latest and greatest fr
โ Successful Building a GenAI use cases just requires the latest and greatest frontier model, right?!
๐ In my conversations with customers I often realize that the choice of the best & shiniest model is highly occupying their resources, while thinking and focusing on architecting and building the use cases which ultimately should fulfil users needs gets very little attention. This naturally fuelled by the fast-pace and loud announcements of new frontier models, which come withย superior benchmark results and new capabilities. What we often forget is that the majority of capabilities become commodity among different models very fast. Hence it often makes much more sense to focus on the use cases and building the generative application for the use case in a way it really addresses the userโs need and is able to evolve to utilize new release models where this makes sense. I wrote about this inย my blog post (https://lnkd.in/e_YhCinM.)
๐ Bagrat Ter-Akopyan, Carmen Heger and team. Congrats and thanks for the nice technical write-up.
๐ Bagrat Ter-Akopyan, Carmen Heger and team. Congrats and thanks for the nice technical write-up. It’s amazing to see how you continue to innovate on behalf of your customers. I love the outcome.
๐ What stands out for me from your technical report is your description of the why and how you moved away from your own, already very good, initial RAG implementation. Citing from the report:
“๐๐ฏ๐ฆ ๐ฐ๐ง ๐ฐ๐ถ๐ณ ๐ญ๐ฆ๐ข๐ณ๐ฏ๐ช๐ฏ๐จ๐ด ๐ง๐ณ๐ฐ๐ฎ ๐ต๐ฉ๐ฆ ๐ญ๐ข๐ด๐ต ๐ฃ๐ช๐จ ๐๐๐ ๐ฑ๐ณ๐ฐ๐ซ๐ฆ๐ค๐ต ๐ธ๐ข๐ด ๐ต๐ฉ๐ข๐ต ๐ธ๐ฆ ๐ฏ๐ฆ๐ฆ๐ฅ๐ฆ๐ฅ ๐ต๐ฐ ๐ด๐ฑ๐ฆ๐ฆ๐ฅ ๐ถ๐ฑ ๐ฐ๐ถ๐ณ ๐ฆ๐น๐ฑ๐ฆ๐ณ๐ช๐ฎ๐ฆ๐ฏ๐ต๐ข๐ต๐ช๐ฐ๐ฏ ๐ค๐บ๐ค๐ญ๐ฆ๐ด. ๐๐ฏ ๐ต๐ฉ๐ฆ ๐ต๐ฆ๐ค๐ฉ๐ฏ๐ช๐ค๐ข๐ญ ๐ด๐ช๐ฅ๐ฆ, ๐ต๐ฉ๐ข๐ต ๐ฎ๐ฆ๐ข๐ฏ๐ต ๐ถ๐ด๐ช๐ฏ๐จ ๐ฎ๐ฐ๐ณ๐ฆ ๐ฐ๐ง๐ง-๐ต๐ฉ๐ฆ-๐ด๐ฉ๐ฆ๐ญ๐ง ๐ด๐ฐ๐ญ๐ถ๐ต๐ช๐ฐ๐ฏ๐ด ๐ง๐ฐ๐ณ ๐๐๐ ๐ข๐ณ๐ค๐ฉ๐ช๐ต๐ฆ๐ค๐ต๐ถ๐ณ๐ฆ๐ด ๐ต๐ฉ๐ข๐ต ๐ฉ๐ข๐ท๐ฆ ๐ฃ๐ฆ๐ค๐ฐ๐ฎ๐ฆ ๐ข๐ท๐ข๐ช๐ญ๐ข๐ฃ๐ญ๐ฆ ๐ข๐ฏ๐ฅ ๐ณ๐ฆ-๐ถ๐ด๐ช๐ฏ๐จ ๐ฐ๐ถ๐ณ ๐ฐ๐ธ๐ฏ ๐ฆ๐น๐ช๐ด๐ต๐ช๐ฏ๐จ ๐ด๐ฐ๐ญ๐ถ๐ต๐ช๐ฐ๐ฏ๐ด.” and “๐๐ฐ ๐ฎ๐ข๐ฌ๐ฆ ๐ต๐ฉ๐ฆ ๐ฅ๐ฐ๐ค๐ถ๐ฎ๐ฆ๐ฏ๐ต๐ด ๐ด๐ฆ๐ข๐ณ๐ค๐ฉ๐ข๐ฃ๐ญ๐ฆ ๐ง๐ฐ๐ณ ๐ฐ๐ถ๐ณ ๐๐๐ ๐ด๐บ๐ด๐ต๐ฆ๐ฎ, ๐ธ๐ฆ ๐ถ๐ด๐ฆ๐ฅ ๐๐๐ ๐ฌ๐ฏ๐ฐ๐ธ๐ญ๐ฆ๐ฅ๐จ๐ฆ ๐ฃ๐ข๐ด๐ฆ๐ด. ๐๐ฉ๐ช๐ด ๐จ๐ข๐ท๐ฆ ๐ถ๐ด ๐ด๐ญ๐ช๐จ๐ฉ๐ต๐ญ๐บ ๐ง๐ฆ๐ธ๐ฆ๐ณ ๐ค๐ฐ๐ฏ๐ง๐ช๐จ๐ถ๐ณ๐ข๐ต๐ช๐ฐ๐ฏ ๐ฐ๐ฑ๐ต๐ช๐ฐ๐ฏ๐ด ๐ต๐ฉ๐ข๐ฏ ๐ข ๐ค๐ถ๐ด๐ต๐ฐ๐ฎ ๐ฃ๐ถ๐ช๐ญ๐ฅ ๐ฆ๐ข๐ณ๐ญ๐ช๐ฆ๐ณ ๐ด๐ฐ๐ญ๐ถ๐ต๐ช๐ฐ๐ฏ ๐ฃ๐ถ๐ต ๐ด๐ฑ๐ฆ๐ฅ ๐ถ๐ฑ ๐ช๐ฏ๐จ๐ฆ๐ด๐ต๐ช๐ฐ๐ฏ ๐ฃ๐บ ๐ด๐ฆ๐ท๐ฆ๐ณ๐ข๐ญ ๐ฎ๐ข๐จ๐ฏ๐ช๐ต๐ถ๐ฅ๐ฆ๐ด ๐ธ๐ฉ๐ช๐ค๐ฉ ๐ข๐ญ๐ญ๐ฐ๐ธ๐ฆ๐ฅ ๐ถ๐ด ๐ต๐ฐ ๐ช๐ต๐ฆ๐ณ๐ข๐ต๐ฆ ๐ฑ๐ณ๐ฆ-๐ฑ๐ณ๐ฐ๐ค๐ฆ๐ด๐ด๐ช๐ฏ๐จ ๐ฎ๐ถ๐ค๐ฉ ๐ง๐ข๐ด๐ต๐ฆ๐ณ.”
How do you choose the foundation model for your Generative AI App โ like your car?
How do you choose the foundation model for your Generative AI App โ like your car?
Just published a new blog post on medium:
๐ How do you choose the right foundation model for your Generative AI app? ๐ก It’s like picking the perfect car!ย ๐ Just like car buyers care more about features, price, and user experience, Generative AI app users prioritize functionality, cost, and ease of use over the specific model under the hood.ย ๐ทโโ๏ธ Building a successful Generative AI app requires a well-architected, cost-effective solution that meets customer needs, not just chasing the latest, most powerful model.ย ๐ง Models are crucial, but they’re just one component. A robust architecture with components like data management, model orchestration, and user interfaces is essential.ย ๐ฐ Cost-effectiveness matters. Choose the “Goldilocks” model that’s good enough for your use case, not overpowered (and overpriced).ย โฑ๏ธ Models evolve rapidly. Your app architecture and processes must allow seamless model updates or replacements to stay competitive.ย ๐งฉ Embrace model composability. Use different models for different tasks within your app for optimal cost and performance.ย ๐ฃ๏ธ Just like cars, Generative AI apps aren’t one-size-fits-all. Tailor your solution to your specific use case and customer needs.ย ๐ Don’t get caught up in the hype. Work backward from customer problems to build fantastic, innovative solutions with Generative AI.ย ๐ Enjoy the ride and have fun building your Generative AI app! It’s an exciting journey ahead.