July 23, 2025

I just signed up for the Software Architecture Superstream: Architecture Patterns and Antipatterns f

I just signed up for the “Software Architecture Superstream: Architecture Patterns and Antipatterns for AI”[1] which is taking place at 12th August CEST late afternoon. The lineup of speakers and topics to be covered sound very interesting. Maybe something for you too?

Glad to listen again to my dear colleague Luca Mezzalira 🙂 as one of the speakers!

[1] https://lnkd.in/ed-TfaMn

#softwarearchitecture #patterns #genai

June 19, 2025

Diving into designing multi-agent systems and got lost with all the different implementation options

Diving into designing multi-agent systems and got lost with all the different implementation options? MCP (x)or A2A?! - Heiko’s and Dr. Sokratis Kartakis (any way to mention you in here just by your first name, mate?) nice article got your back. Highly recommended read! Congrats to both for being published there!

Blog

“UPTIME? We don’t care - we’re a subscription business.”

This sounds wrong in so many aspects. Still it has some truth to it. Let me start with a disclaimer: It’s neither originated from a current or former customer, nor does it reflect how current or prior employers of mine are operating or thinking. It’s just me exaggerating a brainstorming with some folks.

But let’s start with what feels wrong with it? Where to start and where to end? “Shows disregard for customer experience and service reliability” “Undermines customer trust and loyalty” “Could lead to increased customer churn” “May violate terms of service” [..]

Blog

LLMs for the rescue?! Or are we actually building Compound AI Systems?

LLMs rule the world, right?! - Only thing what matters is using the most powerful LLM available and everything falls in place. Looking for numbers - just consult the latest LLM benchmark. Hmm - or do we need to build systems?!

I think it’s not just a matter of choosing an LLM, or any foundation model for that matter, and if you are following me, you already know that. E.g. in my medium post on “How do you choose the foundation model for your Generative AI App — like your car?"[2], I already argued how 1/ LLMs are just one part of your Generative AI application, but the overall application requires so much more components and engineering excellence and 2/ capabilities of frontier models become commodity with a ever increasing pace.

December 12, 2024

📖 Building your own RAG system is like deciding to build your own email server in 2024. Sure, you c

📖 “Building your own RAG system is like deciding to build your own email server in 2024. Sure, you could do it. But why would you want to?” - Alden Do Rosario in his article “Dear IT Departments, Please Stop Trying To Build Your Own RAG” (https://lnkd.in/ep9ZNJzq) on medium. Love it. Highly recommended read.

🎡 Don’t reinvent the wheel! The trap of building something, which on the first glance looks so simple, but then we you get into it you discover layers of hidden complexity.

Blog

What happens in Las Vegas ... Nah - let's have a look. All things (Gen) AI.

What happens in Las Vegas … Nah - let’s have a look. All things (Gen) AI.

At the time of starting this article, the first day of AWS re:Invent is over. AWS re:invent is the cloud computing conference, hosted annually by Amazon Web Services (AWS) in Las Vegas, Nevada. The 13th annual event takes place December 2-6, 2024. This year I’m not on-site, but still curious what it is happening there. Plan for the week is to update this article on a daily basis with the new things announced there. Let’s see how this goes.

Blog

RAG - just a poor engineering workaround?

My week kicked off nicely with some inspiring talks on an internal conference. In one of the talks Johannes Langer dived deep on how to build production-ready RAG systems. I answered his opening questions to the audience - “What is RAG?” - with “𝗞𝗼𝗳𝗳𝗲𝗿𝗸𝗹𝗮𝘂𝘀𝘂𝗿”, which translates to 𝗼𝗽𝗲𝗻 𝗯𝗼𝗼𝗸 𝘁𝗲𝘀𝘁 in my head.

Thinking more about this analogy, I find it is helpful to approach the question if RAG is just a workaround to overcome limitations of our current foundation models or is here to stay, one a more conceptual level. The German wikipedia article on “𝗞𝗼𝗳𝗳𝗲𝗿𝗸𝗹𝗮𝘂𝘀𝘂𝗿” talks about some of the motivations for this kind of test: huge efforts for students on memorising independent facts are eliminated, the test scope can be wider and the test is focussing more on the ability to creatively think and find new solutions approaches. In other words this approach is frugal with students resources and incentives creation of new solutions.

Blog

❓ Successful Building a GenAI use cases just requires the latest and greatest fr

❓ Successful Building a GenAI use cases just requires the latest and greatest frontier model, right?!

🔍 In my conversations with customers I often realize that the choice of the best & shiniest model is highly occupying their resources, while thinking and focusing on architecting and building the use cases which ultimately should fulfil users needs gets very little attention. This naturally fuelled by the fast-pace and loud announcements of new frontier models, which come with superior benchmark results and new capabilities. What we often forget is that the majority of capabilities become commodity among different models very fast. Hence it often makes much more sense to focus on the use cases and building the generative application for the use case in a way it really addresses the user’s need and is able to evolve to utilize new release models where this makes sense. I wrote about this in my blog post (https://lnkd.in/e_YhCinM.)

July 31, 2024

👏 Bagrat Ter-Akopyan, Carmen Heger and team. Congrats and thanks for the nice technical write-up.

👏 Bagrat Ter-Akopyan, Carmen Heger and team. Congrats and thanks for the nice technical write-up. It’s amazing to see how you continue to innovate on behalf of your customers. I love the outcome.

📖 What stands out for me from your technical report is your description of the why and how you moved away from your own, already very good, initial RAG implementation. Citing from the report:

“𝘖𝘯𝘦 𝘰𝘧 𝘰𝘶𝘳 𝘭𝘦𝘢𝘳𝘯𝘪𝘯𝘨𝘴 𝘧𝘳𝘰𝘮 𝘵𝘩𝘦 𝘭𝘢𝘴𝘵 𝘣𝘪𝘨 𝘙𝘈𝘎 𝘱𝘳𝘰𝘫𝘦𝘤𝘵 𝘸𝘢𝘴 𝘵𝘩𝘢𝘵 𝘸𝘦 𝘯𝘦𝘦𝘥𝘦𝘥 𝘵𝘰 𝘴𝘱𝘦𝘦𝘥 𝘶𝘱 𝘰𝘶𝘳 𝘦𝘹𝘱𝘦𝘳𝘪𝘮𝘦𝘯𝘵𝘢𝘵𝘪𝘰𝘯 𝘤𝘺𝘤𝘭𝘦𝘴. 𝘖𝘯 𝘵𝘩𝘦 𝘵𝘦𝘤𝘩𝘯𝘪𝘤𝘢𝘭 𝘴𝘪𝘥𝘦, 𝘵𝘩𝘢𝘵 𝘮𝘦𝘢𝘯𝘵 𝘶𝘴𝘪𝘯𝘨 𝘮𝘰𝘳𝘦 𝘰𝘧𝘧-𝘵𝘩𝘦-𝘴𝘩𝘦𝘭𝘧 𝘴𝘰𝘭𝘶𝘵𝘪𝘰𝘯𝘴 𝘧𝘰𝘳 𝘙𝘈𝘎 𝘢𝘳𝘤𝘩𝘪𝘵𝘦𝘤𝘵𝘶𝘳𝘦𝘴 𝘵𝘩𝘢𝘵 𝘩𝘢𝘷𝘦 𝘣𝘦𝘤𝘰𝘮𝘦 𝘢𝘷𝘢𝘪𝘭𝘢𝘣𝘭𝘦 𝘢𝘯𝘥 𝘳𝘦-𝘶𝘴𝘪𝘯𝘨 𝘰𝘶𝘳 𝘰𝘸𝘯 𝘦𝘹𝘪𝘴𝘵𝘪𝘯𝘨 𝘴𝘰𝘭𝘶𝘵𝘪𝘰𝘯𝘴.” and “𝘛𝘰 𝘮𝘢𝘬𝘦 𝘵𝘩𝘦 𝘥𝘰𝘤𝘶𝘮𝘦𝘯𝘵𝘴 𝘴𝘦𝘢𝘳𝘤𝘩𝘢𝘣𝘭𝘦 𝘧𝘰𝘳 𝘰𝘶𝘳 𝘙𝘈𝘎 𝘴𝘺𝘴𝘵𝘦𝘮, 𝘸𝘦 𝘶𝘴𝘦𝘥 𝘈𝘞𝘚 𝘬𝘯𝘰𝘸𝘭𝘦𝘥𝘨𝘦 𝘣𝘢𝘴𝘦𝘴. 𝘛𝘩𝘪𝘴 𝘨𝘢𝘷𝘦 𝘶𝘴 𝘴𝘭𝘪𝘨𝘩𝘵𝘭𝘺 𝘧𝘦𝘸𝘦𝘳 𝘤𝘰𝘯𝘧𝘪𝘨𝘶𝘳𝘢𝘵𝘪𝘰𝘯 𝘰𝘱𝘵𝘪𝘰𝘯𝘴 𝘵𝘩𝘢𝘯 𝘢 𝘤𝘶𝘴𝘵𝘰𝘮 𝘣𝘶𝘪𝘭𝘥 𝘦𝘢𝘳𝘭𝘪𝘦𝘳 𝘴𝘰𝘭𝘶𝘵𝘪𝘰𝘯 𝘣𝘶𝘵 𝘴𝘱𝘦𝘥 𝘶𝘱 𝘪𝘯𝘨𝘦𝘴𝘵𝘪𝘰𝘯 𝘣𝘺 𝘴𝘦𝘷𝘦𝘳𝘢𝘭 𝘮𝘢𝘨𝘯𝘪𝘵𝘶𝘥𝘦𝘴 𝘸𝘩𝘪𝘤𝘩 𝘢𝘭𝘭𝘰𝘸𝘦𝘥 𝘶𝘴 𝘵𝘰 𝘪𝘵𝘦𝘳𝘢𝘵𝘦 𝘱𝘳𝘦-𝘱𝘳𝘰𝘤𝘦𝘴𝘴𝘪𝘯𝘨 𝘮𝘶𝘤𝘩 𝘧𝘢𝘴𝘵𝘦𝘳.”

June 7, 2024

How do you choose the foundation model for your Generative AI App — like your car?

Just published a new blog post on medium:

😎 How do you choose the right foundation model for your Generative AI app? 💡 It’s like picking the perfect car! 🚗 Just like car buyers care more about features, price, and user experience, Generative AI app users prioritize functionality, cost, and ease of use over the specific model under the hood. 👷‍♂️ Building a successful Generative AI app requires a well-architected, cost-effective solution that meets customer needs, not just chasing the latest, most powerful model. 🧠 Models are crucial, but they’re just one component. A robust architecture with components like data management, model orchestration, and user interfaces is essential. 💰 Cost-effectiveness matters. Choose the “Goldilocks” model that’s good enough for your use case, not overpowered (and overpriced). ⏱️ Models evolve rapidly. Your app architecture and processes must allow seamless model updates or replacements to stay competitive. 🧩 Embrace model composability. Use different models for different tasks within your app for optimal cost and performance. 🛣️ Just like cars, Generative AI apps aren’t one-size-fits-all. Tailor your solution to your specific use case and customer needs. 🚀 Don’t get caught up in the hype. Work backward from customer problems to build fantastic, innovative solutions with Generative AI. 🎉 Enjoy the ride and have fun building your Generative AI app! It’s an exciting journey ahead.