Architecture

44 posts tagged Architecture · all tags

2026

CLI vs MCP, Part Two: The First Gap Just Closed

TL;DR: In March I argued the CLI vs MCP debate was the wrong debate, and that the CLI’s advantages were a temporary artifact of training data, not a law of physics. One of those advantages was multi-a…

Context Engineering: The Skill That Replaced Prompt Engineering

TL;DR: Patrick Debois coined DevOps in 2009 by naming what practitioners were already doing. In 2026, he’s doing it again with “Context Engineering” and the CDLC (Context Development Lifecycle): Gener…

Cognitive Debt: The Hidden Cost of AI-Generated Code

The Code Nobody Understands. Here’s a pattern I’ve seen across multiple teams: a data pipeline ships, built almost entirely by an AI coding agent. Clean architecture. Full test coverage. Passes every r…

8 AWS Guides for Agentic AI - Mapped to the 4 Pillars That Get You to Production

The Gap Between Demo and Deployment. TL;DR: AWS released 8 prescriptive guides for building production-ready agentic AI. This post maps each guide to the four pillars that get agents from demo to deplo…

Intelligence Is About Time, Not Parameters

The Question Every SA Gets. Beyond a complexity threshold, larger models become less insightful: the savant regime. “Which model should I use?” I hear it in almost every customer conversation about ge…

What Reasoning Actually Means (and Why It Matters for Your Architecture)

It Started with a Saturday Morning Experiment. I recently ran a simple test. I asked a small language model the same questions three times, with zero, one, and three rounds of self-reflection, and publ…

Is the AI Subsidy Era Ending? And Why That Might Be a Good Thing

I was listening to a recent episode of The AI Daily Brief, “The AI Subsidy Era Is Over”, and my thoughts started spinning. Not because the argument was new, but because it connected dots I’d been se…

The Agent Security Stack Nobody Is Building

The Scenario Nobody Planned For. It’s 11 PM. Your customer support agent, the AI one, is processing a refund request. It queries the order database, pulls the customer’s payment history, and calls the …

Software Fundamentals Matter More Than Ever

The Talk That Confirmed What I’ve Been Seeing. Matt Pocock stood on stage at the AI Engineer Summit and said something that most of the audience needed to hear: the developers who succeed with AI codin…

MCP Sampling & Elicitation: When Servers Talk Back

From Request-Response to Collaboration. When I wrote about the CLI vs MCP debate [1], I focused on the infrastructure patterns underneath. But MCP itself has been evolving, and the latest additions cha…

Nvidia's Real Moat: What Jensen Huang Told Dwarkesh Patel

Electrons In, Tokens Out. Long weekend drive, sunny weather, and nearly two hours of Jensen Huang arguing with Dwarkesh Patel about whether Nvidia’s moat will hold. As far as podcast entertainment goes…

Self-Improving Models: What MiniMax M2.7 Actually Does

The Headline vs The Reality. “Model trains itself over 100+ autonomous cycles.” That was the headline when MiniMax released M2.7 on March 18, 2026 [1]. It sounds like science fiction: a model bootstrap…

The Citation Crisis: What AI Hallucinations Mean for Your Enterprise

The Reference I Almost Didn’t Check. A few days ago, I was reviewing an article my AI agent had drafted. The sources section looked clean: numbered references, proper formatting, plausible titles. One …

From Cloud-Native to AI-Native: What Actually Changes

The Fifteen-Year Echo. Fifteen years apart. Same stage. Different world. In 2010, Adrian Cockcroft stood on the QCon stage and told the audience that Netflix was running its entire business on a public…

The Protocol We Should Have Built for Humans

Namaste from 6,165 Meters. I just summited Imja Tse (Island Peak, 6,165 meters) in Nepal. No Slack, no email, no MCP servers crashing in the background. Just ice, thin air, and the kind of clarity that…

Is RAG Still Needed with 1M+ Token Context Windows?

The Kofferklausur, Revisited. In September 2024, a colleague asked an audience: “What is RAG?” I answered: Kofferklausur [1]. For non-German speakers: a Kofferklausur is an open-book exam. You bring yo…

LLMs Don't Do Math - They Predict What Math Looks Like

The Invisible Error. To test this, I designed five calculations that anyone in business might ask an AI assistant, the kind of questions you’d type into ChatGPT or Claude expecting a quick, reliable an…

Your AI Models Have an Expiry Date - A Practical Guide to Model Lifecycle Management

Introduction: The Promise I Made. In my previous article [1], I explored the maintenance trap in IT: how software systems are more like plants than stones, requiring constant care. I ended with a cli…

Most comprehensive overview on RAG I have seen. We came a long way from vanilla RAG. Still remember

Most comprehensive overview on RAG I have seen. We came a long way from vanilla RAG. Still remember the time of arguments that RAG is just a “hot fix” to be obsolete soon. Reality is it is not a fix b…

IT System Maintenance in the age of AI

IT System Maintenance in the age of AI. Introduction - The Maintenance Trap in IT. You don’t need to be in the IT industry for long to have witnessed this firsthand. Even non-IT users do. Those systems …

🔧 The Maintenance Trap: Why Your IT Systems Are More Like Plants Than Stones

🔧 The Maintenance Trap: Why Your IT Systems Are More Like Plants Than Stones. After years of watching organizations struggle with outdated systems, I’ve written about a pattern we all know too well: the…

🎯 “How do we pick the RIGHT AI agent use case?”

🎯 “How do we pick the RIGHT AI agent use case?” This is the question I hear most from customers exploring agentic AI. Here’s the mechanism I run through together with the customer: The 4-Quadrant Evalu…

2025

๐—ฃ๐—ฒ๐—ฟ๐—ณ๐—ฒ๐—ฐ๐˜ ๐˜๐—ถ๐—บ๐—ถ๐—ป๐—ด ๐—ณ๐—ผ๐—ฟ ๐Ÿฎ๐Ÿฌ๐Ÿฎ๐Ÿฒ ๐—ฐ๐—ฎ๐—ฟ๐—ฒ๐—ฒ๐—ฟ ๐—ฝ๐—น๐—ฎ๐—ป๐—ป๐—ถ๐—ป๐—ด! ๐ŸŽฏ

๐—ฃ๐—ฒ๐—ฟ๐—ณ๐—ฒ๐—ฐ๐˜ ๐˜๐—ถ๐—บ๐—ถ๐—ป๐—ด ๐—ณ๐—ผ๐—ฟ ๐Ÿฎ๐Ÿฌ๐Ÿฎ๐Ÿฒ ๐—ฐ๐—ฎ๐—ฟ๐—ฒ๐—ฒ๐—ฟ ๐—ฝ๐—น๐—ฎ๐—ป๐—ป๐—ถ๐—ป๐—ด! ๐ŸŽฏ I just dived deep into the book โ€œSolutions Architect Interview: Winning strategies and effective tactics for interview successโ€ by Saurabh Shrivastava, Neelaโ€ฆ

🤔 What is your AI Strategy? Chasing single point solutions or exploring system-level AI solutions? Yesterday I had the pleasure of listening to a great presentation by Chris Nosko. Among other importa…

🎯 Deep dive into Cell-Based Architectures

🎯 Deep dive into Cell-Based Architectures. Last week I attended an outstanding presentation by my colleague Robert Himmelmann on “Cell-based Architectures” – one of the most insightful deep-dives I’ve …

๐—š๐—ฒ๐—ป๐—ฒ๐—ฟ๐—ฎ๐˜๐—ถ๐˜ƒ๐—ฒ ๐—”๐—œ โ€“ ๐—ง๐—ต๐—ฒ ๐—ฃ๐—ฒ๐—ป๐—ฑ๐˜‚๐—น๐˜‚๐—บ ๐—ž๐—ฒ๐—ฒ๐—ฝ๐˜€ ๐—ฆ๐˜„๐—ถ๐—ป๐—ด๐—ถ๐—ป๐—ด?!

๐—š๐—ฒ๐—ป๐—ฒ๐—ฟ๐—ฎ๐˜๐—ถ๐˜ƒ๐—ฒ ๐—”๐—œ โ€“ ๐—ง๐—ต๐—ฒ ๐—ฃ๐—ฒ๐—ป๐—ฑ๐˜‚๐—น๐˜‚๐—บ ๐—ž๐—ฒ๐—ฒ๐—ฝ๐˜€ ๐—ฆ๐˜„๐—ถ๐—ป๐—ด๐—ถ๐—ป๐—ด?! Enterprises have long followed a familiar rhythm. A major consulting firm arrives, declares centralization the new path to efficiency; a few years later, โ€ฆ

If you don't have the data available, implementing an AI use case becomes a data

If you don’t have the data available, implementing an AI use case becomes a data gathering death march, often crossing organizational boundaries. Instead of spending 80% of the project time on buildin…

I just signed up for the Software Architecture Superstream: Architecture Patterns and Antipatterns f

I just signed up for the “Software Architecture Superstream: Architecture Patterns and Antipatterns for AI” [1], which takes place on 12 August in the late afternoon (CEST). The lineup of speakers and topi…

Diving into designing multi-agent systems and got lost with all the different implementation options

Diving into designing multi-agent systems and got lost with all the different implementation options? MCP (x)or A2A?! - Heiko’s and Dr. Sokratis Kartakis (any way to mention you in here just by your f…

LLMs for the rescue?! Or are we actually building Compound AI Systems?

LLMs for the rescue?! Or are we actually building Compound AI Systems? LLMs rule the world, right?! - The only thing that matters is using the most powerful LLM available, and everything falls into place. Lo…

2024

📖 Building your own RAG system is like deciding to build your own email server in 2024. Sure, you c

📖 “Building your own RAG system is like deciding to build your own email server in 2024. Sure, you could do it. But why would you want to?” - Alden Do Rosario in his article “Dear IT Departments, Plea…

What happens in Las Vegas ... Nah - let's have a look. All things (Gen) AI.

RAG - just a poor engineering workaround?

❓ Successfully building a GenAI use case just requires the latest and greatest fr

❓ Successfully building a GenAI use case just requires the latest and greatest frontier model, right?! 🔍 In my conversations with customers I often realize that the choice of the best & shiniest model …

👏 Bagrat Ter-Akopyan, Carmen Heger and team. Congrats and thanks for the nice technical write-up.

👏 Bagrat Ter-Akopyan, Carmen Heger and team. Congrats and thanks for the nice technical write-up. It’s amazing to see how you continue to innovate on behalf of your customers. I love the outcome. 📖 Wh…

How do you choose the foundation model for your Generative AI App - like your car?

How do you choose the foundation model for your Generative AI App - like your car? Just published a new blog post on Medium: 😎 How do you choose the right foundation model for your Generative AI app? …

Travelling into a long weekend and looking back to this week’s family reunion ev

Travelling into a long weekend and looking back to this week’s family reunion event aka #AWSSummit Berlin. To be honest, I’m a little exhausted but energised at the same time after meeting so many cus…

2023

Just in time for Santa, great book about the role of a Solution Architect (SA) in the future.

Just in time for Santa, great book about the role of a Solution Architect (SA) in the future. GenAI is evolving many roles - the SA role is no exception. Very interesting read, which a lot of in…

2022

Contextual targeting is gaining relevance in a post-3rd party cookie era as a means for publishers

Contextual targeting is gaining relevance in a post-3rd party cookie era as a means for publishers to monetise content and for advertisers to reach their audiences. The progress in ML and the availab…

2021

[German] Is your workload well-architected? Curious to understand the benefits of an architecture re

[German] Is your workload well-architected? Curious to understand the benefits of an architecture review based on best practices and to figure out how to successfully execute reviews? Tufan Özduman an…

Curious how the Twelve-Factor App methodology can be mapped to #InfrastructureAsCode on #AWS? My col

Curious how the Twelve-Factor App methodology can be mapped to #InfrastructureAsCode on #AWS? My colleague Wladi ☁️ Mitzel and I are. Give it a read - heads-up: potential extra challenge: article is i…

2020

Great work, Benedikt Stemmildt and team. Pleasure to work with you guys: Migrating the entire

Great work, Benedikt Stemmildt and team. Pleasure to work with you guys: “Migrating the entire online platform to the AWS Cloud enabled the company to continue scaling its business and…

2019

Very good read. While I don't like the term lock-in cost, it's really a cost of change, this is a ve

Very good read. While I don’t like the term lock-in cost, it’s really a cost of change, this is a very valid point of view: Don’t look at migration cost in isolation but consider opportunity gain at t…
