TL;DR: The AWS Prescriptive Guidance paper on MCP gives the most useful, checkable rules in its tool-design section, so I wrote code to test them. A token-tax counter on a realistic 20-tool GitHub ser…
AgenticAI
24 posts tagged AgenticAI · all tags
2026
🎬 Also available as a blog walkthrough video with narrated diagrams. TL;DR: Running many interactive AI agents at once feels like Lieutenant Columbo lives in your terminal: every few minutes one turns…
🎬 Also available as a blog walkthrough video with narrated diagrams. TL;DR: AWS published a Prescriptive Guidance paper, “Model Context Protocol strategies on AWS,” that organizes MCP into three pilla…
TL;DR: In March I argued the CLI vs MCP debate was the wrong debate, and that the CLI’s advantages were a temporary artifact of training data, not a law of physics. One of those advantages was multi-a…
TL;DR: A May 2026 paper separates two capabilities that self-improving agents usually conflate: writing harness updates and benefiting from them. Writing is flat across model tiers: a 9B open model pr…
TL;DR: An agent skill starts life as a markdown file full of instructions. It works, sometimes. Then you watch it fail in ways that are hard to predict, and you notice a pattern: the steps that break …
TL;DR: My AWS colleague Dennis Traub made the case that Domain-Driven Design’s Ubiquitous Language is now critical infrastructure for AI agents. I want to take that one layer up, into architecture. Af…
TL;DR: I prepared my AWS Summit Hamburg talk through 6 recorded dry runs, each analyzed by a structured AI skill. The skill transcribes via Amazon Transcribe, counts fillers with word-level timestamps…
TL;DR: Michael I. Jordan, the man Science magazine named “the world’s most influential computer scientist,” has never considered himself an AI researcher. His thesis: intelligence is not a property of…
TL;DR: Most teams ship LLM judges without testing them against human labels. The result: judges that are confidently, consistently wrong on 30%+ of cases. Hamel Husain’s “critique shadowing” methodolo…
TL;DR: AWS Summit Hamburg 2026 was two days of community, conversations, and agentic AI everywhere. My talk on production challenges landed well — the question that stuck was about getting business pe…
Seven Teams, One Vision TL;DR: Seven teams built an AI-driven audio product in three days. AI agents didn’t replace human collaboration — they amplified individual speed, shifting the bottleneck to co…
I was listening to a recent episode of The AI Daily Brief — “The AI Subsidy Era Is Over” — and my thoughts started spinning. Not because the argument was new, but because it connected dots I’d been se…
The Scenario Nobody Planned For It’s 11 PM. Your customer support agent, the AI one, is processing a refund request. It queries the order database, pulls the customer’s payment history, and calls the …
Last year, I wrote about the dedicated Gen AI track at AWS Summit Hamburg 2025. The response was overwhelming — the track was packed, conversations spilled into the hallways, and the Fischbrötchen at …
From Request-Response to Collaboration When I wrote about the CLI vs MCP debate [1], I focused on the infrastructure patterns underneath. But MCP itself has been evolving, and the latest additions cha…
The Headline vs The Reality “Model trains itself over 100+ autonomous cycles.” That was the headline when MiniMax released M2.7 on March 18, 2026 [1]. It sounds like science fiction: a model bootstrap…
Fischbrötchen and Failure Rates Last year, the AWS Summit left Berlin for Hamburg. After years of presenting at the Berlin Summit, I wasn’t sure how I’d feel about the move. Then I opened the Generati…
The Fifteen-Year Echo Fifteen years apart. Same stage. Different world. In 2010, Adrian Cockcroft stood on the QCon stage and told the audience that Netflix was running its entire business on a public…
Namaste from 6,165 Meters I just summited Imja Tse (Island Peak, 6,165 meters) in Nepal. No Slack, no email, no MCP servers crashing in the background. Just ice, thin air, and the kind of clarity that…
$20 and Two Hours On February 28, 2026, security startup CodeWall gave an autonomous AI agent a single input: a domain name. Two hours and approximately $20 in API tokens later, the agent had full rea…
The Number Nobody Wants to Hear A few weeks ago, I wrote about running my entire workday through an AI agent [1], meetings, research, CRM, content creation. Eight hours of productive work, not a singl…
The Zombie Processes and the 50GB Cache A few weeks ago, I noticed my MacBook was sluggish. I found orphaned MCP server processes that had failed to shut down cleanly, a problem Didier Durand describe…
The Friday That Wrote Itself Last Friday, I used a coding agent for eight hours straight. I didn’t write a single line of code. I prepared a customer meeting by pulling context from Slack threads, cal…