<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>schristoph.online</title><link>https://schristoph.online/tags/inferenceoptimization/</link><description>Personal homepage and blog of Stefan Christoph</description><generator>Hugo -- gohugo.io</generator><language>en-us</language><copyright>Stefan Christoph. All rights reserved.</copyright><lastBuildDate>Sat, 21 Mar 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://schristoph.online/tags/inferenceoptimization/index.xml" rel="self" type="application/rss+xml"/><item><title>When Thinking Twice Helps — And When It Doesn't</title><link>https://schristoph.online/blog/when-thinking-twice-helps/?utm=rss-feed</link><pubDate>Sat, 21 Mar 2026 00:00:00 +0000</pubDate><guid>https://schristoph.online/blog/when-thinking-twice-helps/</guid><description>&lt;h2 id="the-saturday-morning-experiment">The Saturday Morning Experiment&lt;/h2>
&lt;p>Last Saturday, I installed a Python library, pointed it at Amazon Bedrock, and asked a model the same questions three times, with zero, one, and three rounds of self-reflection.&lt;/p>
&lt;p>The results surprised me.&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-fallback" data-lang="fallback">&lt;span class="line">&lt;span class="cl"> Q Refl Time Acc Comp Nuan Total
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> 1 0 3.0s 4 3 3 10
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> 1 1 5.5s 4 2 3 9
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> 1 3 8.8s 4 3 4 11
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> 2 0 2.6s 4 2 2 8
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> 2 1 5.7s 4 2 2 8
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> 2 3 8.5s 4 2 2 8
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> 3 0 3.1s 1 1 1 3
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> 3 1 5.2s 1 1 1 3
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> 3 3 8.6s 1 1 1 3
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Q is the question number, Refl the number of self-reflection rounds (0 = straight answer, 1 = one revision, 3 = three revisions). Acc, Comp, and Nuan are the judge&amp;rsquo;s scores for Accuracy, Completeness, and Nuance, each on a 1-5 scale, 15 max total.&lt;/p></description></item></channel></rss>