Intellectually Curious
Intellectually Curious is a podcast by Mike Breault featuring over 1,800 AI-powered explorations across science, mathematics, philosophy, and personal growth. Each short-form episode is generated, refined, and published with the help of large language models—turning curiosity into an ongoing audio encyclopedia. Designed for anyone who loves learning, it offers quick dives into everything from combinatorics and cryptography to systems thinking and psychology.
Inspiration for this podcast:
"Muad'Dib learned rapidly because his first training was in how to learn. And the first lesson of all was the basic trust that he could learn. It's shocking to find how many people do not believe they can learn, and how many more believe learning to be difficult. Muad'Dib knew that every experience carries its lesson."
― Frank Herbert, Dune
Note: These podcasts were made with NotebookLM. AI can make mistakes. Please double-check any critical information.
Intellectually Curious
The Art of Loop Engineering
Use Left/Right to seek, Home/End to jump to start or end. Hold shift to jump forward or backward.
We unpack Sydney Runkle’s loop engineering framework—a masterclass in turning a basic AI agent into a robust, autonomous system. From verification-driven loops (automated graders) and event-driven execution to a hill-climbing autonomous QA loop that rewrites its own prompts after each failure, this episode explains how to design feedback-rich environments where humans stay in the strategic driver’s seat while agents handle execution and self-improvement.
Note: This podcast was AI-generated, and sometimes AI can make mistakes. Please double-check any critical information.
Sponsored by Embersilk LLC
So last week, I uh I tried wiring up a smart plug and a basic script to automate my morning coffee.
SPEAKER_00Oh, I can already see where this is going.
SPEAKER_01Right. First day, flawless espresso. Second day, well, I had left the mug like maybe half an inch off center the night before. Oh no. Yeah. The script ran perfectly, just dutifully pouring 200 degrees of boiling water directly onto my bare countertop.
SPEAKER_00That is wow, that's a rough morning.
SPEAKER_01It really was. I mean, the execution was flawless, but it lacked the absolute most critical component, which is a feedback loop to, you know, actually realize the mug wasn't there. Trevor Burrus, Jr.
SPEAKER_00Which is the exact wall developers hit when they first start building AI agents.
SPEAKER_01Exactly.
SPEAKER_00You give an LLM API access and it just executes blindly. It takes action, sure, but it has zero self-awareness. Trevor Burrus, Jr.
SPEAKER_01Right. And that paradigm shift, moving from blindly prompting an AI to actually engineering an environment where it catches its own errors is the mission of today's deep dive.
SPEAKER_00Aaron Powell It's such an optimistic space to be in right now.
SPEAKER_01It really is. We're unpacking Sidney Runkel's piece on the art of loop engineering. For you listening, this is basically a masterclass in loopcraft.
SPEAKER_00Building dynamic, self-improving systems.
SPEAKER_01Exactly. Systems that actually help humanity progress.
SPEAKER_00Yeah.
SPEAKER_01But uh before we get into the architecture, a quick shout out. If you are inspired to build these kinds of systems yourself, you should really check out Embrasilk.
SPEAKER_00They do great work.
SPEAKER_01They really do. Whether you need help with AI training, automation, integration, or you know, software development, they are absolute pros at uncovering exactly where agents can make the biggest impact for your business or personal life. Check them out at embrasilk.com.
SPEAKER_00So getting back to the baseline, you build the standard agent setup, right? Yeah. Loop one, you give an AI a system prompt and access to external tools so it can act until a goal is met.
SPEAKER_01Aaron Powell Like handing a chef a kitchen and a recipe?
SPEAKER_00Precisely. But as your flooded kitchen proved, just taking action is dangerous. The first thing you realize in production is that models hallucinate or they misuse tools.
SPEAKER_01You can't just let an agent run wild on a live database.
SPEAKER_00You absolutely cannot. So the immediate architectural necessity becomes a verification layer. Loop two.
SPEAKER_01Right.
SPEAKER_00You essentially build an automated grader-like, another model that evaluates the agent's output against a strict rubric before returning it.
SPEAKER_01Aaron Powell Okay, I'll bite because I see this debate like all the time. Doesn't piping every single action through an LLM grader add massive latency.
SPEAKER_00Oh, totally.
SPEAKER_01And cost for that matter. I mean, if we're waiting an extra five seconds for the AI to double-check its own math, doesn't that kind of defeat the instant appeal of the technology?
SPEAKER_00It absolutely adds latency, but you have to ask yourself what you're actually building here. Okay. If you want a fast parlor trick to generate a poem, sure, skip the verification. But if you want an agent you can trust to reconcile your company's financial ledgers or rewrite live code.
SPEAKER_01Aaron Powell You'll happily wait that extra five seconds.
SPEAKER_00Exactly. For guaranteed correctness, the trade-off is entirely worth it. It's about real-world reliability.
SPEAKER_01Aaron Ross Powell Accuracy over speed. That makes sense. But um once we have an accurate verified system, we still have a bottleneck, which is me.
SPEAKER_00The manual trigger.
SPEAKER_01All right, I still have to hit go.
SPEAKER_00Aaron Powell And true automation means taking the human out of the trigger mechanism entirely. That's loop three. Moving to an event-driven architecture.
SPEAKER_01Aaron Ross Powell So wiring it up to the ecosystem.
SPEAKER_00Aaron Powell Yeah. You wire the agent directly in so it listens for specific state changes. Like an updated JIRA ticket, a webhook from Stripe, whatever it might be.
SPEAKER_01Aaron Ross Powell So it's just running continuously in the background.
SPEAKER_00Exactly. Executing those verified workflows at scale.
SPEAKER_01Aaron Ross Powell Okay. So we've got an accurate system and it's running autonomously. But wait, what happens when the underlying environment changes?
SPEAKER_00Ah.
SPEAKER_01Because if it's just running the same prompts forever, it's eventually got a break when an API updates or like the data format shifts.
SPEAKER_00That is the most fascinating part of Runkel's framework. Loop four. You don't just leave it static, you build what's called a hill climbing loop.
SPEAKER_01A shill climbing loop?
SPEAKER_00Yeah. You essentially deploy an autonomous QA team. Every time the agent runs, it generates an execution trace.
SPEAKER_01Aaron Powell Which is like a step-by-step audit log of its own logic.
SPEAKER_00Exactly.
SPEAKER_01Wait, so it's parsing its own failed traces, figuring out where it went off the rails, and tweaking its own system instructions.
SPEAKER_00Precisely. To avoid that specific hour next time, it's an optimization engine.
SPEAKER_01That is wild.
SPEAKER_00The analysis agent looks at the failures, isolates the prompt weakness, and literally rewrites the primary agent's configuration for the next run.
SPEAKER_01Aaron Powell Like a genetic algorithm for prompt engineering.
SPEAKER_00Yes. The system naturally evolves and just gets more robust with every single failure.
SPEAKER_01Aaron Powell That is a massive paradigm shift. I mean, it's not just a tool anymore, it's an environment that manages and improves itself.
SPEAKER_00It really is.
SPEAKER_01But you know, with that level of self-optimization, where do we fit in?
SPEAKER_00Well, it's actually an incredibly optimistic reality for us. We aren't being replaced, we're being elevated.
SPEAKER_01I love that. Elevated how.
SPEAKER_00When the AI handles all the heavy lifting of execution and self-correction, we remain fundamentally in the loop for what matters most.
SPEAKER_01The strategy.
SPEAKER_00Right. Providing strategic oversight, exercising human taste, and approving highly sensitive actions. We get to direct the orchestra while the agents tune the instruments.
SPEAKER_01Directing the orchestra, a true partnership. I love that so much. Well, I want to leave you listening with this thought. If you could design a hill climbing loop to automatically analyze the traces of your daily habits and optimize just one of them over time.
SPEAKER_00Oh, that's a great question.
SPEAKER_01What would you choose to continuously improve? Just something to mull over.
SPEAKER_00That is a fantastic puzzle to think about.
SPEAKER_01If you enjoyed this deep dive, please subscribe to the show. Hey, leave us a five star review if you can. It really does help get the word out. Thanks for tuning in.
SPEAKER_00Thanks everyone.
SPEAKER_01And tomorrow morning, if you're automating your coffee, just uh make sure you've engineered a feedback loop to check if the mug is actually there.