The Art of Loop Engineering Artwork

Intellectually Curious

Intellectually Curious is a podcast by Mike Breault featuring over 1,800 AI-powered explorations across science, mathematics, philosophy, and personal growth. Each short-form episode is generated, refined, and published with the help of large language models—turning curiosity into an ongoing audio encyclopedia. Designed for anyone who loves learning, it offers quick dives into everything from combinatorics and cryptography to systems thinking and psychology.

Inspiration for this podcast:

"Muad'Dib learned rapidly because his first training was in how to learn. And the first lesson of all was the basic trust that he could learn. It's shocking to find how many people do not believe they can learn, and how many more believe learning to be difficult. Muad'Dib knew that every experience carries its lesson."

― Frank Herbert, Dune

Note: These podcasts were made with NotebookLM. AI can make mistakes. Please double-check any critical information.

Show More

Intellectually Curious

The Art of Loop Engineering

June 18, 2026 • Mike Breault

0:00 | 6:00

We unpack Sydney Runkle’s loop engineering framework—a masterclass in turning a basic AI agent into a robust, autonomous system. From verification-driven loops (automated graders) and event-driven execution to a hill-climbing autonomous QA loop that rewrites its own prompts after each failure, this episode explains how to design feedback-rich environments where humans stay in the strategic driver’s seat while agents handle execution and self-improvement.

Note: This podcast was AI-generated, and sometimes AI can make mistakes. Please double-check any critical information.

Sponsored by Embersilk LLC

SPEAKER_01 0:00

So last week, I uh I tried wiring up a smart plug and a basic script to automate my morning coffee.

SPEAKER_00 0:06

Oh, I can already see where this is going.

SPEAKER_01 0:08

Right. First day, flawless espresso. Second day, well, I had left the mug like maybe half an inch off center the night before. Oh no. Yeah. The script ran perfectly, just dutifully pouring 200 degrees of boiling water directly onto my bare countertop.

SPEAKER_00 0:24

That is wow, that's a rough morning.

SPEAKER_01 0:27

It really was. I mean, the execution was flawless, but it lacked the absolute most critical component, which is a feedback loop to, you know, actually realize the mug wasn't there. Trevor Burrus, Jr.

SPEAKER_00 0:37

Which is the exact wall developers hit when they first start building AI agents.

SPEAKER_01 0:41

Exactly.

SPEAKER_00 0:41

You give an LLM API access and it just executes blindly. It takes action, sure, but it has zero self-awareness. Trevor Burrus, Jr.

SPEAKER_01 0:48

Right. And that paradigm shift, moving from blindly prompting an AI to actually engineering an environment where it catches its own errors is the mission of today's deep dive.

SPEAKER_00 0:57

Aaron Powell It's such an optimistic space to be in right now.

SPEAKER_01 1:00

It really is. We're unpacking Sidney Runkel's piece on the art of loop engineering. For you listening, this is basically a masterclass in loopcraft.

SPEAKER_00 1:08

Building dynamic, self-improving systems.

SPEAKER_01 1:11

Exactly. Systems that actually help humanity progress.

SPEAKER_00 1:14

Yeah.

SPEAKER_01 1:14

But uh before we get into the architecture, a quick shout out. If you are inspired to build these kinds of systems yourself, you should really check out Embrasilk.

SPEAKER_00 1:23

They do great work.

SPEAKER_01 1:24

They really do. Whether you need help with AI training, automation, integration, or you know, software development, they are absolute pros at uncovering exactly where agents can make the biggest impact for your business or personal life. Check them out at embrasilk.com.

SPEAKER_00 1:39

So getting back to the baseline, you build the standard agent setup, right? Yeah. Loop one, you give an AI a system prompt and access to external tools so it can act until a goal is met.

SPEAKER_01 1:51

Aaron Powell Like handing a chef a kitchen and a recipe?

SPEAKER_00 1:53

Precisely. But as your flooded kitchen proved, just taking action is dangerous. The first thing you realize in production is that models hallucinate or they misuse tools.

SPEAKER_01 2:03

You can't just let an agent run wild on a live database.

SPEAKER_00 2:06

You absolutely cannot. So the immediate architectural necessity becomes a verification layer. Loop two.

SPEAKER_01 2:13

Right.

SPEAKER_00 2:14

You essentially build an automated grader-like, another model that evaluates the agent's output against a strict rubric before returning it.

SPEAKER_01 2:22

Aaron Powell Okay, I'll bite because I see this debate like all the time. Doesn't piping every single action through an LLM grader add massive latency.

SPEAKER_00 2:33

Oh, totally.

SPEAKER_01 2:34

And cost for that matter. I mean, if we're waiting an extra five seconds for the AI to double-check its own math, doesn't that kind of defeat the instant appeal of the technology?

SPEAKER_00 2:42

It absolutely adds latency, but you have to ask yourself what you're actually building here. Okay. If you want a fast parlor trick to generate a poem, sure, skip the verification. But if you want an agent you can trust to reconcile your company's financial ledgers or rewrite live code.

SPEAKER_01 2:57

Aaron Powell You'll happily wait that extra five seconds.

SPEAKER_00 2:59

Exactly. For guaranteed correctness, the trade-off is entirely worth it. It's about real-world reliability.

SPEAKER_01 3:04

Aaron Ross Powell Accuracy over speed. That makes sense. But um once we have an accurate verified system, we still have a bottleneck, which is me.

SPEAKER_00 3:12

The manual trigger.

SPEAKER_01 3:13

All right, I still have to hit go.

SPEAKER_00 3:14

Aaron Powell And true automation means taking the human out of the trigger mechanism entirely. That's loop three. Moving to an event-driven architecture.

SPEAKER_01 3:23

Aaron Ross Powell So wiring it up to the ecosystem.

SPEAKER_00 3:25

Aaron Powell Yeah. You wire the agent directly in so it listens for specific state changes. Like an updated JIRA ticket, a webhook from Stripe, whatever it might be.

SPEAKER_01 3:34

Aaron Ross Powell So it's just running continuously in the background.

SPEAKER_00 3:37

Exactly. Executing those verified workflows at scale.

SPEAKER_01 3:40

Aaron Ross Powell Okay. So we've got an accurate system and it's running autonomously. But wait, what happens when the underlying environment changes?

SPEAKER_00 3:49

Ah.

SPEAKER_01 3:49

Because if it's just running the same prompts forever, it's eventually got a break when an API updates or like the data format shifts.

SPEAKER_00 3:57

That is the most fascinating part of Runkel's framework. Loop four. You don't just leave it static, you build what's called a hill climbing loop.

SPEAKER_01 4:05

A shill climbing loop?

SPEAKER_00 4:06

Yeah. You essentially deploy an autonomous QA team. Every time the agent runs, it generates an execution trace.

SPEAKER_01 4:12

Aaron Powell Which is like a step-by-step audit log of its own logic.

SPEAKER_00 4:15

Exactly.

SPEAKER_01 4:16

Wait, so it's parsing its own failed traces, figuring out where it went off the rails, and tweaking its own system instructions.

SPEAKER_00 4:23

Precisely. To avoid that specific hour next time, it's an optimization engine.

SPEAKER_01 4:29

That is wild.

SPEAKER_00 4:30

The analysis agent looks at the failures, isolates the prompt weakness, and literally rewrites the primary agent's configuration for the next run.

SPEAKER_01 4:39

Aaron Powell Like a genetic algorithm for prompt engineering.

SPEAKER_00 4:42

Yes. The system naturally evolves and just gets more robust with every single failure.

SPEAKER_01 4:47

Aaron Powell That is a massive paradigm shift. I mean, it's not just a tool anymore, it's an environment that manages and improves itself.

SPEAKER_00 4:53

It really is.

SPEAKER_01 4:54

But you know, with that level of self-optimization, where do we fit in?

SPEAKER_00 4:58

Well, it's actually an incredibly optimistic reality for us. We aren't being replaced, we're being elevated.

SPEAKER_01 5:03

I love that. Elevated how.

SPEAKER_00 5:04

When the AI handles all the heavy lifting of execution and self-correction, we remain fundamentally in the loop for what matters most.

SPEAKER_01 5:11

The strategy.

SPEAKER_00 5:12

Right. Providing strategic oversight, exercising human taste, and approving highly sensitive actions. We get to direct the orchestra while the agents tune the instruments.

SPEAKER_01 5:22

Directing the orchestra, a true partnership. I love that so much. Well, I want to leave you listening with this thought. If you could design a hill climbing loop to automatically analyze the traces of your daily habits and optimize just one of them over time.

SPEAKER_00 5:37

Oh, that's a great question.

SPEAKER_01 5:39

What would you choose to continuously improve? Just something to mull over.

SPEAKER_00 5:43

That is a fantastic puzzle to think about.

SPEAKER_01 5:46

If you enjoyed this deep dive, please subscribe to the show. Hey, leave us a five star review if you can. It really does help get the word out. Thanks for tuning in.

SPEAKER_00 5:53

Thanks everyone.

SPEAKER_01 5:54

And tomorrow morning, if you're automating your coffee, just uh make sure you've engineered a feedback loop to check if the mug is actually there.