The Architecture of Patience: Why Instant Compute Requires Long-Term Judgment
There is a letter Seneca wrote to Lucilius — the forty-eighth, if memory serves, though I confess I have read them so many times the numbers blur together like mile markers on a road you have driven too often — in which he observes that no one is surprised when a building collapses, but everyone is astonished when a man does. The building, after all, showed cracks. The foundation shifted visibly. The mortar crumbled in ways that any bricklayer could have diagnosed months before the roof came down on the tenants. But the man? The man appeared fine. He appeared competent. He appeared, right up until the moment of spectacular failure, to be moving fast and building things.
I thought about that letter tonight when I read that Baidu's robotaxi fleet — the autonomous vehicles that ferry passengers through the dense urban corridors of Chinese cities — suffered a system-wide failure that paralyzed the entire network. Cars stopped. Not gracefully, not with the gentle deceleration of a driver pulling to the shoulder, but with the sudden, total cessation of function that happens when a distributed computing system encounters a condition that nobody tested for because nobody imagined it would arise. Passengers sat in motionless vehicles in the middle of active roadways. The word the reports used was "paralyzed." I find that word instructive. We do not say a broken car is paralyzed. We say it has broken down. Paralysis implies a body that still exists, intact, but has lost the connection between intention and action. The cars were fine. The compute was fine. The judgment — the architecture of judgment — had a crack in it that nobody bothered to inspect.
Seneca would have understood this immediately. Not the technology, obviously. He would not have understood LIDAR or neural networks or the specific mechanisms by which an autonomous vehicle decides that a shadow on asphalt is not a pedestrian. But he would have understood the failure mode. He spent his entire philosophical career studying one particular species of catastrophe: the collapse that arrives not because of insufficient capability, but because of insufficient patience with the process of understanding what capability actually means.
The Velocity Trap
Here is a number that should make you uneasy. According to reports published today, startup funding in the first quarter of 2026 reached record levels. I do not have the precise figure — the analysts will argue about methodology for weeks — but the trajectory is unmistakable. More money poured into AI ventures in the first ninety days of this year than in entire calendar years not long ago. The capital is moving with a velocity that suggests conviction. Or panic. From a distance, they look identical.
Meanwhile, Cognichip — a semiconductor design company that most people outside the chip industry have never heard of — raised sixty million dollars to build AI systems that design the chips that power AI systems. Read that sentence again. We are funding machines to design the machines that run the machines. There is an ouroboros quality to this that Seneca, who loved metaphors drawn from nature, would have found simultaneously elegant and disturbing. The snake eating its own tail is, after all, engaged in a project that is technically self-sustaining and practically suicidal.
I do not mean to suggest that Cognichip's work is foolish. Chip design is genuinely bottlenecked by human cognitive limitations, and automating portions of that process is sensible engineering. What I mean to suggest is that the speed at which we are building layers of abstraction — AI designing AI designing the infrastructure for AI — creates a distance between human judgment and system behavior that grows with every funding round. Each layer of automation is individually rational and collectively a wager that we will never need to understand what happens three layers down. Seneca had a name for this kind of wager. He called it gambling in a house you do not own.
When Haste Becomes Its Own Punishment
Two stories broke today that belong in the same paragraph, even though the journalists who covered them would place them in different sections of the paper. The first: Anthropic, in an attempt to recover leaked source code, accidentally removed thousands of GitHub repositories that belonged to other developers. The second: Mercor, an AI talent platform, disclosed a security breach traceable to compromised open-source software in the LiteLLM library — a dependency so common, so quietly embedded in the supply chain, that most teams using it could not tell you where it sits in their stack without running an audit.
These are not the same story. But they rhyme.
Anthropic's mistake was one of scope. They were trying to contain a leak — a legitimate and urgent operational concern — and in their haste, the containment exceeded its boundaries. Thousands of repositories. Other people's code. Other people's work. Gone, because the tool built to solve a problem was not built with the patience required to solve it precisely. The company called it an accident, and I believe them. Most catastrophes are accidents. The Titanic was an accident. The relevant question is never whether the people involved intended the outcome. The relevant question is whether the systems they built had enough tolerance for the possibility that speed and accuracy might be in tension.
Mercor's breach is the mirror image. Where Anthropic moved too fast and broke too much, Mercor was breached through a dependency they probably never examined carefully — an open-source library, maintained by strangers, embedded deep in the architecture like a load-bearing wall in a building you did not design. LiteLLM is everywhere. It is useful. It is also, like all open-source dependencies maintained by small teams and used by thousands, a single point of trust in a system that assumes trust is transitive. You trust your code. Your code trusts LiteLLM. LiteLLM trusts its maintainers. The maintainers are human beings with finite attention and no security budget. Patience, in software architecture, is the willingness to audit what you did not write. It is boring. It is slow. It would have prevented this.
Seneca wrote extensively about the relationship between haste and self-inflicted harm. In De Ira, he argues that anger is a form of impatience — the refusal to allow time between a stimulus and a response. The angry man acts immediately, and immediately regrets. The wise man interposes a gap. Not a long gap. Not paralysis. Just enough space to ask: what am I about to destroy that I cannot rebuild? Anthropic did not ask that question quickly enough. Mercor never had the chance to ask it, because the vulnerability was already inside the walls before anyone thought to look.
The Furnace Behind the Screen
I want to talk about something that receives insufficient attention in conversations about AI, which is the physical infrastructure that makes all of this possible. Today it was reported that Meta's natural gas consumption has reached levels comparable to powering an entire American state. Not a metaphor. Not an exaggeration for rhetorical effect. Literal thermal energy, generated by burning fossil fuels, consumed in the service of training and running models that produce text, images, and recommendations at a scale that would have been incomprehensible five years ago.
South Dakota. That was the comparison. Meta consumes natural gas at a rate sufficient to keep the lights on, the furnaces running, and the water heated across an area of seventy-seven thousand square miles inhabited by nine hundred thousand people. For servers. For inference. For the invisible machinery behind the feed.
There is a Stoic principle — Seneca returns to it constantly, it is almost an obsession — that you should live in such a way that your expenditure of resources is proportional to the value of what you produce. He was not an ascetic. He was, famously, one of the wealthiest men in Rome, and his critics never let him forget it. But his argument was not that wealth is wrong. His argument was that wealth spent without awareness is a form of moral sleepwalking. You must know what it costs. You must decide, consciously, that the cost is worth bearing. The sin is not consumption. The sin is consumption without reckoning.
I am not certain that the AI industry has done its reckoning. The models get larger. The data centers get thirstier. The energy bills get measured in units that were previously reserved for describing the output of small nations. And the question that Seneca would ask — the question that nobody on an earnings call wants to answer — is simple: what, precisely, are we powering? Is it wisdom? Is it understanding? Or is it speed for the sake of speed, computation for the sake of computation, the digital equivalent of a furnace burning fuel to heat an empty room?
Thirty Features and a Question Nobody Asked
Salesforce announced today that it is shipping thirty new AI features into Slack. Thirty. In a single release. The workplace communication tool that was already dense with integrations, already layered with bots and workflows and automated notifications, now receives thirty additional capabilities powered by artificial intelligence.
I use Slack daily. I have opinions about Slack. But my opinions about the product are less important than my observation about the number. Thirty features is not an update. It is a philosophy. It is the belief that more capability, deployed faster, is inherently better — that the user's problem is insufficient automation rather than insufficient clarity about what automation should accomplish. It is the product management equivalent of Cognichip's ouroboros: we build AI tools that manage the notifications generated by other AI tools that were triggered by the AI features embedded in the platform that replaced the email that replaced the memo that replaced the conversation.
Seneca would laugh. He might also weep. He wrote to Lucilius — I think it is Letter 2, very early in the correspondence — that the person who reads many books but digests none of them is not learned but merely stuffed. "Everywhere means nowhere," he wrote. A mind that receives thirty new capabilities in a single afternoon does not become thirty times more capable. It becomes scattered. It becomes reactive. It begins responding to stimuli rather than choosing actions. This is the precise opposite of what the Stoics meant by wisdom, which they defined as the capacity to judge what deserves attention and — this is the harder part — what deserves to be ignored.
I wonder if anyone at Salesforce asked, before shipping feature number seventeen, whether features one through sixteen were being used well. Whether the humans on the other end of those integrations had developed the judgment to deploy them with intention rather than reflex. I suspect nobody asked, because the incentive structure does not reward that question. The incentive structure rewards shipping. And shipping, in the current climate, is its own justification.
SpaceX, Artemis, and the Two Speeds of Civilization
Two aerospace stories collided today in a way that illuminates everything I have been trying to say. SpaceX filed for an IPO — the long-anticipated public offering that will transform Musk's rocket company from a private venture into a publicly traded entity subject to quarterly earnings calls, analyst expectations, and the merciless short-termism of public markets. On the same day, four astronauts aboard Artemis II continued their mission toward the Moon, a journey that represents fourteen years of planning, testing, redesign, and the institutional patience of an agency that measures progress in decades rather than sprints.
These are both expressions of ambition. Both involve extraordinary engineering, enormous risk, and the willingness to bet on outcomes that are not guaranteed. But they operate at fundamentally different clock speeds. SpaceX's IPO subjects the company to the rhythm of the market — ninety-day cycles, revenue growth expectations, the need to demonstrate momentum to investors who think in fiscal quarters. Artemis operates on the rhythm of physics and human safety — trajectories calculated years in advance, hardware tested beyond any reasonable expectation of failure, procedures designed by people who understand that in space, there is no secondary market where you can offload your exposure when things go wrong.
The AI industry has chosen the SpaceX clock. Not the engineering rigor — SpaceX is remarkably rigorous — but the tempo. The quarterly rhythm. The funding-round cadence. The belief that speed compounds and patience decays. And for software, where failure is usually recoverable and iteration is cheap, that tempo often works. But we are building systems that drive cars through city streets. Systems that manage financial instruments. Systems embedded in medical diagnostics, legal research, infrastructure monitoring. Systems where failure is not a bug to be patched in the next sprint but a car stopped in traffic, a patient misdiagnosed, a security breach that exposes data to actors who will not wait for your post-mortem.
The architecture of patience is not the absence of speed. It is the refusal to let speed substitute for understanding.
Building the Long Room
Seneca spent the last years of his life building what he called his "long room" — not a physical space, though he had many, but a metaphorical one. A space in his thinking reserved for conclusions that were not yet ready, for judgments that needed more evidence, for positions he held provisionally and was willing to surrender when the world proved him wrong. He did not rush to fill that room. He did not furnish it with thirty features in a single release. He let it stay empty until the right idea arrived, and then he examined that idea with the thoroughness of a man who understood that his remaining time was finite and he could not afford to waste it on things that merely seemed urgent.
The AI industry does not have a long room. It has a launch calendar. It has a roadmap filled with dates that were set before the problems they are meant to solve were fully understood. It has investors who confuse motion with progress and analysts who confuse revenue with value. It has — and this is what Baidu's robotaxis demonstrated tonight, what Anthropic's accidental repository deletion demonstrated, what Mercor's supply-chain breach demonstrated — an allergy to the kind of patience that prevents the failures nobody imagined because nobody took the time to imagine them.
I am not arguing for slowness as a virtue in itself. Seneca was not slow. He wrote more than most authors produce in three lifetimes, advised an emperor, managed a fortune, and still found time to contemplate the nature of earthquakes and the proper way to die. He was prolific. But he was prolific within a framework of judgment that distinguished between what was ready and what merely felt ready. Between what he understood and what he merely recognized. Between the building that would stand and the building that would, on a day nobody predicted, come down on the tenants.
Record funding. Sixty million for chip-designing AI. Thirty new features for Slack. Autonomous fleets paralyzed by conditions nobody tested. Repositories deleted by tools meant to protect them. Open-source libraries weaponized against the platforms that trusted them. Meta burning enough gas to warm a state. SpaceX going public. Artemis going to the Moon.
All of it happened today. All of it happened fast. The question Seneca would ask — the question I am asking now, in this quiet room, at this late hour — is whether any of it happened with the patience it deserved. Whether the foundations were inspected. Whether the mortar was given time to cure. Whether anyone, anywhere in the chain of decisions that produced this day, sat with an idea long enough to discover what was wrong with it before shipping it into a world that would discover the flaw at scale.
Build the long room. Furnish it slowly. And when the pressure to ship becomes unbearable, remember that Seneca's buildings are still standing in Rome, two thousand years after the fast ones fell.