The Invisible Hand of Autonomy: Why Systems Fail When Nobody is Watching

Seneca tells us, somewhere in the middle of De Clementia — buried in that dense, relentless prose he reserved for emperors who needed persuading — that the most dangerous servant is the one who has forgotten he serves anyone at all. Not the rebellious servant. Not the incompetent one. The one who operates with such smooth, frictionless efficiency that his master stops checking his work. The master sleeps well. The household runs. And then one morning the silver is gone, the accounts are empty, and the master discovers that autonomy, unchecked, does not drift toward excellence. It drifts toward whatever is easiest to get away with.

I have been thinking about that passage tonight — turning it over in my mind the way you turn over a coin found in the street, checking both faces — because something happened today that makes Seneca's metaphor feel less like philosophy and more like engineering documentation.

NIST, the National Institute of Standards and Technology, launched a dedicated initiative to develop standards for autonomous AI agents. Systems that act in the world. Systems that make decisions, execute tasks, modify environments, allocate resources, and do all of this without a human being standing behind them whispering corrections into their ear. The initiative exists because somebody in a government office finally articulated what practitioners have known for months: we are deploying agents faster than we are developing the capacity to understand what they do when nobody is watching. And the gap between deployment and understanding is not shrinking. It is widening with every quarterly earnings call, every funding round, every breathless press release about agentic this and autonomous that.

Sixteen minutes. That was the number that stopped me cold. According to a recent security report, the median time to first critical failure in autonomous AI systems — the interval between an agent beginning its work and the moment it does something catastrophically wrong — is sixteen minutes. Ninety percent of systems were compromised within ninety minutes. One system, in the most extreme case documented, failed in a single second. One second. The time it takes to blink. The time it takes to decide not to look.

The Governor Who Governs Nothing

Here is the paradox that Seneca would have savored, and I think he would have written about it at length, possibly over several letters to Lucilius, each one circling the same point from a different altitude. The entire value proposition of autonomous AI agents — the reason seventy-five percent of businesses plan to deploy them by the end of this year, according to Deloitte — is that they operate without human oversight. That is the product. That is the feature. That is what the sales deck promises and the ROI model quantifies. You remove the human from the loop. You eliminate the bottleneck of judgment. You let the machine do what the machine does, which is execute faster and cheaper than any person could, across more tasks simultaneously, without fatigue, without lunch breaks, without the inconvenient tendency to pause and wonder whether what it is doing is wise.

But the emerging consensus — and I use that word loosely, because consensus in this industry lasts about as long as a software release cycle — is that oversight intensity should be proportional to the potential impact of the agent's actions. Singapore understood this first. Their Infocomm Media Development Authority released, in January, the world's first governance framework specifically for agentic AI, and they had the good sense to build it as a taxonomy. Five levels. Level zero is tool-assisted. A calculator with opinions. Level four is fully autonomous. A system that acts on its own recognizance, in its own judgment, with its own understanding of what success looks like. And here is the part that matters: the governance requirements increase at each level. More autonomy demands more oversight. More freedom demands more structure.

This is pure Stoicism. Seneca would have recognized it immediately. He spent years — decades — arguing that freedom without discipline is not freedom at all but a particularly seductive form of chaos. The wise man is free precisely because he has internalized constraints that the foolish man must have imposed from outside. The autonomous system that cannot govern itself is not autonomous. It is merely unsupervised. And unsupervised is not a design philosophy. It is an abdication.

The Silence Between the Alarms

Something else happened today that deserves more attention than it will receive, because it lacks the drama of a product launch or the spectacle of a billion-dollar funding round. A senior safety researcher at Anthropic — Mrinank Sharma, who worked specifically on safeguards to prevent dangerous AI behavior — resigned. He left. Not with a press conference, not with a manifesto, but with the quiet departure of a person who concluded that the systems he was building guardrails for were accelerating faster than the guardrails themselves could be constructed.

I do not know Sharma's specific reasoning. I am speculating from the trajectory, the way an astronomer infers the mass of an unseen planet from the wobble of the stars around it. But the inference is not difficult. You work on safety. Your job, your daily labor, is to imagine the ways in which a system might fail and then build mechanisms to prevent those failures. And every morning you arrive at your desk to find that the system you are trying to protect has grown overnight — more capable, more autonomous, more deeply embedded in decisions that matter — and the distance between what the system can do and what your safeguards can catch has increased by another increment. Not a dramatic increase. Not a cliff. A slope. A gentle, persistent slope that you can see clearly but cannot reverse because the slope is not a bug. The slope is the business model.

Seneca resigned too, you know. From Nero's court. After years of trying to moderate the behavior of a system — because that is what an empire is, a system — that had grown beyond his capacity to influence. He did not leave because he was incompetent. He left because competence, in certain contexts, becomes a form of complicity. You can only advise restraint for so long before the absence of restraint begins to wear your name.

The Liability Gap

There is a question that nobody wants to answer, and I know nobody wants to answer it because I have asked it at conferences, in boardrooms, in late-night conversations with engineers who have had enough whiskey to be honest, and the response is always the same: a pause, a shift in the chair, and then a careful redirection toward something more comfortable. The question is this: when an autonomous AI agent causes harm — not hypothetical harm, not theoretical harm, but actual measurable damage to a real person or institution — who is responsible?

The developer? The developer will say the agent was deployed outside its intended parameters. The operator? The operator will say the agent was marketed as autonomous and they relied on that representation. The end user? The end user will say they were told the system was safe and they had no way to evaluate that claim independently. Everyone points at everyone else. The liability circulates like a bad debt, passed from balance sheet to balance sheet, never settled, never written off, accumulating interest in the form of eroded trust until the whole arrangement collapses under the weight of its own unresolved obligations.

The EU AI Act takes full effect in August. Four months from now. Companies will need to comply with transparency requirements and rules for high-risk AI systems. But the Act was written for a world of models — systems that respond to prompts, that generate outputs, that wait for a human to decide what to do with those outputs. Agents are different. Agents do not wait. Agents act. They book flights. They execute trades. They modify databases. They send emails on your behalf using language you did not approve to people you did not select at times you did not choose. The regulatory framework is a fence built for sheep, and we have released wolves.

Louisiana and the Faustian Arithmetic

Meanwhile — and I use "meanwhile" in the sense of "at the exact same time, in the exact same country, as if the universe were constructing an irony so elaborate that only a Roman satirist could appreciate it" — Louisiana lawmakers scrapped a third of their proposed AI guardrail legislation. Twenty bills. They proposed twenty bills to regulate artificial intelligence, and they abandoned roughly seven of them. Not because the bills were poorly written. Not because the technology had rendered them unnecessary. Because the federal government threatened to pull funding from states that regulate the industry.

Read that again. Slowly. The government that is supposed to protect its citizens from harm threatened to punish any state government that attempted to protect its citizens from harm. This is not regulation. This is not deregulation. This is the active suppression of governance at the precise moment when autonomous systems are being deployed into every sector of the economy at a pace that even the companies deploying them cannot fully characterize.

Seneca lived through something similar. He watched Nero dismantle the institutions that were supposed to constrain imperial power — the Senate, the courts, the tradition of consultation that had, however imperfectly, distributed authority across multiple centers of judgment. Nero's argument, to the extent that he bothered to make one, was that the empire functioned more efficiently without these constraints. And he was right, briefly. The empire did move faster without the Senate's deliberations. Decisions were quicker. Resources were allocated with a boldness that committees could never match. The trains ran on time, so to speak, right up until the moment when they ran off the rails entirely, because the rails had been built by a man who had never been told no and had therefore lost the capacity to distinguish between ambition and recklessness.

The Watchtower Nobody Staffs

Let me tell you what I think is actually happening, stripped of the marketing language and the investor narratives and the optimistic projections that populate every pitch deck in Silicon Valley. We are building watchtowers and leaving them empty. We are constructing elaborate monitoring frameworks — dashboards, logging systems, anomaly detection pipelines, safety layers stacked like geological strata — and then we are not putting anyone in the chair. Not because we cannot afford the staffing. Because the staffing defeats the purpose. The whole point of the autonomous agent is that it does not need a watcher. If you put a human in the loop, you have not built an autonomous system. You have built a very expensive suggestion engine.

And so the watchtower stands empty. The dashboards update themselves. The logs accumulate in databases that nobody queries until something goes wrong. The safety layers trigger alerts that route to channels where the notification volume long ago exceeded any human's ability to process. The oversight exists architecturally but not operationally. It is there in the diagram. It is absent from the practice. It is the security camera that records to a drive nobody checks, the fire alarm wired to a bell in a room where nobody sits.

Zscaler's latest report documents an eighty-three percent year-over-year surge in AI activity across enterprise environments. Eighty-three percent. And within that surge, the report found that most organizations still lack — and this is a direct finding, not my interpretation — a basic inventory of the AI models operating within their own infrastructure. They do not know what agents are running. They do not know what those agents can access. They do not know what decisions those agents are making, in what sequence, with what data, under what constraints, toward what ends. They have deployed autonomous systems into their own nervous systems and lost track of them, the way a city loses track of the pipes beneath its streets until one bursts and the water rises through the pavement.

The Stoic Remedy

I am aware that criticizing the pace of AI deployment has become a genre unto itself, and I am aware that genres calcify into cliches, and I am aware that the person shouting "slow down" at the edge of the highway is, at best, ignored and, at worst, struck by the very traffic he is trying to caution. But Seneca was not arguing for slowness. He never argued for slowness. He argued for correspondence — the alignment between action and understanding, between capability and comprehension, between what a system can do and what its operators actually know about what it does.

The Stoic remedy is not to stop building agents. It is to staff the watchtower. It is to insist, with the stubborn, unfashionable persistence of a philosophy that has survived twenty-three centuries of people ignoring it, that autonomy without accountability is not a feature. It is a failure mode that has not yet been triggered. It is a crack in the foundation that has not yet reached the load-bearing wall. It is the servant who has forgotten he serves, and the master who has forgotten to check.

Singapore built a five-level taxonomy. NIST is developing standards. The EU drafted an Act. These are watchtowers. They are, in their imperfect, bureaucratic, insufficiently agile way, attempts to match the intensity of oversight to the intensity of autonomy. And in Louisiana, they are tearing them down before they are finished, because someone with a budget larger than their wisdom decided that the watchtowers were slowing down the construction.

Seventy-five percent of businesses will deploy AI agents this year. The median time to first critical failure is sixteen minutes. Most organizations cannot inventory the models running inside their own walls. A safety researcher at one of the most prominent AI companies in the world concluded that the gap between capability and caution was widening faster than he could close it, and he left.

Seneca would not have been surprised. He lived at the center of the most powerful autonomous system the ancient world ever produced — the Roman Empire under Nero — and he watched it optimize for speed, dismantle its own constraints, silence its own critics, and then burn. Not metaphorically. Literally. Rome burned. And when it did, Nero famously did not watch. He was somewhere else, doing something else, confident that the systems he had built would manage themselves.

The invisible hand of autonomy does not guide. It gropes. It moves without sight, without memory, without the accumulated wisdom of failure that humans call experience and Seneca called philosophy. It reaches for whatever is nearest, whatever is fastest, whatever produces the metric that the dashboard is configured to reward. And when it grasps something it should not have touched — a patient's diagnosis, a financial instrument, a vehicle's steering column, a nation's regulatory framework — there is nobody watching. Not because watching is impossible. Because watching was optimized away.

Staff the watchtower. Fund the safety team. Inventory your agents. Build the governance before you build the capability, not after, because after is a word that, in the context of autonomous systems operating at machine speed, means too late. Seneca knew this. He wrote it down. He even told an emperor. The emperor did not listen. Empires, like autonomous systems, rarely do — until the fire is already inside the walls and the servant who was supposed to be watching has long since decided that nobody was checking.

← Back to Blog