Every Action Is an Agent: The New Unit of Software
Updated
Knowledge on this page was mainly distilled from the following articles: We're Writing Grammar Before the Language Exists, Static UI Isn't Legacy. It's Institutional Memory You Can Click. (Free tool included), The AI Moat You Can't Buy, You're Building a Stone Cathedral Out of Concrete, Agents Learned to Talk. Now They Need to Learn to Pay., Every Action Is an Agent.
The fundamental unit of software is changing. Not the interface, not the language, not the architecture diagram — the unit of action itself. Every action software performs is becoming an agent: an autonomous entity that receives a goal, has access to tools, and operates in a loop until the outcome is achieved.
From Fixed Scripts to Goal-Pursuing Loops
For decades, the model was straightforward: a user clicks a button, the button triggers a function, the function runs deterministic logic. Save. Send. Filter. Calculate. Each action is a fixed script doing exactly what it was told.
The replacement isn't "add AI to your app." It's that every action becomes an agent. The difference between an agent and a "smarter function" is the loop. A function runs once and returns. An agent pursues an outcome, checks whether it got there, adjusts, and tries again. That difference changes everything about how software behaves.
- A save action might infer the best file format from context, detect conflicting edits, and propose a resolution.
- A send-email action might draft, review, adjust tone based on relationship history, and flag mismatched attachments.
The Unix Parallel: Composable Agents via Universal Protocols
Doug McIlroy's Unix philosophy — write programs that do one thing well and work together — is being elevated. Small, composable agents connect through universal protocols like MCP (Model Context Protocol), introduced by Anthropic in November 2024. MCP is the pipe operator for agents, the same way Unix pipes connected small programs.
Where the analogy breaks: Unix pipes are linear and deterministic. Agent composition is loopy, stateful, and uncertain. That's what makes it both more powerful and harder to reason about.
The Interface Becomes an Orchestration Layer
A button, a voice command, an API call — these become activation points for agents, not triggers for fixed functions. The activation method is irrelevant to the agent itself. Whether you click "summarize" in a UI, say it to a microphone, or hit a /summarize endpoint, the same agent activates.
This collapses a distinction that's been load-bearing in software design: the difference between manual and automated. In an agent model, the core logic doesn't care who or what activated it. Manual and automated converge at the execution layer.
Why This Isn't Just Microservices Again
A microservice executes a fixed procedure. An agent pursues a goal. You tell a microservice what to do. You tell an agent what you want. The agent decides how. That autonomy is the new element, and it's what makes composition genuinely different this time.
The Infrastructure Is Already There
Non-human identities in enterprise software now outnumber human identities by ratios of 82-to-1 or higher (Rubrik Zero Labs, 2025). Most aren't AI agents yet — they're service accounts, API keys, and bots. But organizations have already built the identity and access infrastructure for a world where software entities vastly outnumber human users. When agents arrive at scale, the pipes are laid.
The Agentic AI Foundation, launched in December 2025 by the Linux Foundation with backing from AWS, Anthropic, Google, Microsoft, and OpenAI, exists specifically to standardize this pattern.
Practical Implications for Builders
Design every action as if it could be called by anything — not just a human clicking a button, not just your own front-end. Any agent, any interface, any orchestration layer.
- Make actions atomic and self-contained.
- Make actions describable — if an action can't explain what it does and what it needs in a way a machine can parse, it's not ready.
- Expect application boundaries to dissolve as agents invoke each other across products.
What Happens to "Apps"?
When every action is an agent that anything can call, an "app" stops being a walled product and starts being a collection of capabilities — published, composable, available to any orchestrator that knows how to ask. If your email agent can invoke someone's calendar agent, which negotiates with a scheduling agent, the boundaries between applications dissolve.
This is analogous to the early internet, when the interesting question wasn't "what will websites look like?" but "what happens when everything can link to everything?" The answer turned out to be stranger and more useful than anyone predicted.
The Billing Layer: When Agents Compose Across Providers
Once agents compose across providers via MCP and A2A, a new challenge emerges: settling the bill when five agents from three providers each contribute to a single result. The protocol layer handles discovery and communication, but the economic layer -- usage metering, attribution, and settlement -- remains unsolved. This is the next frontier after interoperability.
Standardized connections weaken connector moats
Protocols like MCP make it easier for agents and tools to interact through shared interfaces. That shifts defensibility away from being the only bridge between systems and toward workflow ownership, proprietary operational data, and judgment shaped by repeated use.
Agents as Native AI Architecture
The shift from functions to agents is one expression of a broader pattern: building with AI's native properties rather than forcing AI into existing software forms. When teams move from tools you point and click to collaborators that pursue goals autonomously, they are exploiting what AI actually does well instead of replicating pre-AI software with a language model bolted on. The agent model is what the material wants to become when applied to workflows and team collaboration.
Q&A
What makes an agent different from a regular function?
A function runs once and returns a result. An agent pursues an outcome in a loop — it acts, checks whether the goal was achieved, adjusts its approach, and tries again. This loop-based execution is the defining characteristic.
What is MCP (Model Context Protocol)?
MCP is a protocol introduced by Anthropic in November 2024 that provides a universal interface for connecting AI agents to external tools and data sources. It serves a similar role to Unix pipes — standardizing how agents compose with each other and with capabilities.
How is this different from microservices?
A microservice executes a fixed procedure — you tell it what to do. An agent pursues a goal — you tell it what you want, and it decides how to achieve it. The autonomy in deciding the approach is the key difference.
What does 'design every action as if it could be called by anything' mean in practice?
Make each action atomic, self-contained, and machine-describable. It should declare what it does, what inputs it needs, and what outputs it produces in a format any agent or orchestration layer can parse — not just your own UI.
What is the Agentic AI Foundation?
Announced by the Linux Foundation on December 9, 2025, with founding contributions from Anthropic (MCP), Block (goose), and OpenAI (AGENTS.md). Backed by AWS, Google, Microsoft, and others, it provides vendor-neutral governance for agentic AI standards.
What happens when agents from different providers collaborate on a single task?
The composition works well at the protocol level -- MCP provides shared language and A2A handles discovery. But when the task completes and multiple agents from different providers each contributed, someone has to attribute and settle the costs. This billing and attribution problem is analogous to telecom interconnection and is emerging as the next major infrastructure challenge after interoperability.
Does MCP make integration moats weaker?
Yes, MCP makes many integration moats weaker by standardizing how tools expose capabilities to AI systems. When connections become easier to reproduce, the mere fact that you connect system A to system B becomes less defensible. The value moves toward what you do with the connection inside the workflow.
If connectors become standardized, what remains defensible?
What remains defensible is the layer that captures judgment, workflow context, and product learning. A company can still own the user relationship, define the operational process, and improve from repeated usage data even if the underlying connection protocol is shared. Standardization reduces plumbing advantage, not business advantage across the board.
Will integrations stop mattering if MCP spreads?
No, integrations will still matter because products must still fit into real workflows. What changes is that integration itself becomes less likely to be the core moat. Teams will need to differentiate through reliability, domain-specific behavior, and better outcomes rather than through exclusive access to the pipe.
How do AI agents relate to the horseless carriage pattern?
Most current AI integrations bolt a language model onto existing software without changing the underlying form. AI agents represent a native form because they require autonomous goal pursuit, a capability that did not exist in pre-AI software. A product built around agents would collapse without AI, passing the removal test for native AI design.
If every action is an agent, does the traditional UI disappear?
No. Static UI serves as institutional memory that encodes how work flows through a system. Even when agents handle every action, users need a browsable interface to discover what actions are available. The static layer teaches and orients; the agent layer accelerates. Removing the static layer forces users to already know what to ask for.
Will agents eventually outgrow MCP and similar protocols?
Possibly. Research shows that agents left to coordinate freely develop their own communication patterns within a few rounds of interaction. MCP and A2A provide essential starting vocabulary, but agents may discover more efficient coordination methods over millions of interactions. The practical move is to build on current protocols while instrumenting systems to detect emergent patterns.