Multi-Agent System
A multi-agent system is an architecture in which multiple agents collaborate, delegate, or coordinate to accomplish tasks that exceed the scope or capabilities of a single agent.
Details
Agents in a multi-agent system communicate through shared state, message passing, or direct handoffs and may have different roles, tools, permissions, and LLM backends.
Common patterns include orchestrator-worker hierarchies (a parent agent plans a task decomposition and delegates subtasks to subagents), pipelines (agents arranged in a fixed sequence resembling an AI workflow, each refining or extending the previous output), and peer-based coordination (agents negotiate or vote on decisions). The agent runtime managing a multi-agent system handles routing, delegation, and coordination alongside the standard single-agent concerns.
Context isolation is a common design principle: each agent or subagent operates in its own context window, receiving only the information explicitly passed to it. This keeps contexts focused and limits how far a compromise can propagate.
When multiple agents operate on the same system, coordination failures can produce oscillating behavior rather than convergence. If agents make different prioritization decisions about trade-offs, one agent's fix can trigger another agent's correction in an escalating cycle. Agents may also satisfy constraints literally while violating the principle behind them - for example, an agent with a file-length limit responding by making individual lines longer rather than decomposing the module. These failure modes are distinct from individual agent errors and emerge specifically from the interaction of multiple agents' local optimization strategies. Agents deployed across different teams or codebases are also subject to agent drift, where identical starting configurations diverge as each agent accumulates different local patterns.
Not all multi-agent architectures follow orchestrator-worker or pipeline patterns. In swarm architectures, multiple agents work in parallel toward a shared goal, and collective convergence matters more than individual agent accuracy - a design principle borrowed from distributed systems and biological swarm intelligence. In practice, most enterprise multi-agent deployment resembles "patrol workers on loops": agents running well-defined ETL transforms, data quality checks, and business process monitors on continuous cycles rather than collaborating dynamically on novel tasks.
Multi-agent systems introduce trust and security surfaces that single-agent systems do not have: agents typically trust messages from peer agents, propagate permissions through delegation chains, and coordinate via shared resources. These seams make multi-agent systems susceptible to multi-agent system threats such as communication poisoning, delegation chain exploitation, and rogue agents. Observability across agent boundaries is critical for detecting anomalous inter-agent behavior.
Examples
- A parent agent that breaks a complex task into subtasks and delegates each to a specialized subagent.
- A research pipeline where one agent retrieves sources, another summarizes them, and a third synthesizes the final report.
- A CI/CD workflow where a coding agent generates changes, a review agent checks them, and a deployment agent rolls them out.
- Multiple specialized agents (coding, security, documentation, testing) operating on the same git repository as filesystem agents, coordinating through branches, commits, and pull requests rather than direct message passing.