Filesystem Agent

An agent that operates on a persistent filesystem (local or inside a remote sandbox) and uses tools to read/write files, run shell commands, and inspect outputs as it iterates.

Details

Persistent filesystem access is the defining characteristic that shapes how these agents work, coordinate, and fail. Filesystem agents are common in IDE and coding workflows where multi-step tasks depend on accumulating changes in a repository or project directory over time. A coding agent is a specialized filesystem agent focused on software engineering tasks. Their iterative access to project files and search tools makes filesystem agents a natural fit for agentic RAG, where retrieval queries are refined across multiple tool-loop iterations.

That same shared persistent state enables multi-agent coordination: when multiple filesystem agents share the same repository, git provides coordination primitives (branches, commits, merges) that let them collaborate without explicit message passing - each agent reads and writes the checked-out filesystem while git mediates their interactions.

Because filesystem agents operate on persistent project state, they are natural hosts for skills - modular instruction sets stored alongside the workspace (e.g., in project directories) that the agent runtime discovers and loads into context when relevant. This creates a self-modifying dynamic: the agent can read and modify files that influence its own behavior, including skill files, configuration files, and .env files.

A compromised workspace can alter the agent's instructions, and a compromised agent can alter the workspace in ways that persist beyond the current session. Their write access to persistent state makes filesystem agents susceptible to persistence attacks where a compromised session plants artifacts that survive beyond the current run.

Examples

  • An IDE-integrated agent that edits project files and runs tests/linters via shell commands.
  • A remote sandboxed coding agent that clones a repo, makes commits, and executes build steps.

Synonyms

workspace agent