The Agentic AI Operating System — EY's Blueprint, Translated to SMB Scale
In May 2026, EY disclosed the rollout of an internal agentic AI operating system to more than 300,000 professionals — one of the largest disclosed enterprise AI deployments to date. The components are what matter: a shared knowledge layer, a governed prompt and policy registry, an agent orchestration layer, and a monitoring layer. None of those four components are inherently expensive. None require Big 4 budgets to assemble. The architecture is exactly what every SMB needs the moment AI becomes load-bearing across more than one workflow — and the SMB version, built on existing tools, costs a fraction of one mid-level salary to stand up.
The Architecture in Four Components
Knowledge
Shared facts layer
Notion, Confluence, file index
Prompts
Governed instructions
Versioned prompt + policy library
Orchestration
Agent coordination
n8n, Make, custom workflows
Monitoring
Outcome visibility
Dashboard + weekly review
What EY Actually Built
EY's internal disclosure describes a unified agentic AI platform — branded EYQ — deployed to its global professional workforce as secure enterprise chat, domain assistants, governed prompt tooling, and a controlled environment for safe generative AI experimentation across service lines. The product detail matters less than the architectural intent: turn AI from a per-team tool into shared organisational capability. The connective tissue is what distinguishes EYQ from any individual AI feature. Anyone can adopt ChatGPT; only an organisation that built the connective tissue can deploy hundreds of coherent agentic workflows on top of it.
The interesting question for SMB owners is not “could we afford to build EYQ?” — the answer is obviously no. The interesting question is “what is the connective tissue actually made of?” — because the answer is “four small components, none of which are expensive at SMB scale.” The rest of this article is what those four components look like for a 10-100 employee business.
Component 1: The Shared Knowledge Layer
Every AI agent needs facts. Where the facts live, and how the agent retrieves them, is the single most consequential architectural choice. At enterprise scale, this becomes a vector database fed by content pipelines from dozens of systems. At SMB scale, the shared knowledge layer is much simpler — a Notion or Confluence workspace with three structured sections: company facts (services, pricing, policies, FAQs), operational SOPs (how each workflow runs, who owns what, escalation paths), and customer-facing language (approved phrasing, brand voice, what we do and do not say).
Crucially, the knowledge layer is the source of truth — not the agent's memory and not the model's training data. Agents pull from the knowledge layer at run time. When facts change, you update the knowledge layer once and every agent gets the new fact on its next call. Without this single-source pattern, each agent ends up with its own divergent picture of the business, and the resulting inconsistency is the second-most-common cause of agent deployment failure after vague success criteria.
Component 2: The Prompt and Policy Registry
The second component answers the question “how does each agent know how to behave?” At EY scale, this is a governed prompt and policy library with version control, role-scoped access, approval workflows, and audit logging. At SMB scale, it is a single repository — usually a private Git repo, a structured Notion section, or a tool like PromptLayer — that stores the live prompts each agent uses, the policies that constrain behaviour (tone, escalation rules, do-not-say lists, compliance constraints), and the version history.
The discipline is treating prompts the way you treat code. They are versioned. Changes are reviewed. The current production version is unambiguous. Nobody edits the prompt directly inside the agent UI and hopes for the best. This sounds like overhead; the first time an agent says something embarrassing in production and you need to know exactly what prompt was live at that moment, the registry repays its setup cost ten times over.
Component 3: The Orchestration Layer
The orchestration layer is what makes “multiple AI features” into a coherent system. It is the workflow engine that decides which agent runs when, what each agent passes to the next, where human review enters the loop, and how exceptions are handled. At enterprise scale, this is often custom-built. At SMB scale, off-the-shelf workflow tools — n8n (self-hosted or managed), Make, Zapier with its native agent capability, or workflows built directly on top of HubSpot, ClickUp, or Slack — handle the orchestration well enough for the next several years of growth.
The key architectural rule is that the orchestration layer is the integration layer. Agents do not call each other directly. Agents send their output to the orchestration layer, which routes it to the next step. This indirection is what lets you swap one agent for another, add monitoring, insert human review, or rebuild a step without rebuilding the whole pipeline. Direct agent-to-agent calls feel simpler in week one and become painful in month six; orchestration-layer routing is one extra hop that pays back permanently.
Component 4: The Monitoring and Review Layer
The last component answers the question that ultimately decides whether the system survives: “how do we know it's working?” At enterprise scale, this becomes an observability stack with model-quality metrics, latency monitoring, drift detection, and human evaluation pipelines. At SMB scale, it is much simpler: a dashboard tracking the day-90 metric you set as the definition of done, a log of agent decisions for the previous seven days, an error-routing pathway for cases the agent cannot handle, and a weekly 15-minute review where the owner of each workflow looks at recent runs and decides if anything needs attention.
The monitoring layer is the least exciting of the four components and the one SMBs most consistently underbuild. The pattern is predictable: the agents work, the dashboard is on someone's to-do list, three months pass, an incident happens, no one was watching, and the project loses the executive sponsor's confidence. Build the dashboard in week one alongside the first workflow. Hold the weekly review even when nothing has gone wrong. The monitoring layer is what converts a working system into a durable one.
How to Build It — A Realistic SMB Sequence
Weeks 1-2
Knowledge Layer First, Single Workflow Second
Stand up the shared knowledge layer in Notion or Confluence with company facts, the SOP for one workflow, and the customer-facing language for that workflow. Only then design the first agent. The order matters — building the agent before the knowledge layer guarantees the agent picks up ad hoc facts from wherever it can find them, and those facts immediately diverge from reality.
Weeks 3-4
Prompt Registry and First Agent Live
Move the agent's prompt into a versioned registry (private Git repo or PromptLayer). Deploy the first agent into a single workflow with clear input and output boundaries. Run it in shadow mode (output goes to a review queue, not to customers) for the first week to validate behaviour.
Weeks 5-6
Orchestration Layer Goes In Even Before Second Agent
Route the first agent's output through n8n or Make (or HubSpot workflows, or ClickUp automations) rather than letting the agent call downstream systems directly. The cost in week 5 is small. The cost of skipping this and going direct is paid later when adding the second and third agents requires unpicking direct integrations.
Weeks 7-8
Monitoring Dashboard and Weekly Review Cadence
Stand up the day-90 metric dashboard. Build the agent decision log. Schedule the weekly 15-minute review with whoever owns the workflow. Hold the first review even if nothing has gone wrong. The cadence is the point — the dashboard without the review is unused.
Weeks 9-12
Second Workflow Plugs Into Existing Architecture
The first workflow took eight weeks to build. The second workflow takes three. The architecture pays back from the second workflow forward — because the knowledge layer, prompt registry, orchestration, and monitoring are already in place. By the fourth workflow, the marginal cost of adding a workflow has dropped to about one-quarter of the first one's cost.
When Not to Build the Architecture
If you are running a single one-workflow AI experiment to prove whether AI fits your business, do not build the agentic AI OS. The architecture is overhead until it is load-bearing — which means until you have two or more workflows that depend on shared facts, governed behaviour, or coordinated action. The right test: if a fact about your business changes (a new product, a new policy, updated pricing), how many separate places does that fact live? If the answer is one (a single Notion page), you have shared knowledge. If the answer is “in each agent's system prompt,” you do not, and the second workflow will pay the cost.
Frequently Asked Questions
What is an agentic AI operating system?
The connective architecture that lets multiple AI agents share context, follow governed instructions, access role-scoped data, and coordinate actions across the tools the business already uses. Four components: shared knowledge layer, prompt and policy registry, agent orchestration layer, monitoring and review layer. Not a product — a small set of components.
Can an SMB really build EY-style architecture?
Yes — same components, different scale. EY rolled out to 300,000 professionals; an SMB rolls out to 30 employees. The architecture is identical in shape; the cost differs by orders of magnitude because the SMB version uses existing tools (HubSpot, QuickBooks, Slack, ClickUp, ServiceTitan) as the substrate.
What are the four components for an SMB?
Shared knowledge layer (Notion or Confluence — single source of facts), prompt and policy registry (versioned prompt library — Git repo, structured Notion, or PromptLayer), orchestration layer (n8n, Make, HubSpot workflows, or ClickUp automations), monitoring and review layer (metric dashboard plus weekly review).
Do SMBs need this architecture today or can they wait?
Wait if AI is a one-workflow experiment. Build if AI is becoming load-bearing across two or more workflows. The cost of bolting architecture on later — after three workflows have been built without shared knowledge or governance — is dramatically higher than building it from the second workflow forward.
What is the highest-risk mistake?
Building on a new platform instead of on existing tools. The integration debt of a new platform that asks you to re-pipe your data compounds faster than its benefits. The existing stack — CRM, billing, project tool, communications, document store — is the right substrate for an SMB agentic AI OS.
Related Articles
Atul Dongargaonkar
Founder & Lead Engineer · Swift Headway AI
16+ years building production systems and operational tooling across SaaS and data-infrastructure teams.
Build the Right Architecture, Once
Map the SMB Agentic AI OS to Your Stack
Book a free Operations Audit. We'll map the four-component architecture to the tools you already use — HubSpot, QuickBooks, Slack, ClickUp, ServiceTitan — and scope the first workflow to set the foundation right.
Get Free Operations Audit →