WILLOW ai-infra Scan 2026-06-04 to 2026-06-04 Run 20260605160048

Change-control plane that approves employee-built AI agents before they can act in Salesforce, Jira, GitHub, and Google Workspace.

Companies are letting employees and department leads deploy AI agents into support, sales, product, and internal IT workflows before security teams can see what those agents are allowed to do across core SaaS systems. Once an agent can read, write, or trigger actions in tools like Salesforce, Jira, GitHub, and Google Workspace, the real risk is no longer bad text output but silent privilege sprawl, broken approvals, and incidents no one can trace to a human owner.

By Bizidea Research 2026-06-05

Overall rating 3.8 / 5.0

4
Market
$2.5B TAM and 33%-54% adoption growth support a large category, but five credible competitors and native stacks make it competitive.
4
Differentiation
Approval packets, entitlement simulation, and one-click revocation form a clear wedge versus broad AI-security suites, though parts are copyable.
3
Execution
Five planned hires and 5.6x LTV/CAC with 12-month payback are solid, but four model flags and a load-bearing Series A raise execution risk.
4
Timeliness
Fresh June 4 funding, a reported 65% incident rate, and four converging signals make the governance gap timely and concrete.

Section

Why now

A reported 65% AI-agent incident rate means enterprises already have enough failures to justify a dedicated control budget.
The risk surface is shifting toward how agents connect into internal systems, which creates a concrete security wedge around entitlements and actions.
Because agents are already embedded in employee workflows, governance has to work for decentralized business teams rather than only a central AI lab.
Seed financing for Willow shows investors believe the control layer is becoming its own category before incumbents fully absorb it.

Catalyst. Willow's financing and the cited rate of AI-agent incidents show enterprises already have agent-driven workflow risk, while agents connecting directly to internal systems make pre-deployment approval and rapid revocation newly urgent.

Section

The idea

The product plugs into major agent builders, identity providers, and SaaS admin APIs to create a live registry of every internal AI agent, who created it, which tools it can touch, and what actions it can take. Before a new agent goes live, the platform generates an approval packet that shows its connected systems, requested scopes, risky write actions, and required human owner. It can block unapproved agents, set time-bound access, and trigger a kill switch if an incident, policy violation, or ownership gap appears. Security teams get a workflow-native control plane instead of scattered OAuth logs and manual app reviews, while business teams keep the speed of self-serve agent deployment. Over time, the registry becomes the authoritative inventory and policy engine for every agent acting inside the enterprise.

What's different. Existing IAM and SaaS-security tools can show permissions by user or app, but they are not designed around agents created by employees, tied to a workflow, and capable of chaining actions across multiple systems. AI observability tools focus on prompts, models, and outputs after the fact, while this product owns the approval, ownership, and revocation loop before and during deployment. Its moat comes from a growing graph of agent identities, action scopes, risky workflow patterns, and policy templates tuned specifically to employee-built agents rather than human users.

Startup thesis
Beachhead	1,000-5,000 employee digital-product and BPO-style service companies that let support, revenue-ops, and internal IT managers deploy self-serve AI agents connected to Salesforce, Zendesk, Jira, GitHub, and Google Workspace.
Wedge	An agent change-control layer that inventories every employee-built agent, simulates its reachable systems and actions, requires approval before high- privilege connections go live, and provides a one-click kill switch when a workflow drifts or misbehaves.
Non-obvious insight	The new control gap is not model observability; it is machine-identity change management for employee-built agents that can quietly gain action privileges across many internal systems. As agent creation shifts from a central engineering team to workflow owners, the scarce capability becomes approving, scoping, and revoking agent entitlements at the workflow level before incidents happen.
Venture-scale path	Start with approval and kill-switch workflows for employee-built internal agents, then expand into runtime policy enforcement, third-party agent onboarding, vendor-risk reviews, agent identity lifecycle management, and a cross-enterprise system of record for every machine worker touching business software.

Target user
Primary user	Security and enterprise AI platform teams at 1,000-5,000 employee digital-product and services companies rolling out self-serve AI agents across support, revenue operations, and internal IT.
Secondary user	Department operations leaders whose teams build or deploy agents connected to core SaaS systems.
Economic buyer	CISO, CIO, or Head of Enterprise AI Platform.

Go-to-market seed
First customer	A 2,000-person digital-services company where support and revops managers already use Microsoft Copilot Studio or similar agent builders to connect agents into Salesforce, Zendesk, Jira, and Google Workspace without a centralized review workflow.
Buying trigger	An AI-agent incident, audit finding, or broader rollout decision that forces the CISO or CIO to prove which employee-built agents can act in core business systems.
Current alternative	SaaS admin consoles, manual OAuth reviews, SSO logs, ticket-based approvals, and generic DLP or CASB policies.
Switching reason	The first customer switches because this wedge gives them one inventory, approval path, and kill switch for employee-built agents across many apps, which existing identity and SaaS-admin tools do not provide.
Pricing hypothesis	Annual subscription priced by connected systems and active governed agent identities, with premium incident-response and runtime-enforcement modules.

Jobs to be done

Job	Current alternative	Success metric
When a business team launches a new internal AI agent, help security prove what systems it can read from or write to, so we can approve it without slowing every automation project.	Manual ticket reviews and app-by-app OAuth checks.	Time to approve or reject a new internal agent falls from weeks to less than three days.
When an AI-agent incident or ownership gap appears, help IT revoke risky agent access immediately, so we can contain damage before it spreads across core SaaS workflows.	Searching admin consoles, SSO logs, and workflow docs to find and disable the right integrations.	Mean time to identify owner and disable a risky agent drops to under 15 minutes.

Employee agent control loop

flowchart LR
  Buyer[CISO or Head of Enterprise AI] --> Pain[Unapproved agents acting in core SaaS systems]
  Pain --> Product[Employee agent change-control plane]
  Product --> Outcome[Safer self-serve agent rollout with instant revocation]

Idea scorecard — average4.6 / 5 · 5axes

Signal · 4/5The control gap is concrete and tied to a fresh funding event plus an incident statistic, but the cluster still rests on one verified source.
Pain · 5/5Uncontrolled agents with live access to internal systems can create immediate security, compliance, and operational incidents across multiple teams.
Wedge · 5/5Approval, inventory, and kill-switch workflows for employee-built agents form a narrow first product with a clear buyer and trigger.
Defense · 4/5The agent identity graph, risky-action policy templates, and cross-system incident patterns can compound into a durable control-plane dataset.
Scale · 5/5As every large company accumulates internal and third-party agents, the system of record for approving and revoking machine workers can become foundational infrastructure.

Business model canvas

Key partners

Identity providers and SaaS-management vendors
Agent-builder platforms
Security consultancies leading AI rollout programs

Key activities

Discovering and classifying employee-built agents
Simulating scopes and enforcing approval workflows
Monitoring ownership drift and triggering revocation

Key resources

Agent identity and entitlement graph
Integrations with agent builders, IdPs, and SaaS admin APIs
Policy templates for risky agent actions and system combinations

Value propositions

Inventory every internal AI agent and its connected systems in one control plane
Approve or block risky agent connections before they go live
Provide a kill switch and ownership trail for agent incidents

Customer relationships

High-touch onboarding around agent discovery and policy setup
Security reviews tied to each new agent deployment
Expansion into more departments, systems, and third-party agents

Channels

Direct sales to CISO, CIO, and enterprise AI leaders
Design-partner pilots with companies already running self-serve agent builders
Partnerships with identity, SaaS-management, and agent-platform vendors

Customer segments

Digital-product companies deploying self-serve internal AI agents
BPO and service operators with many workflow-specific agent builders
Enterprise AI platform and security teams governing business-led automation

Cost structure

Integration engineering
Policy engine and control-plane infrastructure
Enterprise sales and security-focused customer success

Revenue streams

Annual software subscription
Usage fees for governed agent identities or approval workflows
Premium incident-response and runtime-enforcement modules

Section

Market

Market sizing

Market sizing overview
TAM	$2.5B Estimate: ~50,000 qualifying firms globally x roughly $50k blended annual contract value for cross-platform agent governance, anchored by U.S. firm-count references and cross-checked against current enterprise agent-platform spending patterns.
SAM	$360.0M Estimate: ~8,000 reachable English-first digital-product and services firms in the beachhead x roughly $45k ACV after narrowing to organizations actively operationalizing agents and decentralized builders.
SOM	$6.0M Estimate: 120 customers by year three x roughly $50k ACV, assuming initial win rates through direct sales plus identity and SaaS-security partner channels.

Executive takeaways

The real wedge is not generic AI security but cross-platform change control for employee-built agents that can act inside business SaaS.
Native platform governance is arriving fast, but it remains vendor-siloed; mixed-stack enterprises still lack one approval path, one owner record, and one rapid revocation workflow across agent builders.
Buyer urgency is being pulled forward by actual incident rates, shadow-agent discovery, and governance immaturity rather than by abstract future regulation alone.
The most defensible product layer combines entitlement simulation, ownership assignment, and rapid revocation with an inventory graph of agents, tools, and delegated actions.
Go-to-market should start where business teams already ship agents without central review: support, revops, and internal IT inside Microsoft-, Google-, and Salesforce-heavy service firms.

Market definition

Cross-platform governance software for employee-built and business-deployed AI agents that can read, write, or trigger actions in enterprise SaaS systems.

Customer and buyer

Primary user is the enterprise AI platform or security team that must approve and trace agent access; economic buyer is typically the CISO, CIO, or head of enterprise AI platform.

Buying triggers

A security incident, scope violation, or shadow-agent discovery creates immediate demand for one owner registry and one rapid revocation workflow. [17][18][19]
A broader rollout of Copilot Studio, Agentforce, Gemini, or similar builders forces central teams to replace ticket reviews with explicit approval controls. [2][3][5][6][7]
Compliance programs become urgent when enterprises realize governance maturity trails adoption and upcoming oversight expectations. [9][10][13][14][15]

Willingness to pay

Enterprises are already paying for agent capacity on usage and per-user models, while incident and visibility gaps create a separate security budget justification. A control layer can price against avoided review labor and faster incident containment, not just against raw model spend. [7][13][17][18][19]

Category dynamics

Growth signal 33% to 54% operational AI-agent adoption over roughly two years in KPMG’s panel

Tailwinds

Adoption is moving from pilots to day-to-day operations, which expands the surface area that needs governance rather than ad hoc review.
Unknown agents, scope violations, and incidents are already common enough to justify dedicated controls.
Large platforms are training enterprises to expect centralized admin controls and logs for AI features, which normalizes budget for governance layers.

Headwinds

Native controls may be good enough for single-platform deployments, especially early in rollout.
Identity and SaaS-security vendors can extend into agent governance with adjacent data and relationships.
Discovery remains hard when agent ownership is unclear and credentials are shared.

Validation signals

Willow’s funding and Wix design-partner endorsement validate that enterprises already perceive agent governance as a distinct problem.
CSA’s surveys show unknown agents, scope violations, and incidents are already common, which supports an urgent control-plane wedge.
Gravitee’s report suggests security approval coverage is still far behind production deployment, leaving a clear operational gap.
Major platforms are racing to add their own governance panes and policy surfaces, confirming buyer demand even if solutions remain siloed.

Regulatory & technical constraints

Any product in this category has to map delegated agent authority to explicit least-privilege controls, approval evidence, and audit logs across connected tools.
Prompt injection and tool-misuse risks mean approval-only products eventually need runtime hooks or fast revocation paths.
Enterprises need traceable human ownership and documented oversight to align with evolving AI-governance expectations.
Service-account reuse and shared credentials break attribution, so identity-layer integrations are a technical requirement, not just a nice-to-have.

Agent governance market map

Section

Competition

The field is splitting into three camps: platform-native governance inside major clouds and SaaS suites, identity/NHI vendors extending into agents, and AI-security vendors focused on posture plus runtime detection. The whitespace is a workflow-native approval plane that works across all three.

Competitor	Stage	Wedge	Pricing	Strength	Weakness vs. us
Willow	seed	Agent management and access layer for transparency and control over how AI agents connect to internal systems and what they are allowed to do.	Not public	Pure-play narrative tightly aligned to the exact governance gap and already validated by a relevant design partner in Wix.	Early product and distribution maturity are unproven; the wedge still needs broader proof outside initial partners.
Zenity	scale-up	Enterprise AI security platform spanning observability, posture management, and threat protection for agents across platforms.	Custom / not public	Strong enterprise security framing plus both pre-deployment posture and runtime controls.	Broader AI-security positioning can dilute focus on change-control approvals, business ownership, and cross-app rollout workflows.
Noma Security	scale-up	End-to-end AI and agent discovery, secure-by-design boundaries, and runtime protection across models, tools, and MCP servers.	Custom / not public	Comprehensive AI-security platform story with strong funding signal and broad coverage.	Feels optimized for security breadth; the pitch is less specifically about approval packets, entitlement simulation, and workflow-native revocation for business-built agents.
Astrix	scale-up	AI-agent and non-human-identity discovery plus least-privilege deployment with short-lived credentials and audit trails.	Custom / not public	Strong identity-first approach and explicit control-plane language around secure-by-design deployment.	Identity-centric value proposition may under-serve the cross-functional approval and ownership UX that security and business teams need together.
Microsoft native stack	incumbent	Copilot Studio governance, Power Platform data policies, and Microsoft-wide agent governance guidance inside the Microsoft ecosystem.	Bundled / platform-led	Default distribution, rich telemetry, and credible native controls for Microsoft-created agents.	Does not win by default in mixed-stack environments spanning Google, Salesforce, GitHub, Jira, and third-party or custom agents.

Why incumbents do not win by default

Cloud platforms. Microsoft, Google, and Salesforce can add strong controls inside their own stacks, but mixed-stack enterprises still need one inventory and one approval layer across builders, connectors, and external SaaS actions.
Identity and NHI vendors. Identity-first players are strong at credentials, service accounts, and least privilege, but they do not automatically own the business workflow for pre-launch approval, owner assignment, and delegated-action signoff.
SaaS security vendors. Discovery-heavy SaaS security tools can expose shadow AI and app sprawl, yet they are not the default place where security and business teams jointly simulate and approve agent reach before go-live.
Generic AI security platforms. Runtime monitoring is important, but buyers still need pre-deployment guardrails and an ownership system of record before a risky agent ever reaches production data.

Section

Business plan

Employee Agent Change Control should start as a cross-platform approval plane for business-built agents inside 1,000-5,000 employee digital-product and services companies, not as another generic AI security dashboard, agent runtime, or single-vendor admin add-on. The first customer is a roughly 2,000-person services or software-heavy operator where support, revops, and internal IT teams already connect Copilot Studio, Agentforce, or Gemini-style agents into Salesforce, Zendesk, Jira, GitHub, and Google Workspace without a central review path. The buying trigger is an incident, audit finding, or enterprise rollout decision that forces the CISO or CIO to prove which employee-built agents can act in core systems and who owns them. Research supports a sizable market at an estimated $2.5B TAM, $360.0M SAM, and $6.0M modeled year-3 SOM if the company stays focused on mixed-stack enterprises rather than chasing single-platform deployments. The MVP should begin with agent inventory, entitlement simulation, owner assignment, approval packets, and a fast kill switch across the few builders and SaaS systems that dominate early deployments. The company can win if it becomes the workflow-native system of record for approved agent reach across Microsoft-, Google-, and Salesforce-heavy estates that native controls do not unify. The main risk is that platform, identity, and AI-security incumbents ship good-enough governance faster than the startup builds distribution. A second major gap is that direct standalone pricing evidence and discovery completeness behind shared service accounts are still unproven, so the first 12 months must test whether buyers will pay for a neutral approval overlay and whether inventory can be complete enough to earn trust.

Problem

Security and enterprise AI platform teams cannot see, approve, and revoke employee-built agents consistently once business teams connect them into Salesforce, Jira, GitHub, Google Workspace, Zendesk, and similar core SaaS systems.
Existing IAM, SaaS-admin, DLP, and AI-observability tools govern humans, apps, or runtime behavior in isolation, but they do not provide one cross-platform approval workflow with a named owner, scoped entitlements, and immediate revocation before go-live.

Solution

Build a control plane that inventories every internal agent, maps creator and owner, simulates reachable systems and risky actions, and generates an approval packet before high-privilege connections go live.
Start with read-mostly approval, time-bound access, and one-click revocation across the most common builders and SaaS systems, then add runtime drift and enforcement only after approval workflow adoption is proven.

Why we win

The product solves a mixed-stack problem that Microsoft, Google, and Salesforce address only inside their own ecosystems.
Approval packets, owner assignment, and fast revocation map directly to the buyer's blocked-rollout problem more tightly than broad AI-security posture or observability products do.
A growing graph of agents, scopes, owners, downstream actions, and drift events can compound into a differentiated risk and policy dataset across customers.

Strategic choices
Beachhead	English-first digital-product companies and BPO-style service operators with 1,000-5,000 employees, decentralized support or revops automation teams, and active use of Copilot Studio, Agentforce, Gemini, or similar builders across Salesforce, Zendesk, Jira, GitHub, and Google Workspace.
Wedge rationale	This wedge creates faster proof than broad enterprise AI governance because the same teams already have incidents, shadow-agent discovery, or blocked rollout reviews tied to agents that can take action in business SaaS. One approval plane, one owner record, and one kill switch answer a single urgent question that releases budget: which employee-built agents are allowed to go live right now?
Sequencing	Product, GTM, hiring, and partnerships should start with inventory, entitlement simulation, approval workflow, and revocation because that is the lowest-friction path into security-led pilots and keeps the company out of a premature runtime-security arms race. Once the startup proves that mixed-stack buyers will pay for approval first, it can add drift monitoring, runtime enforcement, and third-party agent onboarding without becoming a services-heavy integration shop.
Not yet	Single-vendor Microsoft-only, Google-only, or Salesforce-only deployments where native controls may be sufficient · Customer-facing external agents · Full runtime security and SOC-style response before approval workflow adoption is proven · Long-tail SMB accounts with low agent counts and weak centralized ownership

Go-to-market
Wedge	Sell a paid design-partner pilot that inventories one business unit's employee-built agents, simulates their reachable actions across core SaaS, and gives the buyer a named-owner approval path plus kill switch before a broader rollout proceeds.
Channels	Founder-led direct sales to CISOs, CIOs, and enterprise AI platform leaders after incidents, audit findings, or unmanaged builder rollouts · Design-partner pilots sourced through identity, NHI, and SaaS-security partners already selling discovery and least-privilege conversations · AI-governance consultancies and transformation programs that need an operational approval layer inside regulated or high-change accounts
Funnel targets	Target account→qualified discovery 15-25%, qualified discovery→paid pilot 20-30%, pilot→production 50%+, and pilot kickoff→production decision within 120 days.
Pricing	Start with a paid pilot and convert to an annual subscription priced by connected systems and governed agent identities, because buyers are paying to reduce approval delay and incident exposure across a live workflow estate rather than to buy seats. Direct standalone pricing evidence is still thin, so the first design partners should test whether the product can support an assumed $15k-$25k pilot and convert to roughly $45k-$60k annual value for the first production deployment, in line with the research model's blended ACV.

Product roadmap
MVP	The MVP should support the narrowest common builder and system bundle across early customers: Copilot Studio, Agentforce, and one Google or custom-agent path, plus Salesforce, Zendesk, Jira, GitHub, and Google Workspace. It should discover agents and owners, simulate scopes and risky write actions, produce approval packets, and let security revoke or time-limit access without promising full runtime enforcement on day one.
6 months	Ship 2-3 paid design-partner pilots with agent registry, owner assignment, entitlement simulation, approval packets, time-bound approvals, and one-click kill switch on the initial connector set.
12 months	Convert at least 2 pilots into annual production deployments, add policy templates for support, revops, and internal IT workflows, integrate more deeply with identity signals, and launch drift alerts when live behavior or ownership diverges from the approved state.
24 months	Expand from approval and revocation into a broader machine-identity control plane with runtime enforcement, third-party agent onboarding, and agent lifecycle management across more departments and mixed vendor stacks.
Key bets	Mixed-stack enterprises will pay for a neutral approval layer before they standardize on one platform vendor's governance pane. · The first builder and SaaS bundle covers most risky business-led deployments in the beachhead. · Fast approvals and pre-approved templates can reduce shadow bypass behavior rather than increase it. · Approval and drift data across customers will create a stronger moat than simple discovery or logging alone.

Business model
Revenue streams	Annual platform subscription for inventory, approval workflow, owner assignment, and revocation controls · Tiered fees tied to governed agent identities, connected systems, or approval volume · Premium modules for runtime drift monitoring, enforcement, and incident-response workflows · Limited professional services for initial policy setup and connector onboarding
Unit of value	Governed agent identities and connected systems under active approval control
Target gross margin	70%
Expansion levers	Expand from one business unit to support, revops, internal IT, and additional departments inside the same account · Add runtime drift, enforcement, and lifecycle modules after the approval system of record is established · Increase wallet share through identity, NHI, and SaaS-security partner distribution in mixed-stack enterprises

Strategy map
North-star metric	Percentage of active employee-built agents in covered workflows that have a named owner, approved scope, and revocable access in the control plane
Input metrics	Paid pilot to production conversion rate · Median time to approve or reject a new high-privilege agent · Percentage of discovered agents with complete owner and scope records · Mean time to revoke or disable a risky agent after an incident signal · Number of governed agent identities and connected systems per production customer
Moats to build	Cross-platform graph linking agent builders, owners, credentials, downstream tools, and approved actions · Dataset of approved, rejected, revoked, and drifted agent workflows by department and risk pattern · Reusable policy templates for business-led agent rollouts across mixed Microsoft, Google, Salesforce, and SaaS estates
Kill criteria	Fewer than 3 paid pilots after 30 qualified beachhead account conversations · Pilot-to-production conversion below 50% across the first 6 pilots · Less than 80% discovery completeness for the first 10 pilot environments after combining builder, admin-log, and identity evidence · More than 60% of qualified prospects choose native platform controls over a neutral approval layer after a live demo

Milestones

0–12 months

Sign 3-5 paid pilots in the beachhead segment.
Ship the initial builder and SaaS connector bundle with approval packets and one-click revocation.
Complete first-value onboarding in under 30 days for at least 2 customers.
Convert at least 2 pilots into annual production contracts.

12–24 months

Launch drift monitoring and policy templates for support, revops, and internal IT workflows.
Establish 2 partner channels that can source qualified mixed-stack opportunities.
Expand within existing customers from one business unit to multiple governed departments.
Package a procurement-ready security-review kit that shortens enterprise review cycles.

24–36 months

Reach roughly 120 production customers or equivalent ARR consistent with the modeled SOM.
Add runtime enforcement and broader agent lifecycle controls only if approval adoption and retention stay strong.
Become the system of record for approved machine workers across more departments and third-party agent sources.

Strategy map

flowchart LR
  Wedge[Mixed-stack agent approval wedge] --> MVP[Inventory plus approval packet MVP]
  MVP --> Proof[Named owners, faster approvals, and rapid revocation]
  Proof --> Expansion[Drift monitoring and broader control plane]

Founding team

Role	Start timing	Rationale
Founder CEO	Month 0	Own founder-led sales, customer discovery, pricing, and the cross-functional trust narrative with CISO, CIO, and AI-platform buyers.
Founding eng	Month 0	Build the agent graph, approval workflow, kill switch, and first connector bundle without outsourcing core product learning.
Product security lead	Month 2	Translate governance requirements into reusable policy templates, security-review artifacts, and drift-monitoring design.
Integration engineer	Month 3	Productize builder and SaaS integrations so onboarding stays inside a repeatable deployment window.
GTM lead	Month 9	Formalize pipeline generation and partner management only after the founder proves pilot conversion and buyer ownership.

Experiment roadmap

Horizon	Experiment	Hypothesis	Success metric	Owner
0–90 days	Interview 15 security, enterprise AI platform, and operations leaders about one recent uncontrolled-agent event or blocked rollout.	A named incident or rollout review creates a near-term purchase window for a cross-platform approval layer.	At least 10 target accounts describe a live approval problem and at least 6 match the beachhead workflow and stack.	Founder CEO
0–90 days	Run a concierge inventory and approval-packet exercise for two design partners using exported builder, admin, and identity data.	A single inventory plus scope simulation will surface enough hidden reach or owner gaps to justify a paid pilot.	At least 2 target accounts identify previously untracked agents or risky scopes and at least 1 signs a pilot or LOI.	Founding eng
90–180 days	Test the minimum connector bundle across 3 paid pilots.	The initial builder and SaaS bundle is sufficient to reach first value without custom integration sprawl.	At least 2 pilots complete discovery, approval packet generation, and kill-switch setup within 30 days using only the initial support matrix.	Product and eng lead
90–180 days	Pilot pricing and packaging test	A paid approval-led pilot converts better than free proofs of concept and still supports the modeled first-year ACV.	At least 3 signed pilot scopes at target pricing and no worse than 50% pilot-to-production conversion across the first 6 pilots.	Founder CEO
6–12 months	Launch drift monitoring for the first production customers.	Customers that adopt approval workflow will also pay for post-launch drift visibility tied to the approved state.	At least 2 production customers enable drift monitoring for 90 days and log actionable detections without major false-positive backlash.	Product security lead
12–18 months	Partner-sourced pipeline motion with one identity or SaaS-security partner.	Partners already in least-privilege and discovery conversations can source qualified pilots without lowering win rates.	At least 25% of qualified pipeline comes from 2 active partners and partner-sourced pilots convert at least as well as direct pilots.	GTM lead

Risk assessment

Business plan risks — 5 mapped

Impact →

High

R3 R5

R1 R2

Medium

Low

Medium

High

Likelihood →

R1Platform-native governance catches up fast enough that mixed-stack buyers delay or avoid standalone spend. · Highlikelihood / Highimpact — Differentiate on cross-platform depth, ownership workflow, and rapid revocation across Microsoft, Google, Salesforce, and third-party agent estates.
R2Discovery blind spots from service-account reuse or generic OAuth apps make the inventory feel incomplete. · Highlikelihood / Highimpact — Start with builders and systems that expose richer telemetry, require identity integrations early, and qualify out accounts where attribution is too weak for proof.
R3Business teams see the product as a new approval bottleneck and route around it with shadow automation. · Mediumlikelihood / Highimpact — Default low-risk templates, fast approval SLAs, and clear escalation paths so governance removes friction for safe launches.
R4Runtime abuse and prompt-injection concerns force customers to demand enforcement before the startup is ready. · Mediumlikelihood / Mediumimpact — Sequence drift monitoring and revocation hooks early and treat full enforcement as a gated expansion path tied to repeated customer demand.
R5Pilot pricing or conversion is too weak to support software-like margins. · Mediumlikelihood / Highimpact — Test paid pilots early, keep the integration bundle narrow, and avoid expanding GTM headcount until production ACV and onboarding time are repeatable.

Risk	Likelihood	Impact	Mitigation
Platform-native governance catches up fast enough that mixed-stack buyers delay or avoid standalone spend.	High	High	Differentiate on cross-platform depth, ownership workflow, and rapid revocation across Microsoft, Google, Salesforce, and third-party agent estates.
Discovery blind spots from service-account reuse or generic OAuth apps make the inventory feel incomplete.	High	High	Start with builders and systems that expose richer telemetry, require identity integrations early, and qualify out accounts where attribution is too weak for proof.
Business teams see the product as a new approval bottleneck and route around it with shadow automation.	Medium	High	Default low-risk templates, fast approval SLAs, and clear escalation paths so governance removes friction for safe launches.
Runtime abuse and prompt-injection concerns force customers to demand enforcement before the startup is ready.	Medium	Medium	Sequence drift monitoring and revocation hooks early and treat full enforcement as a gated expansion path tied to repeated customer demand.
Pilot pricing or conversion is too weak to support software-like margins.	Medium	High	Test paid pilots early, keep the integration bundle narrow, and avoid expanding GTM headcount until production ACV and onboarding time are repeatable.

First customer
Title	Head of Enterprise AI Platform or security architecture lead at a 2,000-person digital-services company
Profile	A mixed-stack operator where support, revops, and internal IT teams already deploy self-serve agents into Salesforce, Zendesk, Jira, GitHub, and Google Workspace without a centralized review workflow.
Trigger	An agent incident, audit finding, or large rollout decision forces leadership to document which employee-built agents can take action in core systems and who owns them.
Buyer	CISO, CIO, or Head of Enterprise AI Platform
Initial contract	Assumption: an $15k-$25k paid pilot covering one business unit and the first connector bundle, converting to roughly $45k-$60k annual subscription value for the first production deployment if approvals centralize across multiple departments.

What must be true

Mixed-stack enterprises must view cross-platform agent approval as a funded problem rather than a temporary extension of manual review.
The initial builder and SaaS connector bundle must cover most risky business-led agent deployments in the beachhead.
Discovery must be complete enough to expose shadow agents and support owner assignment without large manual cleanup projects.
Buyers must adopt approval workflow and kill switch controls before demanding full runtime enforcement in the first contract.
The first production deployment must support roughly $45k+ annual value while keeping onboarding productizable.

Open diligence questions

How often does a mixed-stack customer actually cross a vendor boundary where native controls stop being sufficient?
What percentage of risky agents in early pilots can the product discover when service accounts and generic OAuth apps are involved?
Which specific approval artifact changes the buying decision: inventory, entitlement simulation, owner record, or kill switch?
Who owns the first budget in practice once the problem moves from incident response to scaled rollout: security, identity, or enterprise AI platform?
What pilot packaging and pricing produce the best conversion without turning onboarding into consulting?

Investor verdict
Call	Meet / investigate further
Conviction	Strong wedge and buyer timing, but conviction depends on proving discovery completeness and standalone budget before native platform controls catch up.
Why believe	The company attacks a concrete mixed-stack rollout blocker with a coherent first customer, trigger, and product scope that incumbents do not naturally unify across vendor boundaries.
Why doubt	Competition is intense, pricing evidence is still thin, and the product fails if inventory remains incomplete or buyers decide platform-native controls are good enough.
Next diligence	Verify with 3-5 paid pilots that buyers fund a neutral approval layer, discover enough shadow agents to matter, and convert to annual contracts within the modeled ACV range.

Section

Financial model

3-year totals
Year 1 revenue	$135K EBITDA $-994K · Cash EOP $3.01M
Year 2 revenue	$776K EBITDA $-1.55M · Cash EOP $9.45M
Year 3 revenue	$3.31M EBITDA $-1.23M · Cash EOP $8.22M

Unit economics
ARPU (annual)	$50K
Gross margin	70%
CAC	$35K Payback 12.0 months
LTV / CAC	5.6x LTV $194K

Funding ask
Round	seed · $4.0M
Runway	24 months
Milestone	Reach 5+ production customers and $250K ARR to prove product-market fit ahead of Series A

Model sanity

Revenue engine. Annual subscriptions at $50K ARPU scale through a 60% pilot-to-production conversion funnel, compounding from 4 customers in Y1 to 110 by end Y3 as AE and partner channels augment founder-led sales from Q1Y2 onward.
Must go right. The first 3–5 paid pilots must sign by Month 8 and convert at 50%+ to sustain the Series A milestone of 18+ customers by Q3Y2; a single quarter of stalled pilot signings delays every subsequent funding and headcount milestone.
Model breaks if. Microsoft, Google, or Salesforce ships credible cross-vendor agent governance within 18 months, compressing ARPU below $35K and reducing Y3 revenue by ~$990K per the ARPU sensitivity row, which makes the Series A milestone unreachable on the current cost structure.
Next-round proof. Series A at Q3Y2 requires 18 production customers and an ARR run-rate above $850K demonstrated across both direct and at least one partner channel, consistent with the base-case cash low-point of $1,875.8K and the Y3 burn multiple of 0.29x.

Revenue, cash, and EBITDA — 12-month Y1 + 8-quarter Y2/Y3

Revenue (line, area)
Cash EOP (dashed)
EBITDA (bars, gray = loss)

Use of funds — $4.0M seed

Headcount build by role — peak18 FTE

CEO
Engineering
Product & Security
GTM & Marketing
Customer Success

Year-3 scenarios — base / downside / upside

	Y3 revenue	Y3 EBITDA	Cash low point	Description
Downside	$2.10M	-$1.90M	$900K	Platform-native controls accelerate; ARPU compresses to $38K; monthly churn rises to 2.5%; pilot conversion drops to 35%; Y3 customers fall to roughly 65.
Base	$3.31M	-$1.23M	$1.88M	Base case as modeled: $50K ARPU, 60% pilot conversion, 1.5% monthly churn, 110 production customers EOP Y3, Series A of $8M closes Q3Y2.
Upside	$5.00M	-$600K	$2.20M	Partner channels accelerate; ARPU expands to $62K via runtime add-ons; monthly churn falls to 0.8%; 145 production customers EOP Y3.

Sensitivity — Y3 cash and revenue impact, sorted by magnitude

Variable	Downside	Upside	Cash impact	Revenue impact
sales cycle	35% pilot conversion — buyers default to platform-native controls after pilot	80% pilot conversion — strong early champions and low discovery friction	-$1.00M	-$1.00M
ARPU	$35K — platform-native governance compresses blended ACV by 30%	$65K — runtime drift and lifecycle add-on modules raise ACV by 30%	-$990K	-$990K
churn	2.5% monthly — product-market fit gaps reduce net revenue retention below 100%	0.5% monthly — strong NPS and expansion drive 120%+ net revenue retention	-$500K	-$500K
gross margin	60% — services-heavy onboarding for complex multi-vendor inventories raises COGS	75% — infrastructure automation and self-serve onboarding compress COGS	-$331K	$0K
CAC	$50K — long enterprise procurement cycles and two-stage security review	$20K — partner-sourced leads skip initial discovery; faster trust transfer	-$300K	-$300K
hiring pace	2 quarters behind plan — technical sourcing bottleneck delays product and GTM	1 quarter ahead on GTM — employer brand enables faster AE and SE hiring	-$130K	-$490K

Scenarios

Scenario	Y3 revenue	Y3 EBITDA	Cash low point	Description	Key changes
Downside	$2.10M	$-1.90M	$900K	Platform-native controls accelerate; ARPU compresses to $38K; monthly churn rises to 2.5%; pilot conversion drops to 35%; Y3 customers fall to roughly 65.	ARPU drops from $50K to $38K due to platform-native price pressure Monthly churn rises from 1.5% to 2.5% Pilot-to-production conversion falls from 60% to 35%
Base	$3.31M	$-1.23M	$1.88M	Base case as modeled: $50K ARPU, 60% pilot conversion, 1.5% monthly churn, 110 production customers EOP Y3, Series A of $8M closes Q3Y2.	All assumptions as modeled per A1–A22
Upside	$5.00M	$-600K	$2.20M	Partner channels accelerate; ARPU expands to $62K via runtime add-ons; monthly churn falls to 0.8%; 145 production customers EOP Y3.	ARPU expands from $50K to $62K via drift-monitoring and lifecycle add-ons Partner channels source 40% of pipeline by Q3Y2 Monthly churn falls to 0.8% (130% net revenue retention)

Sensitivity

Variable	Downside	Base	Upside
ARPU	$35K — platform-native governance compresses blended ACV by 30%	$50K — midpoint of BP $45K–$60K ACV range (A1)	$65K — runtime drift and lifecycle add-on modules raise ACV by 30%
churn	2.5% monthly — product-market fit gaps reduce net revenue retention below 100%	1.5% monthly — enterprise annual contracts at 85% renewal rate (A11)	0.5% monthly — strong NPS and expansion drive 120%+ net revenue retention
sales cycle	35% pilot conversion — buyers default to platform-native controls after pilot	60% pilot conversion — BP target 50%+ used as mid-point (A6)	80% pilot conversion — strong early champions and low discovery friction
gross margin	60% — services-heavy onboarding for complex multi-vendor inventories raises COGS	70% — BP target gross margin with productized SaaS delivery (A3)	75% — infrastructure automation and self-serve onboarding compress COGS
CAC	$50K — long enterprise procurement cycles and two-stage security review	$35K — founder-led sales converting to AE motion with partner assist (A12)	$20K — partner-sourced leads skip initial discovery; faster trust transfer
hiring pace	2 quarters behind plan — technical sourcing bottleneck delays product and GTM	On plan per model headcount schedule (A14, A15)	1 quarter ahead on GTM — employer brand enables faster AE and SE hiring

Key assumptions (22)

ID	Name	Value	Unit	Source
A1	ARPU (annual subscription per production customer)	50.0	K USD per year	BP pricing: $45K–$60K ACV range; model uses midpoint $50K
A2	Pilot fee (one-time design-partner engagement)	20.0	K USD	BP investorMemo.firstCustomer: $15K–$25K paid pilot; model uses midpoint $20K
A3	Target gross margin	70	percent	BP businessModel.targetGrossMarginPct: 70
A4	COGS as percent of revenue	30	percent	Derived from A3; COGS includes cloud infrastructure, data-processing, and customer-success labor allocated to delivery; improves to 27–28% in Y3 per A17
A5	Pilot fee revenue recognition period	3	months	Industry heuristic: design-partner engagements billed ratably over pilot duration; 3-month pilot is standard for enterprise SaaS proof-of-value
A6	Pilot-to-production conversion rate	60	percent	BP funnelTargets: 50%+ pilot-to-production; model uses 60% as mid-point for base-case credibility
A7	Time from pilot start to production contract	5	months	BP funnelTargets: pilot kickoff to production decision within 120 days (~4 months); model uses 5 months conservatively
A8	All-in cost per FTE	18.0	K USD per month	Industry heuristic: $216K/year all-in (base salary, payroll taxes, benefits) for enterprise SaaS startup; excludes equity; consistent with market rates for seniority bands in BP.team
A9	Seed raise amount and timing	4.0	M USD	BP fundingAsk.targetFundingRangeUsd: $3–5M; model uses midpoint $4.0M closing at Month 1
A10	Series A raise amount and timing	8.0	M USD	Heuristic: milestone-driven raise at end of Q3Y2 after 18+ customers and ~$850K ARR run-rate; $8M calibrated to fund 18-month growth plan to Series B; reflected as cash inflow in Q3Y2 cashEopK
A11	Monthly churn rate	1.5	percent per month	Industry heuristic: early-stage enterprise SaaS with 12-month annual contracts; ~85% annual renewal = 15% gross annual churn = ~1.35%/month; model rounds to 1.5%
A12	Blended CAC (fully-loaded sales and marketing cost per new production customer)	35.0	K USD	Derived from Y2 model: S&M spend ~$788K / 22 new customers = $35.8K; consistent with 12-month payback heuristic for enterprise SaaS at $50K ACV
A13	Y1 headcount ramp	CEO+FoundEng M1, ProductSec M2, IntegEng M3, GTM Lead M9	roles and months	BP team section; direct mapping of stated hire timing
A14	Y2 headcount additions ending at 11 FTE	+2 Q1Y2 (AE+SE), +2 Q2Y2 (Eng), +1 Q3Y2 (CS), +1 Q4Y2 (Eng)	FTE additions by quarter	Heuristic: scaling from 5 to 11 FTE supports 26 production customers and two partner channels by end Y2; consistent with BP 12–24 month milestones
A15	Y3 headcount additions reaching 18 FTE	+2 Q1Y3, +2 Q2Y3, +2 Q3Y3, +1 Q4Y3	FTE additions by quarter	Heuristic: GTM and engineering scaling to support 110 customers; revenue/FTE ~$220K by Y3 is within $200–400K SaaS benchmark
A16	Non-payroll monthly overhead Y1	13 to 22	K USD per month	Heuristic: legal $4–5K, cloud tools $3–5K, travel/marketing $2–5K, misc $3–5K; grows from $13K (M1) to $22K (M9+) as GTM activities increase
A17	Y3 gross margin improvement	72 to 73	percent	Heuristic: infrastructure unit economics improve ~2–3 percentage points per year of SaaS operation via deployment automation; moves from 70% (Y1–Y2) to 72% (Q1–Q2 Y3) and 73% (Q3–Q4 Y3)
A18	Y1 pilot count	5	pilots	BP milestone 0–12 months: sign 3–5 paid pilots; model uses 5 (upper range) signed in M4, M6, M7, M10, M11
A19	Y1 production customers EOP	4	customers	BP milestone 0–12 months: convert at least 2 pilots into annual production contracts; model uses 4 (pilots 1–3 convert M9–M11, plus one accelerated conversion M12)
A20	Y3 production customers EOP	110	customers	BP milestone 24–36 months: roughly 120 production customers; model uses 110 (8% below target) as conservative base case
A21	Model start month	2026-07	YYYY-MM	One month after business-plan date 2026-06-05; allows seed closing and team assembly
A22	Revenue composition	subscription + pilot fees	description	Revenue = production-customer subscriptions (customersEop × $4.167K/month) plus pilot fees ($20K/pilot recognized ratably over 3 months per A5); ARPU in unitEconomics reflects annual subscription only

unit economics flow

flowchart LR
  Leads --> Pilots
  Pilots --> Production
  Production --> ARR
  ARR --> GrossProfit
  GrossProfit --> EBITDA
  EBITDA --> Cash
  Partners --> Pilots
  Churn --> Production

Flags: Series A close in Q3Y2 is load-bearing: the $8M raise is reflected as a $9,875.8K cash jump in that quarter; without it the company exhausts seed before reaching Q2Y3 · Pilot-to-production conversion assumed at 60%; if it falls below 40% for two consecutive quarters the Series A milestone is missed and the model requires a bridge or immediate burn reduction · Discovery completeness is unproven — BP operating assumption requires 80%+ agents discoverable; incomplete inventory reduces pilot confidence and threatens the $50K ACV anchor · Platform-native governance risk is rated high-likelihood and high-impact in the BP risk register; ARPU compression from $50K to $35K reduces Y3 revenue by ~$990K per sensitivity analysis

Section

Top risks

Incumbent bundling. Identity, SaaS-management, or agent-platform vendors may ship partial approval and inventory features before the startup gets distribution. Mitigation: Win on cross-platform depth, workflow-specific policy templates, and faster incident revocation across mixed agent stacks rather than single-vendor environments.
Discovery blind spots. If many employee-built agents are created outside supported builders or hide behind generic service accounts, the inventory could feel incomplete. Mitigation: Start with the few builders and SaaS systems that account for most business-led deployments, then expand discovery through identity, OAuth, and admin-log integrations.
Security friction backlash. Business teams may resist a product that feels like a new approval bottleneck and route around it with shadow automation. Mitigation: Lead with fast approvals, clear ownership, and one-click templates that let safe agents launch quickly while escalating only high-risk scopes.

Section

Evidence

Cited sources (30)

CTech. Wix CEO backs Willow as startup raises $7 million for AI agent control layer | Ctech · https://www.calcalistech.com/ctechnews/article/bj9iwj1zge
Microsoft Learn. Governance and security for AI agents across the organization - Cloud Adoption Framework | Microsoft Learn · https://learn.microsoft.com/en-us/azure/cloud-adoption-framework/ai-agents/governance-security-across-organization
Microsoft Learn. Security and governance - Microsoft Copilot Studio | Microsoft Learn · https://learn.microsoft.com/en-us/microsoft-copilot-studio/security-and-governance
Microsoft Learn. Configure data policies for agents - Microsoft Copilot Studio | Microsoft Learn · https://learn.microsoft.com/en-us/microsoft-copilot-studio/admin-data-loss-prevention
Google Workspace Help. Explore the AI control center | Google Workspace with Gemini | Google Workspace Help · https://knowledge.workspace.google.com/admin/gemini/explore-the-ai-control-center
Google Workspace Blog. Enterprise security controls for Gemini in Google Workspace | Google Workspace Blog · https://workspace.google.com/blog/ai-and-machine-learning/enterprise-security-controls-google-workspace-gemini
Salesforce. Salesforce Agentforce Pricing | Salesforce · https://www.salesforce.com/agentforce/pricing/?bc=OTH
Salesforce. Salesforce exec on how Agentforce addresses agentic AI pain points - Salesforce · https://www.salesforce.com/news/stories/agentforce-addresses-agentic-ai-painpoints/?bc=OTH
NIST. AI Risk Management Framework | NIST · https://www.nist.gov/itl/ai-risk-management-framework
European Commission. AI Act | Shaping Europe’s digital future · https://digital-strategy.ec.europa.eu/en/policies/regulatory-framework-ai
OWASP. Prompt Injection | OWASP Foundation · https://owasp.org/www-community/attacks/PromptInjection
Model Context Protocol. Security Best Practices - Model Context Protocol · https://modelcontextprotocol.io/docs/tutorials/security/security_best_practices
Deloitte. The State of AI in the Enterprise - 2026 AI report | Deloitte US · https://www.deloitte.com/us/en/what-we-do/capabilities/applied-artificial-intelligence/content/state-of-ai-in-the-enterprise.html
KPMG. KPMG AI Quarterly Pulse Survey · https://kpmg.com/us/en/articles/2025/ai-quarterly-pulse-survey.html
Capgemini. Generative AI Report 2025 - Capgemini · https://www.capgemini.com/us-en/insights/research-library/generative-ai-in-organizations-2025/
IBM. IBM Study: Businesses View AI Agents as Essential, Not Just Experimental - Jun 10, 2025 · https://newsroom.ibm.com/2025-06-10-IBM-Study-Businesses-View-AI-Agents-as-Essential,-Not-Just-Experimental
Cloud Security Alliance. New Cloud Security Alliance Survey Reveals 82% of Enterprises | CSA · https://cloudsecurityalliance.org/press-releases/2026/04/21/new-cloud-security-alliance-survey-reveals-82-of-enterprises-have-unknown-ai-agents-in-their-environments
Cloud Security Alliance. More Than Half of Organizations Experience AI Agent Scope | CSA · https://cloudsecurityalliance.org/press-releases/2026/04/16/more-than-half-of-organizations-experience-ai-agent-scope-violations-cloud-security-alliance-study-finds
Gravitee. State of AI Agent Security 2026 Report: When Adoption Outpaces Control · https://www.gravitee.io/blog/state-of-ai-agent-security-2026-report-when-adoption-outpaces-control
Zenity. AI Security Platform for Enterprise-Grade AI Agent Governance · https://zenity.io/platform
Noma Security. Noma Security | AI Security Platform for LLMs, RAG, & AI Agents · https://noma.security/
Astrix. Identity Security for AI Agents & NHIs | Astrix Security · https://astrix.security/
Grip Security. Grip Security | Complete SaaS + AI Control · https://www.grip.security/
Okta. AI Agent Security Series: Rebuilding IAM for Autonomous Trust | Okta · https://www.okta.com/blog/ai/ai-agent-security-series/
Palo Alto Networks. Securing Your SaaS and Data in the Age of AI Agents - Palo Alto Networks Blog · https://www.paloaltonetworks.com/blog/sase/securing-your-saas-and-data-in-the-age-of-ai-agents/
Wiz. AI Agent Security Best Practices | Wiz · https://www.wiz.io/academy/ai-security/ai-agent-security
Google Cloud Blog. Google Agentspace enables the agent-driven enterprise | Google Cloud Blog · https://cloud.google.com/blog/products/ai-machine-learning/google-agentspace-enables-the-agent-driven-enterprise
Microsoft Open Source. Introducing the Agent Governance Toolkit: Open-source runtime security for AI agents | Microsoft Open Source Blog · https://opensource.microsoft.com/blog/2026/04/02/introducing-the-agent-governance-toolkit-open-source-runtime-security-for-ai-agents/
U.S. Census Bureau. Statistics of U.S. Businesses (SUSB) · https://www.census.gov/programs-surveys/susb.html
U.S. Census Bureau. 2022 SUSB Annual Data Tables by Establishment Industry · https://www.census.gov/data/tables/2022/econ/susb/2022-susb-annual.html

Why now

The idea

Jobs to be done

Market

Executive takeaways

Market definition

Customer and buyer

Buying triggers

Willingness to pay

Category dynamics

Tailwinds

Headwinds

Validation signals

Regulatory & technical constraints

Competition

Why incumbents do not win by default

Business plan

Problem

Solution

Why we win

Milestones

Founding team

Experiment roadmap

Risk assessment

What must be true

Open diligence questions

Financial model

Model sanity

Scenarios

Sensitivity

Top risks

Evidence

Cited sources (30)

Related dossiers

Policy-safe trace relay for AI vendors in customer VPCs, exporting redacted support evidence without raw-data exfiltration.

Knowledge expiry gate that quarantines stale docs before support and employee AI agents answer from them.

Control plane that shadow-tests email and CRM permissions before support agents can act on customer conversations.