BizIdea

WILLOW ai-infra Scan 2026-06-04 to 2026-06-04 Run 20260605160048

Change-control plane that approves employee-built AI agents before they can act in Salesforce, Jira, GitHub, and Google Workspace.

Companies are letting employees and department leads deploy AI agents into support, sales, product, and internal IT workflows before security teams can see what those agents are allowed to do across core SaaS systems. Once an agent can read, write, or trigger actions in tools like Salesforce, Jira, GitHub, and Google Workspace, the real risk is no longer bad text output but silent privilege sprawl, broken approvals, and incidents no one can trace to a human owner.

Overall rating 3.8 / 5.0
  1. 4
    Market

    $2.5B TAM and 33%-54% adoption growth support a large category, but five credible competitors and native stacks make it competitive.

  2. 4
    Differentiation

    Approval packets, entitlement simulation, and one-click revocation form a clear wedge versus broad AI-security suites, though parts are copyable.

  3. 3
    Execution

    Five planned hires and 5.6x LTV/CAC with 12-month payback are solid, but four model flags and a load-bearing Series A raise execution risk.

  4. 4
    Timeliness

    Fresh June 4 funding, a reported 65% incident rate, and four converging signals make the governance gap timely and concrete.

Section

Why now

  1. A reported 65% AI-agent incident rate means enterprises already have enough failures to justify a dedicated control budget.
  2. The risk surface is shifting toward how agents connect into internal systems, which creates a concrete security wedge around entitlements and actions.
  3. Because agents are already embedded in employee workflows, governance has to work for decentralized business teams rather than only a central AI lab.
  4. Seed financing for Willow shows investors believe the control layer is becoming its own category before incumbents fully absorb it.

Catalyst. Willow's financing and the cited rate of AI-agent incidents show enterprises already have agent-driven workflow risk, while agents connecting directly to internal systems make pre-deployment approval and rapid revocation newly urgent.

Section

The idea

The product plugs into major agent builders, identity providers, and SaaS admin APIs to create a live registry of every internal AI agent, who created it, which tools it can touch, and what actions it can take. Before a new agent goes live, the platform generates an approval packet that shows its connected systems, requested scopes, risky write actions, and required human owner. It can block unapproved agents, set time-bound access, and trigger a kill switch if an incident, policy violation, or ownership gap appears. Security teams get a workflow-native control plane instead of scattered OAuth logs and manual app reviews, while business teams keep the speed of self-serve agent deployment. Over time, the registry becomes the authoritative inventory and policy engine for every agent acting inside the enterprise.

What's different. Existing IAM and SaaS-security tools can show permissions by user or app, but they are not designed around agents created by employees, tied to a workflow, and capable of chaining actions across multiple systems. AI observability tools focus on prompts, models, and outputs after the fact, while this product owns the approval, ownership, and revocation loop before and during deployment. Its moat comes from a growing graph of agent identities, action scopes, risky workflow patterns, and policy templates tuned specifically to employee-built agents rather than human users.

Startup thesis
Beachhead 1,000-5,000 employee digital-product and BPO-style service companies that let support, revenue-ops, and internal IT managers deploy self-serve AI agents connected to Salesforce, Zendesk, Jira, GitHub, and Google Workspace.
Wedge An agent change-control layer that inventories every employee-built agent, simulates its reachable systems and actions, requires approval before high- privilege connections go live, and provides a one-click kill switch when a workflow drifts or misbehaves.
Non-obvious insight The new control gap is not model observability; it is machine-identity change management for employee-built agents that can quietly gain action privileges across many internal systems. As agent creation shifts from a central engineering team to workflow owners, the scarce capability becomes approving, scoping, and revoking agent entitlements at the workflow level before incidents happen.
Venture-scale path Start with approval and kill-switch workflows for employee-built internal agents, then expand into runtime policy enforcement, third-party agent onboarding, vendor-risk reviews, agent identity lifecycle management, and a cross-enterprise system of record for every machine worker touching business software.
Target user
Primary user Security and enterprise AI platform teams at 1,000-5,000 employee digital-product and services companies rolling out self-serve AI agents across support, revenue operations, and internal IT.
Secondary user Department operations leaders whose teams build or deploy agents connected to core SaaS systems.
Economic buyer CISO, CIO, or Head of Enterprise AI Platform.
Go-to-market seed
First customer A 2,000-person digital-services company where support and revops managers already use Microsoft Copilot Studio or similar agent builders to connect agents into Salesforce, Zendesk, Jira, and Google Workspace without a centralized review workflow.
Buying trigger An AI-agent incident, audit finding, or broader rollout decision that forces the CISO or CIO to prove which employee-built agents can act in core business systems.
Current alternative SaaS admin consoles, manual OAuth reviews, SSO logs, ticket-based approvals, and generic DLP or CASB policies.
Switching reason The first customer switches because this wedge gives them one inventory, approval path, and kill switch for employee-built agents across many apps, which existing identity and SaaS-admin tools do not provide.
Pricing hypothesis Annual subscription priced by connected systems and active governed agent identities, with premium incident-response and runtime-enforcement modules.

Jobs to be done

Job Current alternative Success metric
When a business team launches a new internal AI agent, help security prove what systems it can read from or write to, so we can approve it without slowing every automation project. Manual ticket reviews and app-by-app OAuth checks. Time to approve or reject a new internal agent falls from weeks to less than three days.
When an AI-agent incident or ownership gap appears, help IT revoke risky agent access immediately, so we can contain damage before it spreads across core SaaS workflows. Searching admin consoles, SSO logs, and workflow docs to find and disable the right integrations. Mean time to identify owner and disable a risky agent drops to under 15 minutes.
Employee agent control loop
flowchart LR
  Buyer[CISO or Head of Enterprise AI] --> Pain[Unapproved agents acting in core SaaS systems]
  Pain --> Product[Employee agent change-control plane]
  Product --> Outcome[Safer self-serve agent rollout with instant revocation]
Idea scorecard — average4.6 / 5 · 5axes
Signal4/5Pain5/5Wedge5/5Defense4/5Scale5/5
  • Signal · 4/5The control gap is concrete and tied to a fresh funding event plus an incident statistic, but the cluster still rests on one verified source.
  • Pain · 5/5Uncontrolled agents with live access to internal systems can create immediate security, compliance, and operational incidents across multiple teams.
  • Wedge · 5/5Approval, inventory, and kill-switch workflows for employee-built agents form a narrow first product with a clear buyer and trigger.
  • Defense · 4/5The agent identity graph, risky-action policy templates, and cross-system incident patterns can compound into a durable control-plane dataset.
  • Scale · 5/5As every large company accumulates internal and third-party agents, the system of record for approving and revoking machine workers can become foundational infrastructure.
Business model canvas
Key partners
  • Identity providers and SaaS-management vendors
  • Agent-builder platforms
  • Security consultancies leading AI rollout programs
Key activities
  • Discovering and classifying employee-built agents
  • Simulating scopes and enforcing approval workflows
  • Monitoring ownership drift and triggering revocation
Key resources
  • Agent identity and entitlement graph
  • Integrations with agent builders, IdPs, and SaaS admin APIs
  • Policy templates for risky agent actions and system combinations
Value propositions
  • Inventory every internal AI agent and its connected systems in one control plane
  • Approve or block risky agent connections before they go live
  • Provide a kill switch and ownership trail for agent incidents
Customer relationships
  • High-touch onboarding around agent discovery and policy setup
  • Security reviews tied to each new agent deployment
  • Expansion into more departments, systems, and third-party agents
Channels
  • Direct sales to CISO, CIO, and enterprise AI leaders
  • Design-partner pilots with companies already running self-serve agent builders
  • Partnerships with identity, SaaS-management, and agent-platform vendors
Customer segments
  • Digital-product companies deploying self-serve internal AI agents
  • BPO and service operators with many workflow-specific agent builders
  • Enterprise AI platform and security teams governing business-led automation
Cost structure
  • Integration engineering
  • Policy engine and control-plane infrastructure
  • Enterprise sales and security-focused customer success
Revenue streams
  • Annual software subscription
  • Usage fees for governed agent identities or approval workflows
  • Premium incident-response and runtime-enforcement modules
Section

Market

Market sizing
TAMSAMSOM TAM · Total addressable $2.5B SAM · Serviceable available $360.0M SOM · Serviceable obtainable $6.0M
Market sizing overview
TAM $2.5B Estimate: ~50,000 qualifying firms globally x roughly $50k blended annual contract value for cross-platform agent governance, anchored by U.S. firm-count references and cross-checked against current enterprise agent-platform spending patterns.
SAM $360.0M Estimate: ~8,000 reachable English-first digital-product and services firms in the beachhead x roughly $45k ACV after narrowing to organizations actively operationalizing agents and decentralized builders.
SOM $6.0M Estimate: 120 customers by year three x roughly $50k ACV, assuming initial win rates through direct sales plus identity and SaaS-security partner channels.

Executive takeaways

  • The real wedge is not generic AI security but cross-platform change control for employee-built agents that can act inside business SaaS.
  • Native platform governance is arriving fast, but it remains vendor-siloed; mixed-stack enterprises still lack one approval path, one owner record, and one rapid revocation workflow across agent builders.
  • Buyer urgency is being pulled forward by actual incident rates, shadow-agent discovery, and governance immaturity rather than by abstract future regulation alone.
  • The most defensible product layer combines entitlement simulation, ownership assignment, and rapid revocation with an inventory graph of agents, tools, and delegated actions.
  • Go-to-market should start where business teams already ship agents without central review: support, revops, and internal IT inside Microsoft-, Google-, and Salesforce-heavy service firms.

Market definition

Cross-platform governance software for employee-built and business-deployed AI agents that can read, write, or trigger actions in enterprise SaaS systems.

Customer and buyer

Primary user is the enterprise AI platform or security team that must approve and trace agent access; economic buyer is typically the CISO, CIO, or head of enterprise AI platform.

Buying triggers

  • A security incident, scope violation, or shadow-agent discovery creates immediate demand for one owner registry and one rapid revocation workflow. [17][18][19]
  • A broader rollout of Copilot Studio, Agentforce, Gemini, or similar builders forces central teams to replace ticket reviews with explicit approval controls. [2][3][5][6][7]
  • Compliance programs become urgent when enterprises realize governance maturity trails adoption and upcoming oversight expectations. [9][10][13][14][15]

Willingness to pay

Enterprises are already paying for agent capacity on usage and per-user models, while incident and visibility gaps create a separate security budget justification. A control layer can price against avoided review labor and faster incident containment, not just against raw model spend. [7][13][17][18][19]

Category dynamics

Growth signal 33% to 54% operational AI-agent adoption over roughly two years in KPMG’s panel

Tailwinds

  • Adoption is moving from pilots to day-to-day operations, which expands the surface area that needs governance rather than ad hoc review.
  • Unknown agents, scope violations, and incidents are already common enough to justify dedicated controls.
  • Large platforms are training enterprises to expect centralized admin controls and logs for AI features, which normalizes budget for governance layers.

Headwinds

  • Native controls may be good enough for single-platform deployments, especially early in rollout.
  • Identity and SaaS-security vendors can extend into agent governance with adjacent data and relationships.
  • Discovery remains hard when agent ownership is unclear and credentials are shared.

Validation signals

  • Willow’s funding and Wix design-partner endorsement validate that enterprises already perceive agent governance as a distinct problem.
  • CSA’s surveys show unknown agents, scope violations, and incidents are already common, which supports an urgent control-plane wedge.
  • Gravitee’s report suggests security approval coverage is still far behind production deployment, leaving a clear operational gap.
  • Major platforms are racing to add their own governance panes and policy surfaces, confirming buyer demand even if solutions remain siloed.

Regulatory & technical constraints

  • Any product in this category has to map delegated agent authority to explicit least-privilege controls, approval evidence, and audit logs across connected tools.
  • Prompt injection and tool-misuse risks mean approval-only products eventually need runtime hooks or fast revocation paths.
  • Enterprises need traceable human ownership and documented oversight to align with evolving AI-governance expectations.
  • Service-account reuse and shared credentials break attribution, so identity-layer integrations are a technical requirement, not just a nice-to-have.
Agent governance market map
← Low specialization High specialization → ← Low urgency High urgency → Q2 Q1 · winning zone Q3 Q4 Proposed startup Microsoft native controls Zenity Astrix Noma Security
Section

Competition

The field is splitting into three camps: platform-native governance inside major clouds and SaaS suites, identity/NHI vendors extending into agents, and AI-security vendors focused on posture plus runtime detection. The whitespace is a workflow-native approval plane that works across all three.

Competitor Stage Wedge Pricing Strength Weakness vs. us
Willow seed Agent management and access layer for transparency and control over how AI agents connect to internal systems and what they are allowed to do. Not public Pure-play narrative tightly aligned to the exact governance gap and already validated by a relevant design partner in Wix. Early product and distribution maturity are unproven; the wedge still needs broader proof outside initial partners.
Zenity scale-up Enterprise AI security platform spanning observability, posture management, and threat protection for agents across platforms. Custom / not public Strong enterprise security framing plus both pre-deployment posture and runtime controls. Broader AI-security positioning can dilute focus on change-control approvals, business ownership, and cross-app rollout workflows.
Noma Security scale-up End-to-end AI and agent discovery, secure-by-design boundaries, and runtime protection across models, tools, and MCP servers. Custom / not public Comprehensive AI-security platform story with strong funding signal and broad coverage. Feels optimized for security breadth; the pitch is less specifically about approval packets, entitlement simulation, and workflow-native revocation for business-built agents.
Astrix scale-up AI-agent and non-human-identity discovery plus least-privilege deployment with short-lived credentials and audit trails. Custom / not public Strong identity-first approach and explicit control-plane language around secure-by-design deployment. Identity-centric value proposition may under-serve the cross-functional approval and ownership UX that security and business teams need together.
Microsoft native stack incumbent Copilot Studio governance, Power Platform data policies, and Microsoft-wide agent governance guidance inside the Microsoft ecosystem. Bundled / platform-led Default distribution, rich telemetry, and credible native controls for Microsoft-created agents. Does not win by default in mixed-stack environments spanning Google, Salesforce, GitHub, Jira, and third-party or custom agents.

Why incumbents do not win by default

  • Cloud platforms. Microsoft, Google, and Salesforce can add strong controls inside their own stacks, but mixed-stack enterprises still need one inventory and one approval layer across builders, connectors, and external SaaS actions.
  • Identity and NHI vendors. Identity-first players are strong at credentials, service accounts, and least privilege, but they do not automatically own the business workflow for pre-launch approval, owner assignment, and delegated-action signoff.
  • SaaS security vendors. Discovery-heavy SaaS security tools can expose shadow AI and app sprawl, yet they are not the default place where security and business teams jointly simulate and approve agent reach before go-live.
  • Generic AI security platforms. Runtime monitoring is important, but buyers still need pre-deployment guardrails and an ownership system of record before a risky agent ever reaches production data.
Section

Business plan

Employee Agent Change Control should start as a cross-platform approval plane for business-built agents inside 1,000-5,000 employee digital-product and services companies, not as another generic AI security dashboard, agent runtime, or single-vendor admin add-on. The first customer is a roughly 2,000-person services or software-heavy operator where support, revops, and internal IT teams already connect Copilot Studio, Agentforce, or Gemini-style agents into Salesforce, Zendesk, Jira, GitHub, and Google Workspace without a central review path. The buying trigger is an incident, audit finding, or enterprise rollout decision that forces the CISO or CIO to prove which employee-built agents can act in core systems and who owns them. Research supports a sizable market at an estimated $2.5B TAM, $360.0M SAM, and $6.0M modeled year-3 SOM if the company stays focused on mixed-stack enterprises rather than chasing single-platform deployments. The MVP should begin with agent inventory, entitlement simulation, owner assignment, approval packets, and a fast kill switch across the few builders and SaaS systems that dominate early deployments. The company can win if it becomes the workflow-native system of record for approved agent reach across Microsoft-, Google-, and Salesforce-heavy estates that native controls do not unify. The main risk is that platform, identity, and AI-security incumbents ship good-enough governance faster than the startup builds distribution. A second major gap is that direct standalone pricing evidence and discovery completeness behind shared service accounts are still unproven, so the first 12 months must test whether buyers will pay for a neutral approval overlay and whether inventory can be complete enough to earn trust.

Problem

  • Security and enterprise AI platform teams cannot see, approve, and revoke employee-built agents consistently once business teams connect them into Salesforce, Jira, GitHub, Google Workspace, Zendesk, and similar core SaaS systems.
  • Existing IAM, SaaS-admin, DLP, and AI-observability tools govern humans, apps, or runtime behavior in isolation, but they do not provide one cross-platform approval workflow with a named owner, scoped entitlements, and immediate revocation before go-live.

Solution

  • Build a control plane that inventories every internal agent, maps creator and owner, simulates reachable systems and risky actions, and generates an approval packet before high-privilege connections go live.
  • Start with read-mostly approval, time-bound access, and one-click revocation across the most common builders and SaaS systems, then add runtime drift and enforcement only after approval workflow adoption is proven.

Why we win

  • The product solves a mixed-stack problem that Microsoft, Google, and Salesforce address only inside their own ecosystems.
  • Approval packets, owner assignment, and fast revocation map directly to the buyer's blocked-rollout problem more tightly than broad AI-security posture or observability products do.
  • A growing graph of agents, scopes, owners, downstream actions, and drift events can compound into a differentiated risk and policy dataset across customers.
Strategic choices
Beachhead English-first digital-product companies and BPO-style service operators with 1,000-5,000 employees, decentralized support or revops automation teams, and active use of Copilot Studio, Agentforce, Gemini, or similar builders across Salesforce, Zendesk, Jira, GitHub, and Google Workspace.
Wedge rationale This wedge creates faster proof than broad enterprise AI governance because the same teams already have incidents, shadow-agent discovery, or blocked rollout reviews tied to agents that can take action in business SaaS. One approval plane, one owner record, and one kill switch answer a single urgent question that releases budget: which employee-built agents are allowed to go live right now?
Sequencing Product, GTM, hiring, and partnerships should start with inventory, entitlement simulation, approval workflow, and revocation because that is the lowest-friction path into security-led pilots and keeps the company out of a premature runtime-security arms race. Once the startup proves that mixed-stack buyers will pay for approval first, it can add drift monitoring, runtime enforcement, and third-party agent onboarding without becoming a services-heavy integration shop.
Not yet Single-vendor Microsoft-only, Google-only, or Salesforce-only deployments where native controls may be sufficient · Customer-facing external agents · Full runtime security and SOC-style response before approval workflow adoption is proven · Long-tail SMB accounts with low agent counts and weak centralized ownership
Go-to-market
Wedge Sell a paid design-partner pilot that inventories one business unit's employee-built agents, simulates their reachable actions across core SaaS, and gives the buyer a named-owner approval path plus kill switch before a broader rollout proceeds.
Channels Founder-led direct sales to CISOs, CIOs, and enterprise AI platform leaders after incidents, audit findings, or unmanaged builder rollouts · Design-partner pilots sourced through identity, NHI, and SaaS-security partners already selling discovery and least-privilege conversations · AI-governance consultancies and transformation programs that need an operational approval layer inside regulated or high-change accounts
Funnel targets Target account→qualified discovery 15-25%, qualified discovery→paid pilot 20-30%, pilot→production 50%+, and pilot kickoff→production decision within 120 days.
Pricing Start with a paid pilot and convert to an annual subscription priced by connected systems and governed agent identities, because buyers are paying to reduce approval delay and incident exposure across a live workflow estate rather than to buy seats. Direct standalone pricing evidence is still thin, so the first design partners should test whether the product can support an assumed $15k-$25k pilot and convert to roughly $45k-$60k annual value for the first production deployment, in line with the research model's blended ACV.
Product roadmap
MVP The MVP should support the narrowest common builder and system bundle across early customers: Copilot Studio, Agentforce, and one Google or custom-agent path, plus Salesforce, Zendesk, Jira, GitHub, and Google Workspace. It should discover agents and owners, simulate scopes and risky write actions, produce approval packets, and let security revoke or time-limit access without promising full runtime enforcement on day one.
6 months Ship 2-3 paid design-partner pilots with agent registry, owner assignment, entitlement simulation, approval packets, time-bound approvals, and one-click kill switch on the initial connector set.
12 months Convert at least 2 pilots into annual production deployments, add policy templates for support, revops, and internal IT workflows, integrate more deeply with identity signals, and launch drift alerts when live behavior or ownership diverges from the approved state.
24 months Expand from approval and revocation into a broader machine-identity control plane with runtime enforcement, third-party agent onboarding, and agent lifecycle management across more departments and mixed vendor stacks.
Key bets Mixed-stack enterprises will pay for a neutral approval layer before they standardize on one platform vendor's governance pane. · The first builder and SaaS bundle covers most risky business-led deployments in the beachhead. · Fast approvals and pre-approved templates can reduce shadow bypass behavior rather than increase it. · Approval and drift data across customers will create a stronger moat than simple discovery or logging alone.
Business model
Revenue streams Annual platform subscription for inventory, approval workflow, owner assignment, and revocation controls · Tiered fees tied to governed agent identities, connected systems, or approval volume · Premium modules for runtime drift monitoring, enforcement, and incident-response workflows · Limited professional services for initial policy setup and connector onboarding
Unit of value Governed agent identities and connected systems under active approval control
Target gross margin 70%
Expansion levers Expand from one business unit to support, revops, internal IT, and additional departments inside the same account · Add runtime drift, enforcement, and lifecycle modules after the approval system of record is established · Increase wallet share through identity, NHI, and SaaS-security partner distribution in mixed-stack enterprises
Strategy map
North-star metric Percentage of active employee-built agents in covered workflows that have a named owner, approved scope, and revocable access in the control plane
Input metrics Paid pilot to production conversion rate · Median time to approve or reject a new high-privilege agent · Percentage of discovered agents with complete owner and scope records · Mean time to revoke or disable a risky agent after an incident signal · Number of governed agent identities and connected systems per production customer
Moats to build Cross-platform graph linking agent builders, owners, credentials, downstream tools, and approved actions · Dataset of approved, rejected, revoked, and drifted agent workflows by department and risk pattern · Reusable policy templates for business-led agent rollouts across mixed Microsoft, Google, Salesforce, and SaaS estates
Kill criteria Fewer than 3 paid pilots after 30 qualified beachhead account conversations · Pilot-to-production conversion below 50% across the first 6 pilots · Less than 80% discovery completeness for the first 10 pilot environments after combining builder, admin-log, and identity evidence · More than 60% of qualified prospects choose native platform controls over a neutral approval layer after a live demo

Milestones

0–12 months
  • Sign 3-5 paid pilots in the beachhead segment.
  • Ship the initial builder and SaaS connector bundle with approval packets and one-click revocation.
  • Complete first-value onboarding in under 30 days for at least 2 customers.
  • Convert at least 2 pilots into annual production contracts.
12–24 months
  • Launch drift monitoring and policy templates for support, revops, and internal IT workflows.
  • Establish 2 partner channels that can source qualified mixed-stack opportunities.
  • Expand within existing customers from one business unit to multiple governed departments.
  • Package a procurement-ready security-review kit that shortens enterprise review cycles.
24–36 months
  • Reach roughly 120 production customers or equivalent ARR consistent with the modeled SOM.
  • Add runtime enforcement and broader agent lifecycle controls only if approval adoption and retention stay strong.
  • Become the system of record for approved machine workers across more departments and third-party agent sources.
Strategy map
flowchart LR
  Wedge[Mixed-stack agent approval wedge] --> MVP[Inventory plus approval packet MVP]
  MVP --> Proof[Named owners, faster approvals, and rapid revocation]
  Proof --> Expansion[Drift monitoring and broader control plane]

Founding team

Role Start timing Rationale
Founder CEO Month 0 Own founder-led sales, customer discovery, pricing, and the cross-functional trust narrative with CISO, CIO, and AI-platform buyers.
Founding eng Month 0 Build the agent graph, approval workflow, kill switch, and first connector bundle without outsourcing core product learning.
Product security lead Month 2 Translate governance requirements into reusable policy templates, security-review artifacts, and drift-monitoring design.
Integration engineer Month 3 Productize builder and SaaS integrations so onboarding stays inside a repeatable deployment window.
GTM lead Month 9 Formalize pipeline generation and partner management only after the founder proves pilot conversion and buyer ownership.

Experiment roadmap

Horizon Experiment Hypothesis Success metric Owner
0–90 days Interview 15 security, enterprise AI platform, and operations leaders about one recent uncontrolled-agent event or blocked rollout. A named incident or rollout review creates a near-term purchase window for a cross-platform approval layer. At least 10 target accounts describe a live approval problem and at least 6 match the beachhead workflow and stack. Founder CEO
0–90 days Run a concierge inventory and approval-packet exercise for two design partners using exported builder, admin, and identity data. A single inventory plus scope simulation will surface enough hidden reach or owner gaps to justify a paid pilot. At least 2 target accounts identify previously untracked agents or risky scopes and at least 1 signs a pilot or LOI. Founding eng
90–180 days Test the minimum connector bundle across 3 paid pilots. The initial builder and SaaS bundle is sufficient to reach first value without custom integration sprawl. At least 2 pilots complete discovery, approval packet generation, and kill-switch setup within 30 days using only the initial support matrix. Product and eng lead
90–180 days Pilot pricing and packaging test A paid approval-led pilot converts better than free proofs of concept and still supports the modeled first-year ACV. At least 3 signed pilot scopes at target pricing and no worse than 50% pilot-to-production conversion across the first 6 pilots. Founder CEO
6–12 months Launch drift monitoring for the first production customers. Customers that adopt approval workflow will also pay for post-launch drift visibility tied to the approved state. At least 2 production customers enable drift monitoring for 90 days and log actionable detections without major false-positive backlash. Product security lead
12–18 months Partner-sourced pipeline motion with one identity or SaaS-security partner. Partners already in least-privilege and discovery conversations can source qualified pilots without lowering win rates. At least 25% of qualified pipeline comes from 2 active partners and partner-sourced pilots convert at least as well as direct pilots. GTM lead

Risk assessment

Business plan risks — 5 mapped
Impact →
High
R3 R5
R1 R2
Medium
R4
Low
Low
Medium
High
Likelihood →
  1. R1Platform-native governance catches up fast enough that mixed-stack buyers delay or avoid standalone spend. · Highlikelihood / Highimpact — Differentiate on cross-platform depth, ownership workflow, and rapid revocation across Microsoft, Google, Salesforce, and third-party agent estates.
  2. R2Discovery blind spots from service-account reuse or generic OAuth apps make the inventory feel incomplete. · Highlikelihood / Highimpact — Start with builders and systems that expose richer telemetry, require identity integrations early, and qualify out accounts where attribution is too weak for proof.
  3. R3Business teams see the product as a new approval bottleneck and route around it with shadow automation. · Mediumlikelihood / Highimpact — Default low-risk templates, fast approval SLAs, and clear escalation paths so governance removes friction for safe launches.
  4. R4Runtime abuse and prompt-injection concerns force customers to demand enforcement before the startup is ready. · Mediumlikelihood / Mediumimpact — Sequence drift monitoring and revocation hooks early and treat full enforcement as a gated expansion path tied to repeated customer demand.
  5. R5Pilot pricing or conversion is too weak to support software-like margins. · Mediumlikelihood / Highimpact — Test paid pilots early, keep the integration bundle narrow, and avoid expanding GTM headcount until production ACV and onboarding time are repeatable.
Risk Likelihood Impact Mitigation
Platform-native governance catches up fast enough that mixed-stack buyers delay or avoid standalone spend. High High Differentiate on cross-platform depth, ownership workflow, and rapid revocation across Microsoft, Google, Salesforce, and third-party agent estates.
Discovery blind spots from service-account reuse or generic OAuth apps make the inventory feel incomplete. High High Start with builders and systems that expose richer telemetry, require identity integrations early, and qualify out accounts where attribution is too weak for proof.
Business teams see the product as a new approval bottleneck and route around it with shadow automation. Medium High Default low-risk templates, fast approval SLAs, and clear escalation paths so governance removes friction for safe launches.
Runtime abuse and prompt-injection concerns force customers to demand enforcement before the startup is ready. Medium Medium Sequence drift monitoring and revocation hooks early and treat full enforcement as a gated expansion path tied to repeated customer demand.
Pilot pricing or conversion is too weak to support software-like margins. Medium High Test paid pilots early, keep the integration bundle narrow, and avoid expanding GTM headcount until production ACV and onboarding time are repeatable.
First customer
Title Head of Enterprise AI Platform or security architecture lead at a 2,000-person digital-services company
Profile A mixed-stack operator where support, revops, and internal IT teams already deploy self-serve agents into Salesforce, Zendesk, Jira, GitHub, and Google Workspace without a centralized review workflow.
Trigger An agent incident, audit finding, or large rollout decision forces leadership to document which employee-built agents can take action in core systems and who owns them.
Buyer CISO, CIO, or Head of Enterprise AI Platform
Initial contract Assumption: an $15k-$25k paid pilot covering one business unit and the first connector bundle, converting to roughly $45k-$60k annual subscription value for the first production deployment if approvals centralize across multiple departments.

What must be true

  • Mixed-stack enterprises must view cross-platform agent approval as a funded problem rather than a temporary extension of manual review.
  • The initial builder and SaaS connector bundle must cover most risky business-led agent deployments in the beachhead.
  • Discovery must be complete enough to expose shadow agents and support owner assignment without large manual cleanup projects.
  • Buyers must adopt approval workflow and kill switch controls before demanding full runtime enforcement in the first contract.
  • The first production deployment must support roughly $45k+ annual value while keeping onboarding productizable.

Open diligence questions

  • How often does a mixed-stack customer actually cross a vendor boundary where native controls stop being sufficient?
  • What percentage of risky agents in early pilots can the product discover when service accounts and generic OAuth apps are involved?
  • Which specific approval artifact changes the buying decision: inventory, entitlement simulation, owner record, or kill switch?
  • Who owns the first budget in practice once the problem moves from incident response to scaled rollout: security, identity, or enterprise AI platform?
  • What pilot packaging and pricing produce the best conversion without turning onboarding into consulting?
Investor verdict
Call Meet / investigate further
Conviction Strong wedge and buyer timing, but conviction depends on proving discovery completeness and standalone budget before native platform controls catch up.
Why believe The company attacks a concrete mixed-stack rollout blocker with a coherent first customer, trigger, and product scope that incumbents do not naturally unify across vendor boundaries.
Why doubt Competition is intense, pricing evidence is still thin, and the product fails if inventory remains incomplete or buyers decide platform-native controls are good enough.
Next diligence Verify with 3-5 paid pilots that buyers fund a neutral approval layer, discover enough shadow agents to matter, and convert to annual contracts within the modeled ACV range.
Section

Financial model

3-year totals
Year 1 revenue $135K EBITDA $-994K · Cash EOP $3.01M
Year 2 revenue $776K EBITDA $-1.55M · Cash EOP $9.45M
Year 3 revenue $3.31M EBITDA $-1.23M · Cash EOP $8.22M
Unit economics
ARPU (annual) $50K
Gross margin 70%
CAC $35K Payback 12.0 months
LTV / CAC 5.6x LTV $194K
Funding ask
Round seed · $4.0M
Runway 24 months
Milestone Reach 5+ production customers and $250K ARR to prove product-market fit ahead of Series A

Model sanity

  • Revenue engine. Annual subscriptions at $50K ARPU scale through a 60% pilot-to-production conversion funnel, compounding from 4 customers in Y1 to 110 by end Y3 as AE and partner channels augment founder-led sales from Q1Y2 onward.
  • Must go right. The first 3–5 paid pilots must sign by Month 8 and convert at 50%+ to sustain the Series A milestone of 18+ customers by Q3Y2; a single quarter of stalled pilot signings delays every subsequent funding and headcount milestone.
  • Model breaks if. Microsoft, Google, or Salesforce ships credible cross-vendor agent governance within 18 months, compressing ARPU below $35K and reducing Y3 revenue by ~$990K per the ARPU sensitivity row, which makes the Series A milestone unreachable on the current cost structure.
  • Next-round proof. Series A at Q3Y2 requires 18 production customers and an ARR run-rate above $850K demonstrated across both direct and at least one partner channel, consistent with the base-case cash low-point of $1,875.8K and the Y3 burn multiple of 0.29x.
Revenue, cash, and EBITDA — 12-month Y1 + 8-quarter Y2/Y3
$0K$2.00M$4.00M$6.00M$8.00M$10.00MM1M4M7M10Q1Y2Q4Y2Q3Y3Q4Y3
  • Revenue (line, area)
  • Cash EOP (dashed)
  • EBITDA (bars, gray = loss)
Use of funds — $4.0M seed
Engineering · 43% GTM · 16% G&A · 15% Buffer (6 mo) · 26%
Headcount build by role — peak18 FTE
Q1Y14Q2Y14Q3Y15Q4Y15Q1Y25Q2Y25Q3Y25Q4Y211Q1Y311Q2Y311Q3Y311Q4Y318
  • CEO
  • Engineering
  • Product & Security
  • GTM & Marketing
  • Customer Success
Year-3 scenarios — base / downside / upside
Y3 revenueY3 EBITDACash low pointDescription
Downside$2.10M-$1.90M$900KPlatform-native controls accelerate; ARPU compresses to $38K; monthly churn rises to 2.5%; pilot conversion drops to 35%; Y3 customers fall to roughly 65.
Base$3.31M-$1.23M$1.88MBase case as modeled: $50K ARPU, 60% pilot conversion, 1.5% monthly churn, 110 production customers EOP Y3, Series A of $8M closes Q3Y2.
Upside$5.00M-$600K$2.20MPartner channels accelerate; ARPU expands to $62K via runtime add-ons; monthly churn falls to 0.8%; 145 production customers EOP Y3.
Sensitivity — Y3 cash and revenue impact, sorted by magnitude
VariableDownsideUpsideCash impactRevenue impact
sales cycle35% pilot conversion — buyers default to platform-native controls after pilot80% pilot conversion — strong early champions and low discovery friction-$1.00M-$1.00M
ARPU$35K — platform-native governance compresses blended ACV by 30%$65K — runtime drift and lifecycle add-on modules raise ACV by 30%-$990K-$990K
churn2.5% monthly — product-market fit gaps reduce net revenue retention below 100%0.5% monthly — strong NPS and expansion drive 120%+ net revenue retention-$500K-$500K
gross margin60% — services-heavy onboarding for complex multi-vendor inventories raises COGS75% — infrastructure automation and self-serve onboarding compress COGS-$331K$0K
CAC$50K — long enterprise procurement cycles and two-stage security review$20K — partner-sourced leads skip initial discovery; faster trust transfer-$300K-$300K
hiring pace2 quarters behind plan — technical sourcing bottleneck delays product and GTM1 quarter ahead on GTM — employer brand enables faster AE and SE hiring-$130K-$490K

Scenarios

Scenario Y3 revenue Y3 EBITDA Cash low point Description Key changes
Downside $2.10M $-1.90M $900K Platform-native controls accelerate; ARPU compresses to $38K; monthly churn rises to 2.5%; pilot conversion drops to 35%; Y3 customers fall to roughly 65.
  • ARPU drops from $50K to $38K due to platform-native price pressure
  • Monthly churn rises from 1.5% to 2.5%
  • Pilot-to-production conversion falls from 60% to 35%
Base $3.31M $-1.23M $1.88M Base case as modeled: $50K ARPU, 60% pilot conversion, 1.5% monthly churn, 110 production customers EOP Y3, Series A of $8M closes Q3Y2.
  • All assumptions as modeled per A1–A22
Upside $5.00M $-600K $2.20M Partner channels accelerate; ARPU expands to $62K via runtime add-ons; monthly churn falls to 0.8%; 145 production customers EOP Y3.
  • ARPU expands from $50K to $62K via drift-monitoring and lifecycle add-ons
  • Partner channels source 40% of pipeline by Q3Y2
  • Monthly churn falls to 0.8% (130% net revenue retention)

Sensitivity

Variable Downside Base Upside
ARPU $35K — platform-native governance compresses blended ACV by 30% $50K — midpoint of BP $45K–$60K ACV range (A1) $65K — runtime drift and lifecycle add-on modules raise ACV by 30%
churn 2.5% monthly — product-market fit gaps reduce net revenue retention below 100% 1.5% monthly — enterprise annual contracts at 85% renewal rate (A11) 0.5% monthly — strong NPS and expansion drive 120%+ net revenue retention
sales cycle 35% pilot conversion — buyers default to platform-native controls after pilot 60% pilot conversion — BP target 50%+ used as mid-point (A6) 80% pilot conversion — strong early champions and low discovery friction
gross margin 60% — services-heavy onboarding for complex multi-vendor inventories raises COGS 70% — BP target gross margin with productized SaaS delivery (A3) 75% — infrastructure automation and self-serve onboarding compress COGS
CAC $50K — long enterprise procurement cycles and two-stage security review $35K — founder-led sales converting to AE motion with partner assist (A12) $20K — partner-sourced leads skip initial discovery; faster trust transfer
hiring pace 2 quarters behind plan — technical sourcing bottleneck delays product and GTM On plan per model headcount schedule (A14, A15) 1 quarter ahead on GTM — employer brand enables faster AE and SE hiring
Key assumptions (22)
ID Name Value Unit Source
A1 ARPU (annual subscription per production customer) 50.0 K USD per year BP pricing: $45K–$60K ACV range; model uses midpoint $50K
A2 Pilot fee (one-time design-partner engagement) 20.0 K USD BP investorMemo.firstCustomer: $15K–$25K paid pilot; model uses midpoint $20K
A3 Target gross margin 70 percent BP businessModel.targetGrossMarginPct: 70
A4 COGS as percent of revenue 30 percent Derived from A3; COGS includes cloud infrastructure, data-processing, and customer-success labor allocated to delivery; improves to 27–28% in Y3 per A17
A5 Pilot fee revenue recognition period 3 months Industry heuristic: design-partner engagements billed ratably over pilot duration; 3-month pilot is standard for enterprise SaaS proof-of-value
A6 Pilot-to-production conversion rate 60 percent BP funnelTargets: 50%+ pilot-to-production; model uses 60% as mid-point for base-case credibility
A7 Time from pilot start to production contract 5 months BP funnelTargets: pilot kickoff to production decision within 120 days (~4 months); model uses 5 months conservatively
A8 All-in cost per FTE 18.0 K USD per month Industry heuristic: $216K/year all-in (base salary, payroll taxes, benefits) for enterprise SaaS startup; excludes equity; consistent with market rates for seniority bands in BP.team
A9 Seed raise amount and timing 4.0 M USD BP fundingAsk.targetFundingRangeUsd: $3–5M; model uses midpoint $4.0M closing at Month 1
A10 Series A raise amount and timing 8.0 M USD Heuristic: milestone-driven raise at end of Q3Y2 after 18+ customers and ~$850K ARR run-rate; $8M calibrated to fund 18-month growth plan to Series B; reflected as cash inflow in Q3Y2 cashEopK
A11 Monthly churn rate 1.5 percent per month Industry heuristic: early-stage enterprise SaaS with 12-month annual contracts; ~85% annual renewal = 15% gross annual churn = ~1.35%/month; model rounds to 1.5%
A12 Blended CAC (fully-loaded sales and marketing cost per new production customer) 35.0 K USD Derived from Y2 model: S&M spend ~$788K / 22 new customers = $35.8K; consistent with 12-month payback heuristic for enterprise SaaS at $50K ACV
A13 Y1 headcount ramp CEO+FoundEng M1, ProductSec M2, IntegEng M3, GTM Lead M9 roles and months BP team section; direct mapping of stated hire timing
A14 Y2 headcount additions ending at 11 FTE +2 Q1Y2 (AE+SE), +2 Q2Y2 (Eng), +1 Q3Y2 (CS), +1 Q4Y2 (Eng) FTE additions by quarter Heuristic: scaling from 5 to 11 FTE supports 26 production customers and two partner channels by end Y2; consistent with BP 12–24 month milestones
A15 Y3 headcount additions reaching 18 FTE +2 Q1Y3, +2 Q2Y3, +2 Q3Y3, +1 Q4Y3 FTE additions by quarter Heuristic: GTM and engineering scaling to support 110 customers; revenue/FTE ~$220K by Y3 is within $200–400K SaaS benchmark
A16 Non-payroll monthly overhead Y1 13 to 22 K USD per month Heuristic: legal $4–5K, cloud tools $3–5K, travel/marketing $2–5K, misc $3–5K; grows from $13K (M1) to $22K (M9+) as GTM activities increase
A17 Y3 gross margin improvement 72 to 73 percent Heuristic: infrastructure unit economics improve ~2–3 percentage points per year of SaaS operation via deployment automation; moves from 70% (Y1–Y2) to 72% (Q1–Q2 Y3) and 73% (Q3–Q4 Y3)
A18 Y1 pilot count 5 pilots BP milestone 0–12 months: sign 3–5 paid pilots; model uses 5 (upper range) signed in M4, M6, M7, M10, M11
A19 Y1 production customers EOP 4 customers BP milestone 0–12 months: convert at least 2 pilots into annual production contracts; model uses 4 (pilots 1–3 convert M9–M11, plus one accelerated conversion M12)
A20 Y3 production customers EOP 110 customers BP milestone 24–36 months: roughly 120 production customers; model uses 110 (8% below target) as conservative base case
A21 Model start month 2026-07 YYYY-MM One month after business-plan date 2026-06-05; allows seed closing and team assembly
A22 Revenue composition subscription + pilot fees description Revenue = production-customer subscriptions (customersEop × $4.167K/month) plus pilot fees ($20K/pilot recognized ratably over 3 months per A5); ARPU in unitEconomics reflects annual subscription only
unit economics flow
flowchart LR
  Leads --> Pilots
  Pilots --> Production
  Production --> ARR
  ARR --> GrossProfit
  GrossProfit --> EBITDA
  EBITDA --> Cash
  Partners --> Pilots
  Churn --> Production

Flags: Series A close in Q3Y2 is load-bearing: the $8M raise is reflected as a $9,875.8K cash jump in that quarter; without it the company exhausts seed before reaching Q2Y3 · Pilot-to-production conversion assumed at 60%; if it falls below 40% for two consecutive quarters the Series A milestone is missed and the model requires a bridge or immediate burn reduction · Discovery completeness is unproven — BP operating assumption requires 80%+ agents discoverable; incomplete inventory reduces pilot confidence and threatens the $50K ACV anchor · Platform-native governance risk is rated high-likelihood and high-impact in the BP risk register; ARPU compression from $50K to $35K reduces Y3 revenue by ~$990K per sensitivity analysis

Section

Top risks

  • Incumbent bundling. Identity, SaaS-management, or agent-platform vendors may ship partial approval and inventory features before the startup gets distribution. Mitigation: Win on cross-platform depth, workflow-specific policy templates, and faster incident revocation across mixed agent stacks rather than single-vendor environments.
  • Discovery blind spots. If many employee-built agents are created outside supported builders or hide behind generic service accounts, the inventory could feel incomplete. Mitigation: Start with the few builders and SaaS systems that account for most business-led deployments, then expand discovery through identity, OAuth, and admin-log integrations.
  • Security friction backlash. Business teams may resist a product that feels like a new approval bottleneck and route around it with shadow automation. Mitigation: Lead with fast approvals, clear ownership, and one-click templates that let safe agents launch quickly while escalating only high-risk scopes.
Section

Evidence

Cited sources (30)

  1. CTech. Wix CEO backs Willow as startup raises $7 million for AI agent control layer | Ctech · https://www.calcalistech.com/ctechnews/article/bj9iwj1zge
  2. Microsoft Learn. Governance and security for AI agents across the organization - Cloud Adoption Framework | Microsoft Learn · https://learn.microsoft.com/en-us/azure/cloud-adoption-framework/ai-agents/governance-security-across-organization
  3. Microsoft Learn. Security and governance - Microsoft Copilot Studio | Microsoft Learn · https://learn.microsoft.com/en-us/microsoft-copilot-studio/security-and-governance
  4. Microsoft Learn. Configure data policies for agents - Microsoft Copilot Studio | Microsoft Learn · https://learn.microsoft.com/en-us/microsoft-copilot-studio/admin-data-loss-prevention
  5. Google Workspace Help. Explore the AI control center | Google Workspace with Gemini | Google Workspace Help · https://knowledge.workspace.google.com/admin/gemini/explore-the-ai-control-center
  6. Google Workspace Blog. Enterprise security controls for Gemini in Google Workspace | Google Workspace Blog · https://workspace.google.com/blog/ai-and-machine-learning/enterprise-security-controls-google-workspace-gemini
  7. Salesforce. Salesforce Agentforce Pricing | Salesforce · https://www.salesforce.com/agentforce/pricing/?bc=OTH
  8. Salesforce. Salesforce exec on how Agentforce addresses agentic AI pain points - Salesforce · https://www.salesforce.com/news/stories/agentforce-addresses-agentic-ai-painpoints/?bc=OTH
  9. NIST. AI Risk Management Framework | NIST · https://www.nist.gov/itl/ai-risk-management-framework
  10. European Commission. AI Act | Shaping Europe’s digital future · https://digital-strategy.ec.europa.eu/en/policies/regulatory-framework-ai
  11. OWASP. Prompt Injection | OWASP Foundation · https://owasp.org/www-community/attacks/PromptInjection
  12. Model Context Protocol. Security Best Practices - Model Context Protocol · https://modelcontextprotocol.io/docs/tutorials/security/security_best_practices
  13. Deloitte. The State of AI in the Enterprise - 2026 AI report | Deloitte US · https://www.deloitte.com/us/en/what-we-do/capabilities/applied-artificial-intelligence/content/state-of-ai-in-the-enterprise.html
  14. KPMG. KPMG AI Quarterly Pulse Survey · https://kpmg.com/us/en/articles/2025/ai-quarterly-pulse-survey.html
  15. Capgemini. Generative AI Report 2025 - Capgemini · https://www.capgemini.com/us-en/insights/research-library/generative-ai-in-organizations-2025/
  16. IBM. IBM Study: Businesses View AI Agents as Essential, Not Just Experimental - Jun 10, 2025 · https://newsroom.ibm.com/2025-06-10-IBM-Study-Businesses-View-AI-Agents-as-Essential,-Not-Just-Experimental
  17. Cloud Security Alliance. New Cloud Security Alliance Survey Reveals 82% of Enterprises | CSA · https://cloudsecurityalliance.org/press-releases/2026/04/21/new-cloud-security-alliance-survey-reveals-82-of-enterprises-have-unknown-ai-agents-in-their-environments
  18. Cloud Security Alliance. More Than Half of Organizations Experience AI Agent Scope | CSA · https://cloudsecurityalliance.org/press-releases/2026/04/16/more-than-half-of-organizations-experience-ai-agent-scope-violations-cloud-security-alliance-study-finds
  19. Gravitee. State of AI Agent Security 2026 Report: When Adoption Outpaces Control · https://www.gravitee.io/blog/state-of-ai-agent-security-2026-report-when-adoption-outpaces-control
  20. Zenity. AI Security Platform for Enterprise-Grade AI Agent Governance · https://zenity.io/platform
  21. Noma Security. Noma Security | AI Security Platform for LLMs, RAG, & AI Agents · https://noma.security/
  22. Astrix. Identity Security for AI Agents & NHIs | Astrix Security · https://astrix.security/
  23. Grip Security. Grip Security | Complete SaaS + AI Control · https://www.grip.security/
  24. Okta. AI Agent Security Series: Rebuilding IAM for Autonomous Trust | Okta · https://www.okta.com/blog/ai/ai-agent-security-series/
  25. Palo Alto Networks. Securing Your SaaS and Data in the Age of AI Agents - Palo Alto Networks Blog · https://www.paloaltonetworks.com/blog/sase/securing-your-saas-and-data-in-the-age-of-ai-agents/
  26. Wiz. AI Agent Security Best Practices | Wiz · https://www.wiz.io/academy/ai-security/ai-agent-security
  27. Google Cloud Blog. Google Agentspace enables the agent-driven enterprise | Google Cloud Blog · https://cloud.google.com/blog/products/ai-machine-learning/google-agentspace-enables-the-agent-driven-enterprise
  28. Microsoft Open Source. Introducing the Agent Governance Toolkit: Open-source runtime security for AI agents | Microsoft Open Source Blog · https://opensource.microsoft.com/blog/2026/04/02/introducing-the-agent-governance-toolkit-open-source-runtime-security-for-ai-agents/
  29. U.S. Census Bureau. Statistics of U.S. Businesses (SUSB) · https://www.census.gov/programs-surveys/susb.html
  30. U.S. Census Bureau. 2022 SUSB Annual Data Tables by Establishment Industry · https://www.census.gov/data/tables/2022/econ/susb/2022-susb-annual.html