AI-NATIVE OBSERVABILITY ai-infra Scan 2026-06-23 to 2026-06-23 Run 20260624160046

Policy-safe trace relay for AI vendors in customer VPCs, exporting redacted support evidence without raw-data exfiltration.

AI application vendors increasingly win enterprise deals only if they deploy inside the customer's own cloud account, but that deployment model blinds the vendor once production incidents start. Support teams need raw traces, prompt lineage, and agent-call context to debug failures, yet security teams will not allow unrestricted telemetry exfiltration.

By Bizidea Research 2026-06-24

Overall rating 4.2 / 5.0

4
Market
$300M TAM with 16% CAGR; five mapped competitors each expose the same workflow gap in vendor-safe cross-tenant support, signaling breakable concentration.
4
Differentiation
All five competitors miss the vendor-safe cross-tenant relay workflow; policy templates and an incident corpus compound into a structural moat over time.
4
Execution
Top-decile LTV/CAC of 7.7 and 7.3-month payback; three open model flags on security-review pass rates and deployment margins keep the score at four.
5
Timeliness
Five signals in a single day—BYOC mandates, AI telemetry cost pressure, named buyers, and $45M category funding—mark this as a clear breakout moment.

Section

Why now

AI telemetry volumes are making legacy per-byte observability pricing structurally uneconomic for agent workloads.
Enterprise buyers now require telemetry and prompts to stay inside their own cloud accounts, creating a new tooling gap for vendors.
Debugging modern agents now requires prompt-level traces, confidence scores, and call graphs, not just logs and metrics.
Named customers and six-figure contracts prove there is already budget for premium BYOC AI reliability infrastructure.

Catalyst. Tsuga's traction shows enterprises now insist on in-cloud AI observability because AI traces are both too sensitive and too voluminous for legacy SaaS observability patterns.

Section

The idea

The product deploys a lightweight collector and policy engine into each customer tenant where the AI vendor already runs its application. It records complete agent traces, prompt lineage, confidence scores, and call graphs locally, then automatically generates redacted incident bundles, regression diffs, and benchmark snapshots that the vendor can access centrally. Customers control what fields can leave the account, how long evidence is retained, and whether replay artifacts are exportable. Vendors get cross-tenant fleet health, release-quality comparisons, and much faster incident resolution without asking customers to open broad observability access.

What's different. Traditional observability vendors want the customer to buy another backend; this startup sells the vendor a support-specific relay purpose-built for fleets of isolated customer-cloud deployments. The core IP is policy-aware evidence packaging: full traces stay local, while only approved summaries, diffs, and redacted artifacts leave the tenant. That makes the product useful even when the customer already has Datadog, Grafana, or CloudWatch, because it solves the vendor's cross-tenant debugging and release-support problem rather than replacing the customer's tooling.

Startup thesis
Beachhead	Remote support and release debugging for Series A-C AI application vendors that deploy copilots or agents into customer-owned AWS accounts at banks, insurers, publishers, and other data-sensitive enterprises
Wedge	An in-tenant trace relay that captures full agent execution data, applies customer-defined redaction and export policy, and emits vendor-safe support bundles plus fleet-level health signals
Non-obvious insight	BYOC changes observability from a buyer-owned dashboard problem into a vendor-support infrastructure problem. The winning product is not another Datadog clone; it is a policy engine that computes full-fidelity support evidence inside each tenant cloud and exports only the minimum approved data needed to diagnose, compare, and fix incidents across a fleet of isolated customer environments.
Venture-scale path	Start with P1 support and release-debugging workflows, then expand into private-deployment SLA proof, upgrade validation, fleet benchmarking, usage-based billing evidence, and eventually the remote-operations control plane for all BYOC AI software vendors.

Target user
Primary user	Head of Customer Reliability or AI Platform at a Series A-C software vendor shipping BYOC agent deployments into regulated enterprises
Secondary user	Solutions architects and escalation engineers supporting private-cloud customer tenants
Economic buyer	VP Engineering or CTO

Go-to-market seed
First customer	Series B AI software vendor with 5-20 live BYOC tenants in banking, insurance, or publishing, a 3-8 person reliability team, and weekly production escalations that currently require customer-run log pulls
Buying trigger	A recent enterprise win or renewal that requires private-cloud deployment and creates more support escalations than the vendor can handle manually
Current alternative	Ad hoc SSH and log-pull workflows, customer-run Datadog or Grafana dashboards, and one-off support scripts maintained by solutions engineers
Switching reason	The relay cuts MTTR without forcing raw prompt or trace data out of the tenant, while giving the vendor a central view across all private deployments that manual workflows and customer-owned dashboards cannot provide.
Pricing hypothesis	Per live BYOC tenant per month with premium tiers based on retained trace days, protected environments, and number of agent runs analyzed

Jobs to be done

Job	Current alternative	Success metric
When a BYOC customer reports an agent failure, help the vendor reliability lead collect usable evidence without violating the customer's data-boundary rules, so they can resolve the incident quickly.	Manual log exports coordinated over support tickets and shared dashboards	Time to first actionable root-cause packet and MTTR for severity-one incidents
When a vendor ships a new agent release across isolated customer tenants, help the platform team compare behavior safely, so they can catch regressions before renewals or escalations.	Limited sampling from customer dashboards and ad hoc release spreadsheets	Regression-detection rate and reduction in customer-escalated release incidents

BYOC support relay

flowchart LR
  Buyer[BYOC AI vendor] --> Pain[Blind support inside customer clouds]
  Pain --> Product[In-tenant trace relay and policy engine]
  Product --> Outcome[Faster MTTR with safe cross-tenant insight]

Idea scorecard — average4.4 / 5 · 5axes

Signal · 5/5The cluster shows strong demand, fresh funding, concrete pricing pain, and named customers already buying the category.
Pain · 4/5BYOC support blindness creates acute operational pain for vendors, though the first buyers are still a narrower subset of enterprise AI vendors.
Wedge · 5/5The first workflow is specific and painful: export-safe incident evidence and release diffs for private-cloud AI deployments.
Defense · 4/5Redaction policy, tenant-side collectors, and cross-tenant evidence packaging can compound into a workflow moat, though large observability vendors may eventually move nearby.
Scale · 4/5The beachhead is narrow, but it can expand into the default operations layer for BYOC AI software fleets across support, upgrades, and commercial analytics.

Business model canvas

Key partners

AWS and private-cloud deployment partners
AI application vendors with BYOC rollouts
Systems integrators serving regulated enterprises

Key activities

Building connectors for agent frameworks and cloud runtimes
Maintaining redaction, export-policy, and evidence-packaging logic
Operating fleet-health analytics and release-diff workflows

Key resources

In-tenant collectors and policy engine
Cross-tenant fleet analytics backend
Security and compliance reference architectures

Value propositions

Debug private-cloud AI incidents without raw-data exfiltration
Give vendors cross-tenant fleet visibility while preserving customer control
Reduce support load and speed enterprise renewals for BYOC products

Customer relationships

High-touch implementation with shared security review
Ongoing reliability reviews tied to release quality and MTTR

Channels

Direct founder-led sales to CTOs and VP Engineering leaders at AI vendors
Partnerships with cloud deployment consultancies and reference-architecture partners
Customer-success introductions from existing BYOC enterprise accounts

Customer segments

Series A-C AI software vendors selling BYOC deployments into regulated enterprises
Platform and reliability teams supporting private-cloud customer tenants

Cost structure

Cloud infrastructure for central analytics
Deployment engineering and customer support
Security, compliance, and connector maintenance

Revenue streams

Subscription per live BYOC tenant
Premium analytics and retention tiers
Enterprise onboarding and security-review services

Section

Market

Market sizing

Market sizing overview
TAM	$300.0M Estimate: 2,500 global AI vendors that could face BYOC/private-cloud support pain × roughly $120k annual spend per vendor, anchored by Tsuga’s six-figure average contracts and enterprise packaging across Langfuse, LangSmith, and groundcover.
SAM	$84.0M Estimate: 700 current beachhead vendors in the narrower Series A-C + regulated-enterprise BYOC segment × roughly $120k annual spend, constrained by private-networking and governance requirements.
SOM	$7.2M Estimate: 60 reachable customers by year 3 × roughly $120k annual spend, assuming concentrated founder-led sales into vendors already carrying multiple private-cloud tenants and recurring incident load.

Executive takeaways

The wedge is real but narrow: the pain is not generic monitoring, it is vendor-side support for AI products running inside customer-controlled clouds.
Private connectivity, masking, retention controls, and auditability are table stakes for this category, not premium add-ons.
Budget exists when the pain is acute, but the earliest spend is concentrated in AI vendors with multiple live BYOC tenants and recurring Sev-1 support load.
Go-to-market risk is more about security-review friction and install complexity than about whether the underlying telemetry stack can be built.

Market definition

A vendor-support and reliability layer for AI software companies that deploy inside customer clouds, sitting between generic observability tooling and managed support operations.

Customer and buyer

Economic buyers are usually CTOs or VPs of Engineering facing enterprise BYOC renewals, while day-to-day champions are reliability, platform, and senior solutions engineers who currently coordinate customer-run log pulls and dashboard access.

Buying triggers

A new or renewed enterprise account requires private-cloud deployment, making direct vendor access impossible without a support-safe relay. [2][29][31]
AI telemetry bills or sampling gaps make legacy SaaS observability economically and operationally painful for agent workloads. [2][3][21][26]
A bad release or repeated incident across isolated tenants exposes the cost of manual evidence gathering and weak fleet comparison. [4][11][24]
Governance work forces automatic logging, masking, and audit trails into the production checklist before more regulated customers can go live. [8][18][32][36][38]

Willingness to pay

Premium spend is plausible when pain is acute: Tech Funding News reports Tsuga at several million ARR with six-figure average contracts, while Langfuse, LangSmith, and groundcover show that paid tiers concentrate around retention, auditability, support, and enterprise controls rather than raw tracing alone. [2][9][15][23]

Category dynamics

Growth signal 16% CAGR (observability market 2024-2034, as cited in Tech Funding News coverage).

Tailwinds

AI agent loops generate far more telemetry and make legacy sampling compromises harder to accept.
Private connectivity and data-boundary controls are now standard cloud features, reducing feasibility risk for BYOC deployment patterns.
OTel-based AI observability ecosystems are maturing quickly, lowering integration risk for new entrants.

Headwinds

Each tenant install can trigger heavy security and procurement work, slowing expansion even when pain is obvious.
Buyers can often good-enough the problem with existing dashboards plus manual support processes.
Zero-retention promises are nuanced because some model-provider features still retain or monitor prompts.

Validation signals

Tsuga already reports several million dollars of ARR, six-figure average contracts, and named customers in this architectural neighborhood.
Adjacent observability vendors now package BYOC, on-prem, and air-gapped deployment models as first-class offers.
Major clouds have normalized private connectivity and tighter data-boundary controls for AI services, lowering technical feasibility risk.
The telemetry layer is standardizing around OTel-style semantics, which reduces connector risk for a new vendor.

Regulatory & technical constraints

Private networking and no-public-egress patterns are often mandatory for accessing models and moving telemetry in regulated environments.
Prompt and log masking, deletion, and auditability must be first-class product features, not post-processing steps.
Zero-retention claims are cloud- and feature-specific; some provider workflows still retain prompts or responses for monitoring.
EU AI Act-style requirements push automatic event logging, post-market monitoring, and incident evidence deeper into the runtime stack.

BYOC AI observability market map

Section

Competition

The space is crowded at the infrastructure layer but still fragmented at the vendor-support workflow layer. Most products optimize for application teams or centralized backends; fewer are built around exporting policy-safe evidence from many isolated customer tenants back to the vendor’s support organization.

Competitor	Stage	Wedge	Pricing	Strength	Weakness vs. us
Tsuga	scale-up	BYOC observability platform that keeps telemetry in the customer cloud and argues AI-era economics break legacy SaaS observability models.	Custom; company describes a per-GB model and Tech Funding News reports six-figure average contracts.	Strong narrative fit, multi-cloud positioning, and early proof that enterprises will pay premium budgets for private-cloud AI observability.	Built as a broader observability platform; the startup can stay narrower around vendor-safe support bundles and cross-tenant debugging workflows.
Langfuse	scale-up	Open-source LLM observability, evals, and prompt tooling with strong self-hosted controls.	Free to $2,499/month cloud enterprise; self-host supported.	Developer love, strong masking/retention/audit controls, and a credible self-host story.	Optimized for app-team observability inside one environment, not policy-aware incident relay across many isolated customer tenants.
LangSmith	scale-up	Integrated agent lifecycle platform spanning observability, evals, deployment, and secure sandboxes.	Free developer tier, $39 per seat Plus plan, and enterprise/custom for hybrid or self-hosted deployment.	Deep LangChain ecosystem fit plus credible self-host and Kubernetes operations story.	Best when the buyer wants a broad agent platform; less centered on vendor-to-customer evidence export across BYOC tenants.
Arize Phoenix / AX	scale-up	Enterprise AI evaluation and observability with private-connect deployment options and OTel alignment.	Phoenix OSS is free; AX adds paid and custom enterprise tiers with private-connect and compliance features.	Strong evaluation stack, private-connect support, and enterprise compliance posture.	Center of gravity is centralized AI engineering and evals rather than support-specific relay and export policy across customer-owned clouds.
Datadog Agent Observability	incumbent	Agent observability added onto a broad observability and security platform.	Module-based platform pricing plus quote-led adoption for AI observability use cases.	Huge installed base and adjacent tools such as observability pipelines and sensitive data scanning.	Still assumes a Datadog-centric backend and broad platform footprint, while the startup can focus on support-safe evidence flow inside customer-owned environments.

Why incumbents do not win by default

Cloud platforms. AWS, Azure, and Google Cloud now provide the private networking and data-boundary primitives a buyer expects, but they stop at infrastructure primitives and do not solve cross-tenant vendor support workflows.
Generic observability suites. Datadog-class platforms can unify AI and infrastructure telemetry, yet they still assume enough data can be centralized into the observability backend and paid for under a modular platform contract.
Open-source and self-hosted stacks. Self-hosted tools such as Langfuse and Helicone give local control, but buyers still have to assemble export policy, support bundles, and fleet-level comparison flows themselves.
Agent lifecycle platforms. LangSmith and Arize reach deeper into deployment, evals, and private connectivity, but their center of gravity is application-team engineering productivity rather than vendor-safe remote support across customer clouds.

Section

Business plan

BYOC agent support relay is a vendor-support infrastructure company for AI software vendors that deploy agents inside customer-owned cloud accounts. The beachhead is Series A-C vendors with 5-20 live BYOC tenants in regulated sectors where weekly escalations still depend on customer-run log pulls and ad hoc scripts. The first product is not a broad observability suite; it is an AWS-first in-tenant trace relay and policy engine that produces redacted incident bundles, release-regression diffs, and fleet health signals without exporting raw prompts or traces by default. That wedge matches the researched buying trigger of a new enterprise BYOC win, renewal, or bad release that overwhelms the vendor's reliability team. Pricing should start with a paid pilot and then convert to per-live-tenant annual subscriptions because value scales with isolated deployments, not seats. The sequencing is intentionally narrow: prove one support workflow, one cloud, and one export-approval model before expanding into multi-cloud coverage, SLA proof, or billing analytics. The strongest reasons to engage are clear pain, premium budget precedent, and a differentiated workflow relative to generic observability tools. The biggest disconfirming risk is that security teams refuse any outbound artifact egress or that the true count of vendors with enough live BYOC tenants is smaller than assumed. Research also does not resolve the dominant private-cloud agent frameworks or the first budget owner, so the first six months must treat connector selection and budget ownership as explicit tests.

Problem

AI vendors lose root-cause speed once their product runs inside customer VPCs and support teams depend on customer-run log pulls, dashboards, and SSH sessions to diagnose incidents.
Existing observability products either require sensitive telemetry to leave the tenant or price unsampled AI traces poorly, while manual workflows cannot compare incidents or regressions across isolated customer environments.

Solution

Deploy an AWS-first in-tenant collector and policy engine that captures full agent traces locally and exports only approved redacted incident bundles, regression diffs, and fleet health summaries.
Give vendor reliability teams a central cross-tenant support console for P1 triage, release comparison, and export approval without forcing the end customer to replace Datadog, Grafana, or CloudWatch.

Why we win

The product serves the vendor's support and release-debugging workflow rather than asking the end customer to buy another observability backend, which avoids a head-on platform replacement sale.
Reusable masking policies, export templates, and a growing corpus of redacted incident bundles and fixes across many tenants compound into faster diagnosis and a workflow moat.

Strategic choices
Beachhead	Series B AI software vendors selling copilots or agents into banks, insurers, publishers, and similar regulated enterprises through AWS BYOC deployments, with 5-20 live customer tenants and recurring Sev-1 support load.
Wedge rationale	This customer slice already feels the cost of manual evidence gathering, has a clear renewal-linked trigger, and can justify six-figure reliability spend sooner than broader AI vendors or end-enterprise platform teams.
Sequencing	Start with AWS plus OTel-aligned trace capture because the first goal is to prove faster incident resolution under real security review, then add release-regression and benchmarking workflows once pilot data is flowing, and only then expand cloud coverage and channels after deployment speed is repeatable.
Not yet	Selling a generic observability replacement or end-customer dashboard to enterprise platform teams. · Supporting one-off on-prem or air-gapped deployments before the AWS BYOC playbook is repeatable. · Expanding into SLA reporting, usage-billing evidence, or broad AI eval tooling before the incident-bundle wedge converts to production.

Go-to-market
Wedge	Sell a paid support-visibility pilot for the first 1-3 BYOC tenants immediately after a renewal, launch, or incident, then convert into a production subscription once the relay proves faster triage and safer release debugging.
Channels	Founder-led direct sales into CTOs, VPs of Engineering, and heads of customer reliability at AI vendors already selling BYOC. · Cloud-deployment consultancies and systems integrators already implementing private networking and security review for enterprise tenants. · Customer-success and reference-account introductions from existing BYOC enterprise customers that are already escalating support pain.
Funnel targets	Target account→qualified pilot 20-30%, qualified pilot→paid pilot 50%+, paid pilot→production 60%+, production account→5+ live tenants under relay within 9 months in 50%+ of accounts.
Pricing	Start with a $15k-$25k paid pilot for 60-90 days covering 1-3 live tenants, then convert to an annual subscription priced per live BYOC tenant with an $80k-$150k base ACV for vendors operating 5-20 tenants, plus premium charges for longer retention, protected environments, and higher analyzed run volume. This aligns price to the buyer's deployment complexity and support burden better than seat-based pricing.

Product roadmap
MVP	MVP covers AWS deployments first: OTel-aligned trace capture, local prompt and tool-call retention, redaction policy templates, human-approved incident bundle export, release-diff views, and a central fleet health console for one vendor across 5-20 tenants. It deliberately excludes multi-cloud, customer-facing dashboards, and broad model-evaluation workflows.
6 months	Ship a production-ready AWS collector, policy templates for masking and retention, export approvals with audit logs, and support-ticket packaging that gets the first actionable incident bundle out in under 30 days from kickoff.
12 months	Add release-regression benchmarking, benchmark snapshots across tenants, and the next most-requested cloud connector only after AWS deployments consistently reach first value with minimal custom work.
24 months	Expand from support relay into private-deployment SLA proof, upgrade validation, and usage-billing evidence while keeping policy-safe in-tenant computation as the architectural core.
Key bets	Buyers will value faster safe evidence export more than a broader observability dashboard in the first budget cycle. · An AWS-first deployment path can reach first value in four to six weeks with limited custom engineering. · Redacted incident bundles and regression fingerprints will become reusable assets that improve pilot-to-production conversion and future diagnosis.

Business model
Revenue streams	Annual subscription priced by live BYOC tenant count with platform minimums. · Premium retention, fleet benchmarking, and release-validation modules. · Security-review onboarding and deployment services sold as bounded packages.
Unit of value	Live BYOC tenant under relay policy, with upsell tied to retained trace days and analyzed agent runs.
Target gross margin	70%
Expansion levers	Add more live tenants and cloud environments within the same vendor account. · Expand from incident bundles into release validation, benchmarking, and SLA evidence. · Reuse approved policy templates to shorten rollout into new regulated customer logos.

Strategy map
North-star metric	Number of production live BYOC tenants automatically generating approved incident bundles and release diffs for the vendor support team.
Input metrics	Time from kickoff to first actionable incident bundle. · Qualified pilot to paid pilot conversion rate. · Paid pilot to production conversion rate. · Median MTTR improvement on incidents where the relay is used. · Expansion rate from first production tenant to five or more covered tenants in an account.
Moats to build	Policy-template library for masking, retention, and export approvals by regulated use case. · Redacted incident-bundle corpus linking trace patterns to resolutions across many isolated tenants. · AWS and OTel deployment playbooks that reduce security-review and onboarding friction.
Kill criteria	If fewer than 3 of the first 10 qualified prospects allow any outbound redacted bundle after security review, the relay export thesis is wrong. · If the first 3 deployments cannot reach an actionable incident bundle within 30 days and less than 2 engineer-weeks of custom work, repeatable GTM is broken. · If fewer than 2 paid pilots convert to "$80k+" production subscriptions by month 12, the ROI and pricing model are too weak.

Milestones

0–12 months

Package the AWS-first collector, policy templates, and incident-bundle export workflow.
Sign 6-8 design partners and convert at least 3 into paid pilots.
Put 2 customers into production at "$80k+" annualized value.
Prove first value in 30 days and keep customization below 2 engineer-weeks per pilot.

12–24 months

Add the next cloud connector only after AWS deployments are repeatable.
Grow to 12-15 production customers and expand multiple tenants inside the best accounts.
Launch release-regression benchmarking and policy-template reuse as standard upsells.
Establish partner-sourced pipeline as a meaningful share of qualified pilots.

24–36 months

Expand from support relay into SLA proof, upgrade validation, and billing-evidence workflows.
Build a recognized corpus of cross-tenant incident patterns and remediation templates.
Reach category credibility as the default vendor-support layer for private-cloud AI deployments without becoming a generic observability suite.

Strategy map

flowchart LR
  Wedge[BYOC support relay] --> MVP[AWS-first incident bundle MVP]
  MVP --> Proof[Faster triage and paid pilot conversion]
  Proof --> Expansion[More tenants plus release validation and SLA evidence]

Founding team

Role	Start timing	Rationale
Founder CEO	Month 0	Own founder-led sales, design-partner recruitment, and ICP discipline because the first deals require problem education and sharp qualification.
Founding eng	Month 0	Build the AWS collector, policy engine, export pipeline, and first fleet-health workflow needed for credible pilots.
Solutions engineer	Month 3	Compress deployment time, standardize security-review artifacts, and protect core engineering bandwidth as pilots multiply.
Security / policy product lead	Month 6	Turn pilot learnings into reusable masking templates, approval flows, and retention controls that procurement teams trust.
Partnerships lead	Month 9	Scale the deployment channel only after the AWS playbook is repeatable and partner enablement can be productized.

Experiment roadmap

Horizon	Experiment	Hypothesis	Success metric	Owner
0–90 days	Build a named target-account list and interview 25 prospects already selling BYOC into regulated enterprises.	At least 10 prospects have recurring support pain and 5 or more live or imminent BYOC tenants.	10 qualified accounts with tenant count, trigger event, and named economic buyer captured.	Founder CEO
0–90 days	Run structured security reviews around a sample export schema, masking policy, and audit-log design with 6 prospects.	A standard redacted-bundle template will be acceptable in at least half of qualified accounts.	3 prospects approve a concrete outbound artifact schema for pilot use.	Founder product
0–90 days	Ship the first AWS collector and incident-bundle prototype on top of OTel-aligned traces.	One design partner can see a useful root-cause packet inside 30 days without replacing existing observability tooling.	1 design partner receives an actionable incident bundle from a live or replayed workflow within 30 days.	Founding eng
3–6 months	Convert 3 design partners into paid pilots with explicit MTTR and deployment-speed success criteria.	Buyers will pay for support visibility before the full release-benchmarking roadmap is built.	3 paid pilots signed at "$15k+" each with agreed conversion criteria.	Founder CEO
6–12 months	Launch release-regression diffs and convert the first 2 paid pilots into production subscriptions.	Release debugging plus incident bundles creates a stronger ROI case than incident relay alone.	2 production customers at "$80k+" ACV and documented MTTR or escalation-load improvement.	Engineering lead
6–12 months	Recruit 2 cloud-deployment partners and test whether they can deliver the packaged AWS playbook.	Partner-led installs can reduce founder time without increasing customization.	2 signed partners and 1 partner-assisted deployment reaching first value in under 45 days.	Head of partnerships

Risk assessment

Business plan risks — 5 mapped

Impact →

High

R2 R3

Medium

R4 R5

Low

Medium

High

Likelihood →

R1Security teams may reject any outbound artifact egress even after masking and approval controls. · Highlikelihood / Highimpact — Qualify hard on export policy early, standardize the approval package, and maintain an in-tenant-only fallback product direction.
R2The number of vendors with enough live BYOC tenants may be smaller than research estimates. · Mediumlikelihood / Highimpact — Build a named pipeline before scaling spend and expand into adjacent private-deployment software vendors only after the initial ICP is measured.
R3Deployment complexity may force heavy custom work and reduce gross margin. · Mediumlikelihood / Highimpact — Stay AWS-first, reject edge-case installs early, and hire solutions talent before adding more clouds or features.
R4Incumbent observability suites or self-hosted tools may add policy-aware export features quickly. · Mediumlikelihood / Mediumimpact — Differentiate on vendor-support workflow depth, cross-tenant comparisons, and faster production deployment rather than broader telemetry breadth.
R5Buyers may treat the product as a nice-to-have debugging aid instead of a budgeted reliability layer. · Mediumlikelihood / Mediumimpact — Sell only against explicit trigger events and require quantified MTTR or escalation-load goals in every pilot.

Risk	Likelihood	Impact	Mitigation
Security teams may reject any outbound artifact egress even after masking and approval controls.	High	High	Qualify hard on export policy early, standardize the approval package, and maintain an in-tenant-only fallback product direction.
The number of vendors with enough live BYOC tenants may be smaller than research estimates.	Medium	High	Build a named pipeline before scaling spend and expand into adjacent private-deployment software vendors only after the initial ICP is measured.
Deployment complexity may force heavy custom work and reduce gross margin.	Medium	High	Stay AWS-first, reject edge-case installs early, and hire solutions talent before adding more clouds or features.
Incumbent observability suites or self-hosted tools may add policy-aware export features quickly.	Medium	Medium	Differentiate on vendor-support workflow depth, cross-tenant comparisons, and faster production deployment rather than broader telemetry breadth.
Buyers may treat the product as a nice-to-have debugging aid instead of a budgeted reliability layer.	Medium	Medium	Sell only against explicit trigger events and require quantified MTTR or escalation-load goals in every pilot.

First customer
Title	Head of Customer Reliability at a Series B AI software vendor
Profile	A 300-800 employee vendor with 5-20 AWS BYOC tenants in banking, insurance, or publishing, a 3-8 person reliability team, and weekly escalations that currently require customer-run log pulls.
Trigger	A new regulated-enterprise launch, renewal, or bad release exposes that the support team cannot debug isolated tenants fast enough with current tooling.
Buyer	VP Engineering or CTO
Initial contract	$15k-$25k paid pilot for 1-3 tenants converting to an $80k-$150k annual subscription once 5-20 live tenants are under policy, with upsell for retention and release-validation modules.

What must be true

At least one target segment has enough vendors with 5 or more live BYOC tenants to support efficient founder-led pipeline generation now.
Security teams will approve a predefined redacted export schema often enough that the relay can operate without full manual bespoke review every time.
The relay reduces time to first actionable root-cause packet and overall MTTR enough that buyers fund it from an existing reliability or support budget.
An AWS-first deployment can be packaged tightly enough that pilots do not become custom services projects.
Incumbent observability and self-hosted tooling do not solve cross-tenant vendor support workflows well enough to erase the wedge before expansion.

Open diligence questions

How many live BYOC tenants do the best initial prospects actually run today, not merely pilot?
What exact fields and approvals must be present before a security team allows outbound incident bundles?
Which cloud and agent-framework combinations dominate the first 20 prospects and therefore define the connector roadmap?
Where did comparable buyers fund adjacent tools: reliability, support operations, or security architecture?
What measurable MTTR delta would make a prospect convert from pilot to a six-figure annual contract?

Investor verdict
Call	Meet / investigate further
Conviction	Strong problem signal and credible budget precedent, but conviction stays moderate until security-review pass rates and real tenant counts are proven.
Why believe	The company attacks a specific and painful support workflow where BYOC deployment, AI telemetry volume, and enterprise security policy now intersect.
Why doubt	The beachhead may be narrower than it appears and some customers may refuse any artifact egress, which would force a different product and pricing model.
Next diligence	Validate with 8-10 target vendors that at least two will approve a paid pilot with outbound redacted bundles inside one budget cycle.

Section

Financial model

3-year totals
Year 1 revenue	$87K EBITDA $-883K · Cash EOP $2.12M
Year 2 revenue	$1.10M EBITDA $-832K · Cash EOP $1.29M
Year 3 revenue	$2.74M EBITDA $-143K · Cash EOP $1.14M

Unit economics
ARPU (annual)	$130K
Gross margin	70%
CAC	$55K Payback 7.3 months
LTV / CAC	7.7x LTV $422K

Funding ask
Round	pre-seed · $3.0M
Runway	30 months
Milestone	Reach 14-20 production customers, prove partner-assisted AWS deployments and release-regression upsell, and start a seed process with six months of cash buffer.

Model sanity

Revenue engine. The base case is driven by growing from 2 production vendors at Y1 end to 28 by Q4Y3 at a $130K blended ACV, with most value coming from repeatable AWS deployments and tenant expansion.
Must go right. Security reviews have to approve a standard redacted-bundle workflow often enough that partner-assisted pilots can convert into 14 production customers by Q4Y2.
Model breaks if. If sales cycles stretch toward 8-9 months or gross margin stalls below 67%, the downside case keeps EBITDA deeply negative and shrinks the cash floor to roughly $0.6M.
Next-round proof. A seed-ready story appears once the company can show 14-20 production customers, partner-assisted installs, and release-regression upsell without losing the path to 70% gross margin.

Revenue, cash, and EBITDA — 12-month Y1 + 8-quarter Y2/Y3

Revenue (line, area)
Cash EOP (dashed)
EBITDA (bars, gray = loss)

Use of funds — $3.0M pre-seed

Headcount build by role — peak10 FTE

Founder / CEO
Engineering
Solutions engineering
Security / policy product
Partnerships / sales
G&A / Ops

Year-3 scenarios — base / downside / upside

	Y3 revenue	Y3 EBITDA	Cash low point	Description
Downside	$2.31M	-$491K	$620K	Security approvals, partner activation, and production conversions slip by roughly two quarters while pricing stays closer to the low end of the business-plan range.
Base	$2.74M	-$143K	$1.09M	AWS-first deployments become repeatable, the first partner channel contributes in Y2, and premium retention / protected-environment upsells modestly lift blended ACV.
Upside	$3.47M	$434K	$1.49M	Pilot conversion and partner referrals improve, and release-regression plus retention modules attach earlier without requiring a much larger team.

Sensitivity — Y3 cash and revenue impact, sorted by magnitude

Variable	Downside	Upside	Cash impact	Revenue impact
CAC	$70K fully loaded CAC	$45K fully loaded CAC	-$360K	$0K
sales cycle	8-9 months from pilot kickoff to production	4 months	-$330K	-$431K
hiring pace	Pull forward one extra engineer and one GTM hire into H2Y2	Delay one noncritical hire until after seed proof	-$230K	$0K
churn	2.5% monthly churn	1.2% monthly churn	-$170K	-$240K
ARPU	$120K annual subscription value per customer	$135K annual subscription value per customer	-$148K	-$211K
gross margin	67% steady-state gross margin	72% steady-state gross margin	-$82K	$0K

Scenarios

Scenario	Y3 revenue	Y3 EBITDA	Cash low point	Description	Key changes
Downside	$2.31M	$-491K	$620K	Security approvals, partner activation, and production conversions slip by roughly two quarters while pricing stays closer to the low end of the business-plan range.	Q4Y3 customersEop reaches 24 instead of 28. Blended ACV stays at $120K instead of $130K because premium retention and protected-environment upsells land later. Gross margin tops out at 68% because deployments remain more services-heavy than planned.
Base	$2.74M	$-143K	$1.09M	AWS-first deployments become repeatable, the first partner channel contributes in Y2, and premium retention / protected-environment upsells modestly lift blended ACV.	Q4Y3 customersEop reaches 28 with $130K blended ACV. Gross margin ramps from 65% in Y1 to the 70% target in Y3. Headcount stays lean at 10 FTE by Q4Y3 while partner-assisted installs start to reduce founder bottlenecks.
Upside	$3.47M	$434K	$1.49M	Pilot conversion and partner referrals improve, and release-regression plus retention modules attach earlier without requiring a much larger team.	Q4Y3 customersEop reaches 32 instead of 28. Blended ACV rises to $135K as premium modules attach to more production accounts. Gross margin reaches 72% as policy templates and deployment playbooks reduce custom work.

Sensitivity

Variable	Downside	Base	Upside
ARPU	$120K annual subscription value per customer	$130K annual subscription value per customer	$135K annual subscription value per customer
CAC	$70K fully loaded CAC	$55K fully loaded CAC	$45K fully loaded CAC
churn	2.5% monthly churn	1.8% monthly churn	1.2% monthly churn
sales cycle	8-9 months from pilot kickoff to production	5-6 months	4 months
gross margin	67% steady-state gross margin	70% steady-state gross margin	72% steady-state gross margin
hiring pace	Pull forward one extra engineer and one GTM hire into H2Y2	Lean ramp to 10 FTE by Q4Y3	Delay one noncritical hire until after seed proof

Key assumptions (20)

ID	Name	Value	Unit	Source
A1	Model start month	2026-07	month	[BP date] First full month after the 2026-06-24 business-plan date.
A2	Opening cash / pre-seed raise	$3.0M	usdM	[BP fundingAsk] The business plan targets a $3-5M pre-seed; the model uses the bottom of that range to fund the AWS-first build, early design partners, and a six-month cash buffer before seed proof.
A3	Revenue recognition basis	Only annual production subscriptions are recognized in core revenue; paid pilots and security-onboarding packages are excluded from the base P&L.	policy	[BP gtm.wedge; BP gtm.pricing; BP businessModel.revenueStreams] This keeps the model conservative and makes revenue reconcile cleanly to production customers × ARPU.
A4	Blended annual production-customer ARPU	$130,000 per customer-year	usd_per_customer_year	[BP gtm.pricing; research.bottomUpSizingDrivers] Research implies about $120K annual spend per vendor, while the business plan allows $80K-$150K base ACV plus premium retention and protected-environment charges, so the model uses a modest upsell-blended $130K.
A5	Year 1 production-customer ramp	M1-M12 customersEop = 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2	customers	[BP milestones 0-12 months; BP experimentRoadmap] This matches three paid pilots converting into two production subscriptions by year-end.
A6	Year 2 and Year 3 production-customer ramp	M13-M36 customersEop = 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 22, 23, 24, 25, 26, 28	customers	[BP milestones 12-24 months; research.market.som; BP gtm.funnelTargets] The ramp reaches 14 production customers by month 24 and 28 by year 3, which is ambitious but still below half of the 60-customer year-3 SOM from research.
A7	Gross-margin ramp	65% in Y1, 68% in Y2, 70% in Y3	percent	[BP businessModel.targetGrossMarginPct; BP operatingAssumptions] The business plan targets 70% gross margin, but early installs are modeled below target until deployment work stays under the stated two engineer-weeks per customer.
A8	Founder / CEO loaded cash compensation	$120,000	usd_per_fte_year	Startup-finance heuristic for a below-market founder salary at pre-seed, consistent with BP team showing founder-led GTM from Month 0.
A9	Engineering loaded cash compensation	$165,000	usd_per_fte_year	Startup-finance heuristic for cloud / telemetry engineers building AWS collectors, policy controls, and release-regression workflows for an enterprise infrastructure startup [BP product; BP team].
A10	Solutions engineering loaded cash compensation	$145,000	usd_per_fte_year	Startup-finance heuristic for a solutions engineer who compresses deployment and security-review work across BYOC tenants [BP team].
A11	Security / policy product loaded cash compensation	$160,000	usd_per_fte_year	Startup-finance heuristic for a product/security lead translating masking, retention, and export policies into repeatable product templates [BP team; research.regulatoryTechnicalConstraints].
A12	Partnerships / sales loaded cash compensation	$150,000	usd_per_fte_year	Startup-finance heuristic for one partner-led commercial hire added only after the AWS playbook is repeatable, matching BP team sequencing.
A13	G&A / ops loaded cash compensation	$110,000	usd_per_fte_year	Startup-finance heuristic for a lean finance / operations generalist added when customer count and contracting load rise in Y2.
A14	Headcount ramp snapshots	Founder 1/1/1/1/1/1; engineering 1/1/2/2/3/3; solutions 0/1/1/1/2/2; security 0/0/1/1/1/1; partnerships 0/0/0/1/1/2; G&A 0/0/0/0/1/1 across q1y1/q2y1/q3y1/q4y1/q4y2/q4y3	fte	[BP team; BP strategicChoices.sequencingRationale] The model follows the business-plan order: build the AWS-first product, standardize deployment, then add channel leverage and back-office support.
A15	Payroll smoothing in Y2 and Y3	Quarterly salary expense uses the most recent hiring ramp instead of stepping only on the year-end snapshots.	method	[Financial Modeler instructions] This keeps the salary line internally consistent with the six-column headcount schema.
A16	Non-payroll operating budget	Y1 monthly S&M $8K-$16K, R&D $9K-$13K, G&A $5K-$8K; Y2 quarterly S&M $42K-$60K, R&D $30K-$39K, G&A $18K-$27K; Y3 quarterly S&M $66K-$84K, R&D $42K-$51K, G&A $30K-$39K	usdK	[BP operations; BP fundingAsk.useOfFundsSummary; research.reportMemo.distributionChannels; research.regulatoryTechnicalConstraints] These budgets cover cloud costs, travel, security review materials, audit workflows, and founder-led enterprise selling without assuming a large field team.
A17	Fully loaded CAC	$55,000 per net production customer	usd_per_customer	[BP gtm.channels; BP gtm.funnelTargets] Derived startup-finance heuristic for founder-led enterprise sales, pilot travel, partner enablement, and a measured first commercial hire.
A18	Monthly churn for unit economics	1.8%	percent	[BP risks; research.categoryDynamics.headwinds] This heuristic assumes enterprise accounts are sticky once deployed, but still reflects renewal and security-friction risk for an early workflow product.
A19	Cash roll-forward convention	Ending cash equals opening cash plus EBITDA; debt, capex, taxes, and working-capital timing are not modeled separately.	policy	Startup-finance heuristic for an asset-light software company where operating burn is the dominant cash driver.
A20	Funding objective	Reach 14-20 production customers, prove at least one partner-assisted deployment path, and enter a seed process with six months of buffer.	goal	[BP milestones 12-24 months; BP fundingAsk; BP experimentRoadmap] This is the next financing proof implied by the plan once AWS deployment is repeatable and release-regression upsells begin.

unit economics flow

flowchart LR
  Leads[Qualified BYOC vendors] --> Pilots[Paid pilots]
  CACSpend[CAC spend] --> Pilots
  Pilots --> Customers[Production customers]
  Customers --> Tenants[More live tenants]
  Tenants --> Revenue[Annual subscription revenue]
  Revenue --> GrossProfit[Gross profit]
  GrossProfit --> EBITDA[EBITDA]
  EBITDA --> Cash[Ending cash]
  Churn[Churn and security friction] --> Customers

Flags: The model assumes enough target accounts will approve outbound redacted incident bundles to reach 14 production customers by month 24, but research still leaves budget ownership and export-policy pass rates unresolved. · Gross margin only reaches the 70% business-plan target if deployment work stays near the plan threshold of under two engineer-weeks per customer; otherwise solutions work will push the funding need higher. · Revenue excludes paid pilots and onboarding fees, which keeps ARR conservative but means the cash path may diverge if the company leans harder into services to win early deals.

Section

Top risks

Slow security reviews. Each customer tenant may require a separate security and procurement review before the relay can be installed. Mitigation: Start with vendors already selling BYOC into regulated accounts and provide opinionated reference architectures, redaction defaults, and audit artifacts.
Incumbent observability overlap. Customers may argue their existing Datadog, Grafana, or CloudWatch stack should already solve the problem. Mitigation: Position the product as vendor-side support infrastructure that works on top of customer-owned tools and focuses on cross-tenant evidence packaging rather than replacing the stack.
Beachhead too narrow. The number of vendors with enough live BYOC tenants could be smaller than expected in the first year. Mitigation: Land in BYOC support first, then expand to release validation, SLA proof, and billing evidence for the broader private-deployment software market.

Section

Evidence

Cited sources (40)

Tsuga. Tsuga Raises $35 Million Series A | Tsuga · https://www.tsuga.com/resources/blog/pr/tsuga-raises-35-million-series-a
Tech Funding News. They sold their startup to Datadog. Now, they've raised $35M to disrupt observability · https://techfundingnews.com/ex-datadog-founders-raise-35m-tsuga-observability/
Tsuga. Resilient Observability | Tsuga · https://www.tsuga.com/solutions/resilient-observability
Tsuga. Agent-Native Observability | Tsuga · https://www.tsuga.com/solutions/agent-native-observability
Tsuga. The new standard for observability. | Tsuga · https://www.tsuga.com/product/overview
Langfuse. Masking - Langfuse · https://langfuse.com/docs/observability/features/masking
Langfuse. Data Retention - Langfuse · https://langfuse.com/docs/administration/data-retention
Langfuse. Audit Logs - Langfuse · https://langfuse.com/docs/administration/audit-logs
Langfuse. Pricing - Langfuse · https://langfuse.com/pricing
LangChain. LangSmith: AI Agent & LLM Observability Platform · https://www.langchain.com/langsmith/observability
LangChain. LangSmith: Agent Deployment Infrastructure for Production AI Agents · https://www.langchain.com/langsmith/deployment
LangChain. Mission Control for Self-Hosted LangSmith on Kubernetes · https://www.langchain.com/blog/mission-control-operating-self-hosted-langsmith-on-kubernetes
LangChain. Introducing End-to-End OpenTelemetry Support in LangSmith · https://www.langchain.com/blog/end-to-end-opentelemetry-langsmith
LangChain. How LangSmith and LangChain OSS Help You Meet EU AI Act Requirements · https://www.langchain.com/blog/langsmith-langchain-oss-eu-ai-act
LangChain. LangSmith Plans and Pricing · https://www.langchain.com/pricing
Arize AI. Agent Observability and Tracing · https://arize.com/ai-agents/agent-observability/
Arize AI. Arize Private Connect - Arize AX Docs · https://arize.com/docs/ax/security-and-settings/arize-private-connect
Arize AI. Arize Audit Log - Arize AX Docs · https://arize.com/docs/ax/security-and-settings/compliance/arize-audit-log
Arize AI. The Role of OpenTelemetry (OTEL) in LLM Observability · https://arize.com/blog/the-role-of-opentelemetry-in-llm-observability/
Arize AI. Pricing · https://arize.com/pricing/
groundcover. Bring Your Own Cloud Observability | groundcover BYOC · https://www.groundcover.com/byoc
groundcover. On-Prem & Air-Gapped Solutions with groundcover · https://www.groundcover.com/onprem-and-airgapped
groundcover. groundcover Pricing Plans: Free, Team & Enterprise Plans · https://www.groundcover.com/pricing
groundcover. BYOC Architecture Tradeoffs and Real World Lessons · https://www.groundcover.com/blog/byoc-in-practice-architectures-tradeoffs-lessons
Datadog. Agent Observability | LLM Observability | Datadog · https://www.datadoghq.com/products/ai/agent-observability/
Datadog. Pricing | Datadog · https://www.datadoghq.com/pricing/
Datadog. Observability Pipelines · https://docs.datadoghq.com/observability_pipelines/
Datadog. Sensitive Data Scanner · https://docs.datadoghq.com/security/sensitive_data_scanner/
AWS. VPC Networking - AWS PrivateLink - AWS · https://aws.amazon.com/privatelink/
AWS. Use interface VPC endpoints (AWS PrivateLink) to create a private connection between your VPC and Amazon Bedrock - Amazon Bedrock · https://docs.aws.amazon.com/bedrock/latest/userguide/vpc-interface-endpoints.html
Microsoft. How to configure network isolation for Microsoft Foundry - Microsoft Foundry · https://learn.microsoft.com/en-us/azure/foundry/how-to/configure-private-link
Microsoft. Governance and security for AI agents across the organization - Cloud Adoption Framework · https://learn.microsoft.com/en-us/azure/cloud-adoption-framework/ai-agents/governance-security-across-organization
Microsoft. Data, privacy, and security for Foundry Models sold by Azure in Microsoft Foundry - Microsoft Foundry · https://learn.microsoft.com/en-us/azure/foundry/responsible-ai/openai/data-privacy
Google Cloud. VPC Service Controls · https://cloud.google.com/security/vpc-service-controls
Google Cloud. Gemini Enterprise Agent Platform and zero data retention | Google Cloud Documentation · https://docs.cloud.google.com/gemini-enterprise-agent-platform/resources/zero-data-retention
NIST. AI Risk Management Framework · https://www.nist.gov/itl/ai-risk-management-framework
OWASP. Home · https://genai.owasp.org/
EUR-Lex. Regulation - EU - 2024/1689 - EN - EUR-Lex · https://eur-lex.europa.eu/eli/reg/2024/1689/oj/eng
OpenTelemetry. Moved: Generative AI semantic conventions · https://opentelemetry.io/docs/specs/semconv/gen-ai/
Helicone. Self-Hosting Helicone - Helicone OSS LLM Observability · https://docs.helicone.ai/getting-started/self-host/overview

Why now

The idea

Jobs to be done

Market

Executive takeaways

Market definition

Customer and buyer

Buying triggers

Willingness to pay

Category dynamics

Tailwinds

Headwinds

Validation signals

Regulatory & technical constraints

Competition

Why incumbents do not win by default

Business plan

Problem

Solution

Why we win

Milestones

Founding team

Experiment roadmap

Risk assessment

What must be true

Open diligence questions

Financial model

Model sanity

Scenarios

Sensitivity

Top risks

Evidence

Cited sources (40)

Related dossiers

Knowledge expiry gate that quarantines stale docs before support and employee AI agents answer from them.

Control plane that shadow-tests email and CRM permissions before support agents can act on customer conversations.

Continuity compiler for code-migration agents that shadow-tests fallback models before export controls strand global releases.