Policy-safe trace relay for AI vendors in customer VPCs, exporting redacted support evidence without raw-data exfiltration.
AI application vendors increasingly win enterprise deals only if they deploy inside the customer's own cloud account, but that deployment model blinds the vendor once production incidents start. Support teams need raw traces, prompt lineage, and agent-call context to debug failures, yet security teams will not allow unrestricted telemetry exfiltration.
Why now
- AI telemetry volumes are making legacy per-byte observability pricing structurally uneconomic for agent workloads.
- Enterprise buyers now require telemetry and prompts to stay inside their own cloud accounts, creating a new tooling gap for vendors.
- Debugging modern agents now requires prompt-level traces, confidence scores, and call graphs, not just logs and metrics.
- Named customers and six-figure contracts prove there is already budget for premium BYOC AI reliability infrastructure.
Catalyst. Tsuga's traction shows enterprises now insist on in-cloud AI observability because AI traces are both too sensitive and too voluminous for legacy SaaS observability patterns.
The idea
The product deploys a lightweight collector and policy engine into each customer tenant where the AI vendor already runs its application. It records complete agent traces, prompt lineage, confidence scores, and call graphs locally, then automatically generates redacted incident bundles, regression diffs, and benchmark snapshots that the vendor can access centrally. Customers control what fields can leave the account, how long evidence is retained, and whether replay artifacts are exportable. Vendors get cross-tenant fleet health, release-quality comparisons, and much faster incident resolution without asking customers to open broad observability access.
What's different. Traditional observability vendors want the customer to buy another backend; this startup sells the vendor a support-specific relay purpose-built for fleets of isolated customer-cloud deployments. The core IP is policy-aware evidence packaging: full traces stay local, while only approved summaries, diffs, and redacted artifacts leave the tenant. That makes the product useful even when the customer already has Datadog, Grafana, or CloudWatch, because it solves the vendor's cross-tenant debugging and release-support problem rather than replacing the customer's tooling.
| Beachhead | Remote support and release debugging for Series A-C AI application vendors that deploy copilots or agents into customer-owned AWS accounts at banks, insurers, publishers, and other data-sensitive enterprises |
|---|---|
| Wedge | An in-tenant trace relay that captures full agent execution data, applies customer-defined redaction and export policy, and emits vendor-safe support bundles plus fleet-level health signals |
| Non-obvious insight | BYOC changes observability from a buyer-owned dashboard problem into a vendor-support infrastructure problem. The winning product is not another Datadog clone; it is a policy engine that computes full-fidelity support evidence inside each tenant cloud and exports only the minimum approved data needed to diagnose, compare, and fix incidents across a fleet of isolated customer environments. |
| Venture-scale path | Start with P1 support and release-debugging workflows, then expand into private-deployment SLA proof, upgrade validation, fleet benchmarking, usage-based billing evidence, and eventually the remote-operations control plane for all BYOC AI software vendors. |
| Primary user | Head of Customer Reliability or AI Platform at a Series A-C software vendor shipping BYOC agent deployments into regulated enterprises |
|---|---|
| Secondary user | Solutions architects and escalation engineers supporting private-cloud customer tenants |
| Economic buyer | VP Engineering or CTO |
| First customer | Series B AI software vendor with 5-20 live BYOC tenants in banking, insurance, or publishing, a 3-8 person reliability team, and weekly production escalations that currently require customer-run log pulls |
|---|---|
| Buying trigger | A recent enterprise win or renewal that requires private-cloud deployment and creates more support escalations than the vendor can handle manually |
| Current alternative | Ad hoc SSH and log-pull workflows, customer-run Datadog or Grafana dashboards, and one-off support scripts maintained by solutions engineers |
| Switching reason | The relay cuts MTTR without forcing raw prompt or trace data out of the tenant, while giving the vendor a central view across all private deployments that manual workflows and customer-owned dashboards cannot provide. |
| Pricing hypothesis | Per live BYOC tenant per month with premium tiers based on retained trace days, protected environments, and number of agent runs analyzed |
Jobs to be done
| Job | Current alternative | Success metric |
|---|---|---|
| When a BYOC customer reports an agent failure, help the vendor reliability lead collect usable evidence without violating the customer's data-boundary rules, so they can resolve the incident quickly. | Manual log exports coordinated over support tickets and shared dashboards | Time to first actionable root-cause packet and MTTR for severity-one incidents |
| When a vendor ships a new agent release across isolated customer tenants, help the platform team compare behavior safely, so they can catch regressions before renewals or escalations. | Limited sampling from customer dashboards and ad hoc release spreadsheets | Regression-detection rate and reduction in customer-escalated release incidents |
flowchart LR Buyer[BYOC AI vendor] --> Pain[Blind support inside customer clouds] Pain --> Product[In-tenant trace relay and policy engine] Product --> Outcome[Faster MTTR with safe cross-tenant insight]
- Signal · 5/5The cluster shows strong demand, fresh funding, concrete pricing pain, and named customers already buying the category.
- Pain · 4/5BYOC support blindness creates acute operational pain for vendors, though the first buyers are still a narrower subset of enterprise AI vendors.
- Wedge · 5/5The first workflow is specific and painful: export-safe incident evidence and release diffs for private-cloud AI deployments.
- Defense · 4/5Redaction policy, tenant-side collectors, and cross-tenant evidence packaging can compound into a workflow moat, though large observability vendors may eventually move nearby.
- Scale · 4/5The beachhead is narrow, but it can expand into the default operations layer for BYOC AI software fleets across support, upgrades, and commercial analytics.
- AWS and private-cloud deployment partners
- AI application vendors with BYOC rollouts
- Systems integrators serving regulated enterprises
- Building connectors for agent frameworks and cloud runtimes
- Maintaining redaction, export-policy, and evidence-packaging logic
- Operating fleet-health analytics and release-diff workflows
- In-tenant collectors and policy engine
- Cross-tenant fleet analytics backend
- Security and compliance reference architectures
- Debug private-cloud AI incidents without raw-data exfiltration
- Give vendors cross-tenant fleet visibility while preserving customer control
- Reduce support load and speed enterprise renewals for BYOC products
- High-touch implementation with shared security review
- Ongoing reliability reviews tied to release quality and MTTR
- Direct founder-led sales to CTOs and VP Engineering leaders at AI vendors
- Partnerships with cloud deployment consultancies and reference-architecture partners
- Customer-success introductions from existing BYOC enterprise accounts
- Series A-C AI software vendors selling BYOC deployments into regulated enterprises
- Platform and reliability teams supporting private-cloud customer tenants
- Cloud infrastructure for central analytics
- Deployment engineering and customer support
- Security, compliance, and connector maintenance
- Subscription per live BYOC tenant
- Premium analytics and retention tiers
- Enterprise onboarding and security-review services
Market
| TAM | $300.0M Estimate: 2,500 global AI vendors that could face BYOC/private-cloud support pain × roughly $120k annual spend per vendor, anchored by Tsuga’s six-figure average contracts and enterprise packaging across Langfuse, LangSmith, and groundcover. |
|---|---|
| SAM | $84.0M Estimate: 700 current beachhead vendors in the narrower Series A-C + regulated-enterprise BYOC segment × roughly $120k annual spend, constrained by private-networking and governance requirements. |
| SOM | $7.2M Estimate: 60 reachable customers by year 3 × roughly $120k annual spend, assuming concentrated founder-led sales into vendors already carrying multiple private-cloud tenants and recurring incident load. |
Executive takeaways
- The wedge is real but narrow: the pain is not generic monitoring, it is vendor-side support for AI products running inside customer-controlled clouds.
- Private connectivity, masking, retention controls, and auditability are table stakes for this category, not premium add-ons.
- Budget exists when the pain is acute, but the earliest spend is concentrated in AI vendors with multiple live BYOC tenants and recurring Sev-1 support load.
- Go-to-market risk is more about security-review friction and install complexity than about whether the underlying telemetry stack can be built.
Market definition
A vendor-support and reliability layer for AI software companies that deploy inside customer clouds, sitting between generic observability tooling and managed support operations.
Customer and buyer
Economic buyers are usually CTOs or VPs of Engineering facing enterprise BYOC renewals, while day-to-day champions are reliability, platform, and senior solutions engineers who currently coordinate customer-run log pulls and dashboard access.
Buying triggers
- A new or renewed enterprise account requires private-cloud deployment, making direct vendor access impossible without a support-safe relay. [2][29][31]
- AI telemetry bills or sampling gaps make legacy SaaS observability economically and operationally painful for agent workloads. [2][3][21][26]
- A bad release or repeated incident across isolated tenants exposes the cost of manual evidence gathering and weak fleet comparison. [4][11][24]
- Governance work forces automatic logging, masking, and audit trails into the production checklist before more regulated customers can go live. [8][18][32][36][38]
Willingness to pay
Premium spend is plausible when pain is acute: Tech Funding News reports Tsuga at several million ARR with six-figure average contracts, while Langfuse, LangSmith, and groundcover show that paid tiers concentrate around retention, auditability, support, and enterprise controls rather than raw tracing alone. [2][9][15][23]
Category dynamics
Tailwinds
- AI agent loops generate far more telemetry and make legacy sampling compromises harder to accept.
- Private connectivity and data-boundary controls are now standard cloud features, reducing feasibility risk for BYOC deployment patterns.
- OTel-based AI observability ecosystems are maturing quickly, lowering integration risk for new entrants.
Headwinds
- Each tenant install can trigger heavy security and procurement work, slowing expansion even when pain is obvious.
- Buyers can often good-enough the problem with existing dashboards plus manual support processes.
- Zero-retention promises are nuanced because some model-provider features still retain or monitor prompts.
Validation signals
- Tsuga already reports several million dollars of ARR, six-figure average contracts, and named customers in this architectural neighborhood.
- Adjacent observability vendors now package BYOC, on-prem, and air-gapped deployment models as first-class offers.
- Major clouds have normalized private connectivity and tighter data-boundary controls for AI services, lowering technical feasibility risk.
- The telemetry layer is standardizing around OTel-style semantics, which reduces connector risk for a new vendor.
Regulatory & technical constraints
- Private networking and no-public-egress patterns are often mandatory for accessing models and moving telemetry in regulated environments.
- Prompt and log masking, deletion, and auditability must be first-class product features, not post-processing steps.
- Zero-retention claims are cloud- and feature-specific; some provider workflows still retain prompts or responses for monitoring.
- EU AI Act-style requirements push automatic event logging, post-market monitoring, and incident evidence deeper into the runtime stack.
Competition
The space is crowded at the infrastructure layer but still fragmented at the vendor-support workflow layer. Most products optimize for application teams or centralized backends; fewer are built around exporting policy-safe evidence from many isolated customer tenants back to the vendor’s support organization.
| Competitor | Stage | Wedge | Pricing | Strength | Weakness vs. us |
|---|---|---|---|---|---|
| Tsuga | scale-up | BYOC observability platform that keeps telemetry in the customer cloud and argues AI-era economics break legacy SaaS observability models. | Custom; company describes a per-GB model and Tech Funding News reports six-figure average contracts. | Strong narrative fit, multi-cloud positioning, and early proof that enterprises will pay premium budgets for private-cloud AI observability. | Built as a broader observability platform; the startup can stay narrower around vendor-safe support bundles and cross-tenant debugging workflows. |
| Langfuse | scale-up | Open-source LLM observability, evals, and prompt tooling with strong self-hosted controls. | Free to $2,499/month cloud enterprise; self-host supported. | Developer love, strong masking/retention/audit controls, and a credible self-host story. | Optimized for app-team observability inside one environment, not policy-aware incident relay across many isolated customer tenants. |
| LangSmith | scale-up | Integrated agent lifecycle platform spanning observability, evals, deployment, and secure sandboxes. | Free developer tier, $39 per seat Plus plan, and enterprise/custom for hybrid or self-hosted deployment. | Deep LangChain ecosystem fit plus credible self-host and Kubernetes operations story. | Best when the buyer wants a broad agent platform; less centered on vendor-to-customer evidence export across BYOC tenants. |
| Arize Phoenix / AX | scale-up | Enterprise AI evaluation and observability with private-connect deployment options and OTel alignment. | Phoenix OSS is free; AX adds paid and custom enterprise tiers with private-connect and compliance features. | Strong evaluation stack, private-connect support, and enterprise compliance posture. | Center of gravity is centralized AI engineering and evals rather than support-specific relay and export policy across customer-owned clouds. |
| Datadog Agent Observability | incumbent | Agent observability added onto a broad observability and security platform. | Module-based platform pricing plus quote-led adoption for AI observability use cases. | Huge installed base and adjacent tools such as observability pipelines and sensitive data scanning. | Still assumes a Datadog-centric backend and broad platform footprint, while the startup can focus on support-safe evidence flow inside customer-owned environments. |
Why incumbents do not win by default
- Cloud platforms. AWS, Azure, and Google Cloud now provide the private networking and data-boundary primitives a buyer expects, but they stop at infrastructure primitives and do not solve cross-tenant vendor support workflows.
- Generic observability suites. Datadog-class platforms can unify AI and infrastructure telemetry, yet they still assume enough data can be centralized into the observability backend and paid for under a modular platform contract.
- Open-source and self-hosted stacks. Self-hosted tools such as Langfuse and Helicone give local control, but buyers still have to assemble export policy, support bundles, and fleet-level comparison flows themselves.
- Agent lifecycle platforms. LangSmith and Arize reach deeper into deployment, evals, and private connectivity, but their center of gravity is application-team engineering productivity rather than vendor-safe remote support across customer clouds.
Business plan
BYOC agent support relay is a vendor-support infrastructure company for AI software vendors that deploy agents inside customer-owned cloud accounts. The beachhead is Series A-C vendors with 5-20 live BYOC tenants in regulated sectors where weekly escalations still depend on customer-run log pulls and ad hoc scripts. The first product is not a broad observability suite; it is an AWS-first in-tenant trace relay and policy engine that produces redacted incident bundles, release-regression diffs, and fleet health signals without exporting raw prompts or traces by default. That wedge matches the researched buying trigger of a new enterprise BYOC win, renewal, or bad release that overwhelms the vendor's reliability team. Pricing should start with a paid pilot and then convert to per-live-tenant annual subscriptions because value scales with isolated deployments, not seats. The sequencing is intentionally narrow: prove one support workflow, one cloud, and one export-approval model before expanding into multi-cloud coverage, SLA proof, or billing analytics. The strongest reasons to engage are clear pain, premium budget precedent, and a differentiated workflow relative to generic observability tools. The biggest disconfirming risk is that security teams refuse any outbound artifact egress or that the true count of vendors with enough live BYOC tenants is smaller than assumed. Research also does not resolve the dominant private-cloud agent frameworks or the first budget owner, so the first six months must treat connector selection and budget ownership as explicit tests.
Problem
- AI vendors lose root-cause speed once their product runs inside customer VPCs and support teams depend on customer-run log pulls, dashboards, and SSH sessions to diagnose incidents.
- Existing observability products either require sensitive telemetry to leave the tenant or price unsampled AI traces poorly, while manual workflows cannot compare incidents or regressions across isolated customer environments.
Solution
- Deploy an AWS-first in-tenant collector and policy engine that captures full agent traces locally and exports only approved redacted incident bundles, regression diffs, and fleet health summaries.
- Give vendor reliability teams a central cross-tenant support console for P1 triage, release comparison, and export approval without forcing the end customer to replace Datadog, Grafana, or CloudWatch.
Why we win
- The product serves the vendor's support and release-debugging workflow rather than asking the end customer to buy another observability backend, which avoids a head-on platform replacement sale.
- Reusable masking policies, export templates, and a growing corpus of redacted incident bundles and fixes across many tenants compound into faster diagnosis and a workflow moat.
| Beachhead | Series B AI software vendors selling copilots or agents into banks, insurers, publishers, and similar regulated enterprises through AWS BYOC deployments, with 5-20 live customer tenants and recurring Sev-1 support load. |
|---|---|
| Wedge rationale | This customer slice already feels the cost of manual evidence gathering, has a clear renewal-linked trigger, and can justify six-figure reliability spend sooner than broader AI vendors or end-enterprise platform teams. |
| Sequencing | Start with AWS plus OTel-aligned trace capture because the first goal is to prove faster incident resolution under real security review, then add release-regression and benchmarking workflows once pilot data is flowing, and only then expand cloud coverage and channels after deployment speed is repeatable. |
| Not yet | Selling a generic observability replacement or end-customer dashboard to enterprise platform teams. · Supporting one-off on-prem or air-gapped deployments before the AWS BYOC playbook is repeatable. · Expanding into SLA reporting, usage-billing evidence, or broad AI eval tooling before the incident-bundle wedge converts to production. |
| Wedge | Sell a paid support-visibility pilot for the first 1-3 BYOC tenants immediately after a renewal, launch, or incident, then convert into a production subscription once the relay proves faster triage and safer release debugging. |
|---|---|
| Channels | Founder-led direct sales into CTOs, VPs of Engineering, and heads of customer reliability at AI vendors already selling BYOC. · Cloud-deployment consultancies and systems integrators already implementing private networking and security review for enterprise tenants. · Customer-success and reference-account introductions from existing BYOC enterprise customers that are already escalating support pain. |
| Funnel targets | Target account→qualified pilot 20-30%, qualified pilot→paid pilot 50%+, paid pilot→production 60%+, production account→5+ live tenants under relay within 9 months in 50%+ of accounts. |
| Pricing | Start with a $15k-$25k paid pilot for 60-90 days covering 1-3 live tenants, then convert to an annual subscription priced per live BYOC tenant with an $80k-$150k base ACV for vendors operating 5-20 tenants, plus premium charges for longer retention, protected environments, and higher analyzed run volume. This aligns price to the buyer's deployment complexity and support burden better than seat-based pricing. |
| MVP | MVP covers AWS deployments first: OTel-aligned trace capture, local prompt and tool-call retention, redaction policy templates, human-approved incident bundle export, release-diff views, and a central fleet health console for one vendor across 5-20 tenants. It deliberately excludes multi-cloud, customer-facing dashboards, and broad model-evaluation workflows. |
|---|---|
| 6 months | Ship a production-ready AWS collector, policy templates for masking and retention, export approvals with audit logs, and support-ticket packaging that gets the first actionable incident bundle out in under 30 days from kickoff. |
| 12 months | Add release-regression benchmarking, benchmark snapshots across tenants, and the next most-requested cloud connector only after AWS deployments consistently reach first value with minimal custom work. |
| 24 months | Expand from support relay into private-deployment SLA proof, upgrade validation, and usage-billing evidence while keeping policy-safe in-tenant computation as the architectural core. |
| Key bets | Buyers will value faster safe evidence export more than a broader observability dashboard in the first budget cycle. · An AWS-first deployment path can reach first value in four to six weeks with limited custom engineering. · Redacted incident bundles and regression fingerprints will become reusable assets that improve pilot-to-production conversion and future diagnosis. |
| Revenue streams | Annual subscription priced by live BYOC tenant count with platform minimums. · Premium retention, fleet benchmarking, and release-validation modules. · Security-review onboarding and deployment services sold as bounded packages. |
|---|---|
| Unit of value | Live BYOC tenant under relay policy, with upsell tied to retained trace days and analyzed agent runs. |
| Target gross margin | 70% |
| Expansion levers | Add more live tenants and cloud environments within the same vendor account. · Expand from incident bundles into release validation, benchmarking, and SLA evidence. · Reuse approved policy templates to shorten rollout into new regulated customer logos. |
| North-star metric | Number of production live BYOC tenants automatically generating approved incident bundles and release diffs for the vendor support team. |
|---|---|
| Input metrics | Time from kickoff to first actionable incident bundle. · Qualified pilot to paid pilot conversion rate. · Paid pilot to production conversion rate. · Median MTTR improvement on incidents where the relay is used. · Expansion rate from first production tenant to five or more covered tenants in an account. |
| Moats to build | Policy-template library for masking, retention, and export approvals by regulated use case. · Redacted incident-bundle corpus linking trace patterns to resolutions across many isolated tenants. · AWS and OTel deployment playbooks that reduce security-review and onboarding friction. |
| Kill criteria | If fewer than 3 of the first 10 qualified prospects allow any outbound redacted bundle after security review, the relay export thesis is wrong. · If the first 3 deployments cannot reach an actionable incident bundle within 30 days and less than 2 engineer-weeks of custom work, repeatable GTM is broken. · If fewer than 2 paid pilots convert to "$80k+" production subscriptions by month 12, the ROI and pricing model are too weak. |
Milestones
- Package the AWS-first collector, policy templates, and incident-bundle export workflow.
- Sign 6-8 design partners and convert at least 3 into paid pilots.
- Put 2 customers into production at "$80k+" annualized value.
- Prove first value in 30 days and keep customization below 2 engineer-weeks per pilot.
- Add the next cloud connector only after AWS deployments are repeatable.
- Grow to 12-15 production customers and expand multiple tenants inside the best accounts.
- Launch release-regression benchmarking and policy-template reuse as standard upsells.
- Establish partner-sourced pipeline as a meaningful share of qualified pilots.
- Expand from support relay into SLA proof, upgrade validation, and billing-evidence workflows.
- Build a recognized corpus of cross-tenant incident patterns and remediation templates.
- Reach category credibility as the default vendor-support layer for private-cloud AI deployments without becoming a generic observability suite.
flowchart LR Wedge[BYOC support relay] --> MVP[AWS-first incident bundle MVP] MVP --> Proof[Faster triage and paid pilot conversion] Proof --> Expansion[More tenants plus release validation and SLA evidence]
Founding team
| Role | Start timing | Rationale |
|---|---|---|
| Founder CEO | Month 0 | Own founder-led sales, design-partner recruitment, and ICP discipline because the first deals require problem education and sharp qualification. |
| Founding eng | Month 0 | Build the AWS collector, policy engine, export pipeline, and first fleet-health workflow needed for credible pilots. |
| Solutions engineer | Month 3 | Compress deployment time, standardize security-review artifacts, and protect core engineering bandwidth as pilots multiply. |
| Security / policy product lead | Month 6 | Turn pilot learnings into reusable masking templates, approval flows, and retention controls that procurement teams trust. |
| Partnerships lead | Month 9 | Scale the deployment channel only after the AWS playbook is repeatable and partner enablement can be productized. |
Experiment roadmap
| Horizon | Experiment | Hypothesis | Success metric | Owner |
|---|---|---|---|---|
| 0–90 days | Build a named target-account list and interview 25 prospects already selling BYOC into regulated enterprises. | At least 10 prospects have recurring support pain and 5 or more live or imminent BYOC tenants. | 10 qualified accounts with tenant count, trigger event, and named economic buyer captured. | Founder CEO |
| 0–90 days | Run structured security reviews around a sample export schema, masking policy, and audit-log design with 6 prospects. | A standard redacted-bundle template will be acceptable in at least half of qualified accounts. | 3 prospects approve a concrete outbound artifact schema for pilot use. | Founder product |
| 0–90 days | Ship the first AWS collector and incident-bundle prototype on top of OTel-aligned traces. | One design partner can see a useful root-cause packet inside 30 days without replacing existing observability tooling. | 1 design partner receives an actionable incident bundle from a live or replayed workflow within 30 days. | Founding eng |
| 3–6 months | Convert 3 design partners into paid pilots with explicit MTTR and deployment-speed success criteria. | Buyers will pay for support visibility before the full release-benchmarking roadmap is built. | 3 paid pilots signed at "$15k+" each with agreed conversion criteria. | Founder CEO |
| 6–12 months | Launch release-regression diffs and convert the first 2 paid pilots into production subscriptions. | Release debugging plus incident bundles creates a stronger ROI case than incident relay alone. | 2 production customers at "$80k+" ACV and documented MTTR or escalation-load improvement. | Engineering lead |
| 6–12 months | Recruit 2 cloud-deployment partners and test whether they can deliver the packaged AWS playbook. | Partner-led installs can reduce founder time without increasing customization. | 2 signed partners and 1 partner-assisted deployment reaching first value in under 45 days. | Head of partnerships |
Risk assessment
- R1Security teams may reject any outbound artifact egress even after masking and approval controls. — Qualify hard on export policy early, standardize the approval package, and maintain an in-tenant-only fallback product direction.
- R2The number of vendors with enough live BYOC tenants may be smaller than research estimates. — Build a named pipeline before scaling spend and expand into adjacent private-deployment software vendors only after the initial ICP is measured.
- R3Deployment complexity may force heavy custom work and reduce gross margin. — Stay AWS-first, reject edge-case installs early, and hire solutions talent before adding more clouds or features.
- R4Incumbent observability suites or self-hosted tools may add policy-aware export features quickly. — Differentiate on vendor-support workflow depth, cross-tenant comparisons, and faster production deployment rather than broader telemetry breadth.
- R5Buyers may treat the product as a nice-to-have debugging aid instead of a budgeted reliability layer. — Sell only against explicit trigger events and require quantified MTTR or escalation-load goals in every pilot.
| Risk | Likelihood | Impact | Mitigation |
|---|---|---|---|
| Security teams may reject any outbound artifact egress even after masking and approval controls. | High | High | Qualify hard on export policy early, standardize the approval package, and maintain an in-tenant-only fallback product direction. |
| The number of vendors with enough live BYOC tenants may be smaller than research estimates. | Medium | High | Build a named pipeline before scaling spend and expand into adjacent private-deployment software vendors only after the initial ICP is measured. |
| Deployment complexity may force heavy custom work and reduce gross margin. | Medium | High | Stay AWS-first, reject edge-case installs early, and hire solutions talent before adding more clouds or features. |
| Incumbent observability suites or self-hosted tools may add policy-aware export features quickly. | Medium | Medium | Differentiate on vendor-support workflow depth, cross-tenant comparisons, and faster production deployment rather than broader telemetry breadth. |
| Buyers may treat the product as a nice-to-have debugging aid instead of a budgeted reliability layer. | Medium | Medium | Sell only against explicit trigger events and require quantified MTTR or escalation-load goals in every pilot. |
| Title | Head of Customer Reliability at a Series B AI software vendor |
|---|---|
| Profile | A 300-800 employee vendor with 5-20 AWS BYOC tenants in banking, insurance, or publishing, a 3-8 person reliability team, and weekly escalations that currently require customer-run log pulls. |
| Trigger | A new regulated-enterprise launch, renewal, or bad release exposes that the support team cannot debug isolated tenants fast enough with current tooling. |
| Buyer | VP Engineering or CTO |
| Initial contract | $15k-$25k paid pilot for 1-3 tenants converting to an $80k-$150k annual subscription once 5-20 live tenants are under policy, with upsell for retention and release-validation modules. |
What must be true
- At least one target segment has enough vendors with 5 or more live BYOC tenants to support efficient founder-led pipeline generation now.
- Security teams will approve a predefined redacted export schema often enough that the relay can operate without full manual bespoke review every time.
- The relay reduces time to first actionable root-cause packet and overall MTTR enough that buyers fund it from an existing reliability or support budget.
- An AWS-first deployment can be packaged tightly enough that pilots do not become custom services projects.
- Incumbent observability and self-hosted tooling do not solve cross-tenant vendor support workflows well enough to erase the wedge before expansion.
Open diligence questions
- How many live BYOC tenants do the best initial prospects actually run today, not merely pilot?
- What exact fields and approvals must be present before a security team allows outbound incident bundles?
- Which cloud and agent-framework combinations dominate the first 20 prospects and therefore define the connector roadmap?
- Where did comparable buyers fund adjacent tools: reliability, support operations, or security architecture?
- What measurable MTTR delta would make a prospect convert from pilot to a six-figure annual contract?
| Call | Meet / investigate further |
|---|---|
| Conviction | Strong problem signal and credible budget precedent, but conviction stays moderate until security-review pass rates and real tenant counts are proven. |
| Why believe | The company attacks a specific and painful support workflow where BYOC deployment, AI telemetry volume, and enterprise security policy now intersect. |
| Why doubt | The beachhead may be narrower than it appears and some customers may refuse any artifact egress, which would force a different product and pricing model. |
| Next diligence | Validate with 8-10 target vendors that at least two will approve a paid pilot with outbound redacted bundles inside one budget cycle. |
Financial model
| Year 1 revenue | $87K EBITDA $-883K · Cash EOP $2.12M |
|---|---|
| Year 2 revenue | $1.10M EBITDA $-832K · Cash EOP $1.29M |
| Year 3 revenue | $2.74M EBITDA $-143K · Cash EOP $1.14M |
| ARPU (annual) | $130K |
|---|---|
| Gross margin | 70% |
| CAC | $55K Payback 7.3 months |
| LTV / CAC | 7.7x LTV $422K |
| Round | pre-seed · $3.0M |
|---|---|
| Runway | 30 months |
| Milestone | Reach 14-20 production customers, prove partner-assisted AWS deployments and release-regression upsell, and start a seed process with six months of cash buffer. |
Model sanity
- Revenue engine. The base case is driven by growing from 2 production vendors at Y1 end to 28 by Q4Y3 at a $130K blended ACV, with most value coming from repeatable AWS deployments and tenant expansion.
- Must go right. Security reviews have to approve a standard redacted-bundle workflow often enough that partner-assisted pilots can convert into 14 production customers by Q4Y2.
- Model breaks if. If sales cycles stretch toward 8-9 months or gross margin stalls below 67%, the downside case keeps EBITDA deeply negative and shrinks the cash floor to roughly $0.6M.
- Next-round proof. A seed-ready story appears once the company can show 14-20 production customers, partner-assisted installs, and release-regression upsell without losing the path to 70% gross margin.
- Revenue (line, area)
- Cash EOP (dashed)
- EBITDA (bars, gray = loss)
- Founder / CEO
- Engineering
- Solutions engineering
- Security / policy product
- Partnerships / sales
- G&A / Ops
| Y3 revenue | Y3 EBITDA | Cash low point | Description | |
|---|---|---|---|---|
| Downside | Security approvals, partner activation, and production conversions slip by roughly two quarters while pricing stays closer to the low end of the business-plan range. | |||
| Base | AWS-first deployments become repeatable, the first partner channel contributes in Y2, and premium retention / protected-environment upsells modestly lift blended ACV. | |||
| Upside | Pilot conversion and partner referrals improve, and release-regression plus retention modules attach earlier without requiring a much larger team. |
| Variable | Downside | Upside | Cash impact | Revenue impact |
|---|---|---|---|---|
| CAC | $70K fully loaded CAC | $45K fully loaded CAC | ||
| sales cycle | 8-9 months from pilot kickoff to production | 4 months | ||
| hiring pace | Pull forward one extra engineer and one GTM hire into H2Y2 | Delay one noncritical hire until after seed proof | ||
| churn | 2.5% monthly churn | 1.2% monthly churn | ||
| ARPU | $120K annual subscription value per customer | $135K annual subscription value per customer | ||
| gross margin | 67% steady-state gross margin | 72% steady-state gross margin |
Scenarios
| Scenario | Y3 revenue | Y3 EBITDA | Cash low point | Description | Key changes |
|---|---|---|---|---|---|
| Downside | $2.31M | $-491K | $620K | Security approvals, partner activation, and production conversions slip by roughly two quarters while pricing stays closer to the low end of the business-plan range. |
|
| Base | $2.74M | $-143K | $1.09M | AWS-first deployments become repeatable, the first partner channel contributes in Y2, and premium retention / protected-environment upsells modestly lift blended ACV. |
|
| Upside | $3.47M | $434K | $1.49M | Pilot conversion and partner referrals improve, and release-regression plus retention modules attach earlier without requiring a much larger team. |
|
Sensitivity
| Variable | Downside | Base | Upside |
|---|---|---|---|
| ARPU | $120K annual subscription value per customer | $130K annual subscription value per customer | $135K annual subscription value per customer |
| CAC | $70K fully loaded CAC | $55K fully loaded CAC | $45K fully loaded CAC |
| churn | 2.5% monthly churn | 1.8% monthly churn | 1.2% monthly churn |
| sales cycle | 8-9 months from pilot kickoff to production | 5-6 months | 4 months |
| gross margin | 67% steady-state gross margin | 70% steady-state gross margin | 72% steady-state gross margin |
| hiring pace | Pull forward one extra engineer and one GTM hire into H2Y2 | Lean ramp to 10 FTE by Q4Y3 | Delay one noncritical hire until after seed proof |
Key assumptions (20)
| ID | Name | Value | Unit | Source |
|---|---|---|---|---|
| A1 | Model start month | 2026-07 | month | [BP date] First full month after the 2026-06-24 business-plan date. |
| A2 | Opening cash / pre-seed raise | $3.0M | usdM | [BP fundingAsk] The business plan targets a $3-5M pre-seed; the model uses the bottom of that range to fund the AWS-first build, early design partners, and a six-month cash buffer before seed proof. |
| A3 | Revenue recognition basis | Only annual production subscriptions are recognized in core revenue; paid pilots and security-onboarding packages are excluded from the base P&L. | policy | [BP gtm.wedge; BP gtm.pricing; BP businessModel.revenueStreams] This keeps the model conservative and makes revenue reconcile cleanly to production customers × ARPU. |
| A4 | Blended annual production-customer ARPU | $130,000 per customer-year | usd_per_customer_year | [BP gtm.pricing; research.bottomUpSizingDrivers] Research implies about $120K annual spend per vendor, while the business plan allows $80K-$150K base ACV plus premium retention and protected-environment charges, so the model uses a modest upsell-blended $130K. |
| A5 | Year 1 production-customer ramp | M1-M12 customersEop = 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2 | customers | [BP milestones 0-12 months; BP experimentRoadmap] This matches three paid pilots converting into two production subscriptions by year-end. |
| A6 | Year 2 and Year 3 production-customer ramp | M13-M36 customersEop = 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 22, 23, 24, 25, 26, 28 | customers | [BP milestones 12-24 months; research.market.som; BP gtm.funnelTargets] The ramp reaches 14 production customers by month 24 and 28 by year 3, which is ambitious but still below half of the 60-customer year-3 SOM from research. |
| A7 | Gross-margin ramp | 65% in Y1, 68% in Y2, 70% in Y3 | percent | [BP businessModel.targetGrossMarginPct; BP operatingAssumptions] The business plan targets 70% gross margin, but early installs are modeled below target until deployment work stays under the stated two engineer-weeks per customer. |
| A8 | Founder / CEO loaded cash compensation | $120,000 | usd_per_fte_year | Startup-finance heuristic for a below-market founder salary at pre-seed, consistent with BP team showing founder-led GTM from Month 0. |
| A9 | Engineering loaded cash compensation | $165,000 | usd_per_fte_year | Startup-finance heuristic for cloud / telemetry engineers building AWS collectors, policy controls, and release-regression workflows for an enterprise infrastructure startup [BP product; BP team]. |
| A10 | Solutions engineering loaded cash compensation | $145,000 | usd_per_fte_year | Startup-finance heuristic for a solutions engineer who compresses deployment and security-review work across BYOC tenants [BP team]. |
| A11 | Security / policy product loaded cash compensation | $160,000 | usd_per_fte_year | Startup-finance heuristic for a product/security lead translating masking, retention, and export policies into repeatable product templates [BP team; research.regulatoryTechnicalConstraints]. |
| A12 | Partnerships / sales loaded cash compensation | $150,000 | usd_per_fte_year | Startup-finance heuristic for one partner-led commercial hire added only after the AWS playbook is repeatable, matching BP team sequencing. |
| A13 | G&A / ops loaded cash compensation | $110,000 | usd_per_fte_year | Startup-finance heuristic for a lean finance / operations generalist added when customer count and contracting load rise in Y2. |
| A14 | Headcount ramp snapshots | Founder 1/1/1/1/1/1; engineering 1/1/2/2/3/3; solutions 0/1/1/1/2/2; security 0/0/1/1/1/1; partnerships 0/0/0/1/1/2; G&A 0/0/0/0/1/1 across q1y1/q2y1/q3y1/q4y1/q4y2/q4y3 | fte | [BP team; BP strategicChoices.sequencingRationale] The model follows the business-plan order: build the AWS-first product, standardize deployment, then add channel leverage and back-office support. |
| A15 | Payroll smoothing in Y2 and Y3 | Quarterly salary expense uses the most recent hiring ramp instead of stepping only on the year-end snapshots. | method | [Financial Modeler instructions] This keeps the salary line internally consistent with the six-column headcount schema. |
| A16 | Non-payroll operating budget | Y1 monthly S&M $8K-$16K, R&D $9K-$13K, G&A $5K-$8K; Y2 quarterly S&M $42K-$60K, R&D $30K-$39K, G&A $18K-$27K; Y3 quarterly S&M $66K-$84K, R&D $42K-$51K, G&A $30K-$39K | usdK | [BP operations; BP fundingAsk.useOfFundsSummary; research.reportMemo.distributionChannels; research.regulatoryTechnicalConstraints] These budgets cover cloud costs, travel, security review materials, audit workflows, and founder-led enterprise selling without assuming a large field team. |
| A17 | Fully loaded CAC | $55,000 per net production customer | usd_per_customer | [BP gtm.channels; BP gtm.funnelTargets] Derived startup-finance heuristic for founder-led enterprise sales, pilot travel, partner enablement, and a measured first commercial hire. |
| A18 | Monthly churn for unit economics | 1.8% | percent | [BP risks; research.categoryDynamics.headwinds] This heuristic assumes enterprise accounts are sticky once deployed, but still reflects renewal and security-friction risk for an early workflow product. |
| A19 | Cash roll-forward convention | Ending cash equals opening cash plus EBITDA; debt, capex, taxes, and working-capital timing are not modeled separately. | policy | Startup-finance heuristic for an asset-light software company where operating burn is the dominant cash driver. |
| A20 | Funding objective | Reach 14-20 production customers, prove at least one partner-assisted deployment path, and enter a seed process with six months of buffer. | goal | [BP milestones 12-24 months; BP fundingAsk; BP experimentRoadmap] This is the next financing proof implied by the plan once AWS deployment is repeatable and release-regression upsells begin. |
flowchart LR Leads[Qualified BYOC vendors] --> Pilots[Paid pilots] CACSpend[CAC spend] --> Pilots Pilots --> Customers[Production customers] Customers --> Tenants[More live tenants] Tenants --> Revenue[Annual subscription revenue] Revenue --> GrossProfit[Gross profit] GrossProfit --> EBITDA[EBITDA] EBITDA --> Cash[Ending cash] Churn[Churn and security friction] --> Customers
Flags: The model assumes enough target accounts will approve outbound redacted incident bundles to reach 14 production customers by month 24, but research still leaves budget ownership and export-policy pass rates unresolved. · Gross margin only reaches the 70% business-plan target if deployment work stays near the plan threshold of under two engineer-weeks per customer; otherwise solutions work will push the funding need higher. · Revenue excludes paid pilots and onboarding fees, which keeps ARR conservative but means the cash path may diverge if the company leans harder into services to win early deals.
Top risks
- Slow security reviews. Each customer tenant may require a separate security and procurement review before the relay can be installed. Mitigation: Start with vendors already selling BYOC into regulated accounts and provide opinionated reference architectures, redaction defaults, and audit artifacts.
- Incumbent observability overlap. Customers may argue their existing Datadog, Grafana, or CloudWatch stack should already solve the problem. Mitigation: Position the product as vendor-side support infrastructure that works on top of customer-owned tools and focuses on cross-tenant evidence packaging rather than replacing the stack.
- Beachhead too narrow. The number of vendors with enough live BYOC tenants could be smaller than expected in the first year. Mitigation: Land in BYOC support first, then expand to release validation, SLA proof, and billing evidence for the broader private-deployment software market.
Evidence
Cited sources (40)
- Tsuga. Tsuga Raises $35 Million Series A | Tsuga · https://www.tsuga.com/resources/blog/pr/tsuga-raises-35-million-series-a
- Tech Funding News. They sold their startup to Datadog. Now, they've raised $35M to disrupt observability · https://techfundingnews.com/ex-datadog-founders-raise-35m-tsuga-observability/
- Tsuga. Resilient Observability | Tsuga · https://www.tsuga.com/solutions/resilient-observability
- Tsuga. Agent-Native Observability | Tsuga · https://www.tsuga.com/solutions/agent-native-observability
- Tsuga. The new standard for observability. | Tsuga · https://www.tsuga.com/product/overview
- Langfuse. Masking - Langfuse · https://langfuse.com/docs/observability/features/masking
- Langfuse. Data Retention - Langfuse · https://langfuse.com/docs/administration/data-retention
- Langfuse. Audit Logs - Langfuse · https://langfuse.com/docs/administration/audit-logs
- Langfuse. Pricing - Langfuse · https://langfuse.com/pricing
- LangChain. LangSmith: AI Agent & LLM Observability Platform · https://www.langchain.com/langsmith/observability
- LangChain. LangSmith: Agent Deployment Infrastructure for Production AI Agents · https://www.langchain.com/langsmith/deployment
- LangChain. Mission Control for Self-Hosted LangSmith on Kubernetes · https://www.langchain.com/blog/mission-control-operating-self-hosted-langsmith-on-kubernetes
- LangChain. Introducing End-to-End OpenTelemetry Support in LangSmith · https://www.langchain.com/blog/end-to-end-opentelemetry-langsmith
- LangChain. How LangSmith and LangChain OSS Help You Meet EU AI Act Requirements · https://www.langchain.com/blog/langsmith-langchain-oss-eu-ai-act
- LangChain. LangSmith Plans and Pricing · https://www.langchain.com/pricing
- Arize AI. Agent Observability and Tracing · https://arize.com/ai-agents/agent-observability/
- Arize AI. Arize Private Connect - Arize AX Docs · https://arize.com/docs/ax/security-and-settings/arize-private-connect
- Arize AI. Arize Audit Log - Arize AX Docs · https://arize.com/docs/ax/security-and-settings/compliance/arize-audit-log
- Arize AI. The Role of OpenTelemetry (OTEL) in LLM Observability · https://arize.com/blog/the-role-of-opentelemetry-in-llm-observability/
- Arize AI. Pricing · https://arize.com/pricing/
- groundcover. Bring Your Own Cloud Observability | groundcover BYOC · https://www.groundcover.com/byoc
- groundcover. On-Prem & Air-Gapped Solutions with groundcover · https://www.groundcover.com/onprem-and-airgapped
- groundcover. groundcover Pricing Plans: Free, Team & Enterprise Plans · https://www.groundcover.com/pricing
- groundcover. BYOC Architecture Tradeoffs and Real World Lessons · https://www.groundcover.com/blog/byoc-in-practice-architectures-tradeoffs-lessons
- Datadog. Agent Observability | LLM Observability | Datadog · https://www.datadoghq.com/products/ai/agent-observability/
- Datadog. Pricing | Datadog · https://www.datadoghq.com/pricing/
- Datadog. Observability Pipelines · https://docs.datadoghq.com/observability_pipelines/
- Datadog. Sensitive Data Scanner · https://docs.datadoghq.com/security/sensitive_data_scanner/
- AWS. VPC Networking - AWS PrivateLink - AWS · https://aws.amazon.com/privatelink/
- AWS. Use interface VPC endpoints (AWS PrivateLink) to create a private connection between your VPC and Amazon Bedrock - Amazon Bedrock · https://docs.aws.amazon.com/bedrock/latest/userguide/vpc-interface-endpoints.html
- Microsoft. How to configure network isolation for Microsoft Foundry - Microsoft Foundry · https://learn.microsoft.com/en-us/azure/foundry/how-to/configure-private-link
- Microsoft. Governance and security for AI agents across the organization - Cloud Adoption Framework · https://learn.microsoft.com/en-us/azure/cloud-adoption-framework/ai-agents/governance-security-across-organization
- Microsoft. Data, privacy, and security for Foundry Models sold by Azure in Microsoft Foundry - Microsoft Foundry · https://learn.microsoft.com/en-us/azure/foundry/responsible-ai/openai/data-privacy
- Google Cloud. VPC Service Controls · https://cloud.google.com/security/vpc-service-controls
- Google Cloud. Gemini Enterprise Agent Platform and zero data retention | Google Cloud Documentation · https://docs.cloud.google.com/gemini-enterprise-agent-platform/resources/zero-data-retention
- NIST. AI Risk Management Framework · https://www.nist.gov/itl/ai-risk-management-framework
- OWASP. Home · https://genai.owasp.org/
- EUR-Lex. Regulation - EU - 2024/1689 - EN - EUR-Lex · https://eur-lex.europa.eu/eli/reg/2024/1689/oj/eng
- OpenTelemetry. Moved: Generative AI semantic conventions · https://opentelemetry.io/docs/specs/semconv/gen-ai/
- Helicone. Self-Hosting Helicone - Helicone OSS LLM Observability · https://docs.helicone.ai/getting-started/self-host/overview