DEFENSE AUTONOMY AI FUNDING·defense·Scan 2026-04-29 to 2026-04-29·Run 20260430091617
Mission-assurance software that proves mixed military robot fleets are safe, controllable, and contract-ready before deployment.
Defense robotics teams are being pushed to put one operator in control of mixed unmanned fleets, but they still prove safety and controllability with ad hoc simulation, range time, and slideware. That makes every field exercise, OTA milestone, and customer demo a high-risk integration event.
By Bizidea Research/
Overall rating3.9/ 5.0
3
Market
$0.3B TAM with defense-autonomy adoption accelerating, but five mapped competitors make the category meaningful rather than wide open.
4
Differentiation
Cross-platform mission assurance and approval packets are a sharper wedge than generic simulation, though primes could build adjacent tooling.
4
Execution
Plan is specific, with strong modeled economics: 72% gross margin, 9.6x LTV/CAC and 6.9-month payback, offset by four concentration and delivery flags.
5
Timeliness
Scout AI's oversubscribed $100M Series A and four same-day signals point to a fresh budget and deployment inflection in defense autonomy.
Section
Why now
A record-scale, oversubscribed Series A shows defense buyers and investors are rapidly funding autonomy software, creating downstream budget for the assurance stack around it.
The control problem is shifting from one robot to fleets of robots supervised by soldiers, which sharply increases the need for mission-level validation before deployment.
Foundation models for unmanned warfare make autonomy behavior more powerful but less legible, increasing the value of an independent assurance layer.
Training on a U.S. military base with autonomous ATVs suggests real field data and procurement-adjacent trials now exist, so readiness evidence can be tied to live operating conditions rather than lab demos.
Catalyst.Scout AI’s large round and on-base training signal that soldier-facing autonomous fleet control is moving from prototype theater into funded deployment, making assurance a budgeted blocker now.
Section
The idea
Autonomy Mission Assurance ingests mission plans, vehicle capabilities, autonomy stack outputs, and communications assumptions before a field event. It runs scenario sweeps for lost-link, GPS degradation, target confusion, handoff failures, and human override latency across mixed UGV and UAS fleets. The product then produces a pass-fail readiness score, recommended guardrails, and an evidence packet that test teams can hand to internal review boards or government customers. After live exercises, it reconciles telemetry against the simulated envelope to improve the next mission package. Over time, the company builds the proprietary corpus of failure modes, mission templates, and readiness benchmarks for defense autonomy programs.
What's different. This is not another autonomy brain or generic defense simulation tool. The product owns the assurance workflow between mission planning, range execution, and customer approval for mixed fleets, which is where time-to-contract gets lost today. Its moat comes from a growing library of cross-platform failure cases, operator override benchmarks, and evidence templates that become harder to reproduce with each additional program and field exercise.
Startup thesis
Beachhead
Army-focused UGV and UAS OEMs preparing platoon-level field exercises or procurement milestones where a single operator must supervise multiple autonomous vehicles.
Wedge
A simulation-to-field assurance layer that rehearses missions, stress-tests comms and GPS loss, and auto-generates a safety case plus after-action evidence package for program offices.
Non-obvious insight
The next choke point in defense autonomy is not better vehicle intelligence; it is proving that heterogeneous robot fleets remain controllable by a soldier across degraded comms, changing mission plans, and mixed hardware. Once autonomy becomes a foundation-model layer, assurance becomes a separate software category.
Venture-scale path
Start as the evidence layer for pre-deployment testing, then expand into runtime policy enforcement, fleet telemetry benchmarking, supplier certification, and the system of record for autonomy readiness across primes, OEMs, and program offices.
Target user
Primary user
Autonomy test directors at defense OEMs building UGV and UAS systems for U.S. Army programs.
Secondary user
T&E teams inside Army innovation units piloting mixed unmanned fleets.
Economic buyer
VP of Autonomy or program manager for robotic systems at defense primes and venture-backed OEMs.
Go-to-market seed
First customer
Director of test and evaluation at a 50-500 person defense robotics company shipping autonomous UGVs or ISR drones into Army pilots.
Buying trigger
An upcoming Army field exercise, OTA milestone, or customer demo that requires mixed-fleet autonomy to work under degraded comms.
Current alternative
Manual test scripts, bespoke simulation, live exercises, internal tools, and PowerPoint-based safety reviews.
Switching reason
The wedge cuts weeks of manual prep into a repeatable evidence package, catches mission-killing failures before scarce range time, and gives program teams documentation they can reuse across milestones.
Pricing hypothesis
Annual platform subscription priced by program, plus usage-based fees per mission rehearsal package or vehicle type onboarded.
Jobs to be done
Job
Current alternative
Success metric
When an upcoming field exercise puts one operator in charge of multiple autonomous vehicles, help the autonomy test director prove mission readiness before range day, so they can avoid expensive failures and approval delays.
Manual rehearsal across internal simulators and live tests.
Fewer critical failures discovered during live exercises and faster sign-off before milestones.
When a program office asks for evidence that autonomy will behave inside mission constraints, help the robotics program manager generate a reusable safety case, so they can clear reviews without rebuilding documentation every time.
Slide decks, spreadsheets, and bespoke reports assembled by engineering teams.
Shorter review cycles and higher reuse of evidence across demos and milestones.
Assurance layer for autonomous missions
flowchart LR
Buyer[Autonomy Test Director] --> Pain[High-risk fleet field exercises]
Pain --> Product[Mission assurance rehearsal plus evidence layer]
Product --> Outcome[Faster approvals and safer autonomous deployments]
Idea scorecard — average4.8 / 5 · 5axes
Signal · 5/5The cluster has multiple verified sources, a $100 million round, and explicit signals that defense autonomy is accelerating quickly.
Pain · 5/5Failed autonomy tests burn scarce range time, delay contracts, and can kill deployment trust with military buyers.
Wedge · 5/5Pre-deployment mission assurance for mixed unmanned fleets is a narrow workflow with a clear user, trigger, and deliverable.
Defense · 4/5A proprietary corpus of mission failures, override benchmarks, and evidence templates can compound, though incumbents may try to copy the category.
Scale · 5/5The beachhead can expand from testing into runtime controls, supplier certification, and a data network across many defense autonomy programs.
Business model canvas
Key partners
Defense robotics OEMs
Simulation providers
Telemetry hardware vendors
Former T&E operators and advisors
Key activities
Modeling mission constraints
Running scenario sweeps
Generating evidence packages
Improving readiness benchmarks from exercise data
Key resources
Scenario library
Assurance engine
Telemetry connectors
Defense-domain workflow expertise
Value propositions
Proves fleet autonomy readiness before scarce range time
Generates reusable evidence packages for reviews and milestones
Reduces mission failures caused by edge-case autonomy behavior
Customer relationships
High-touch deployment support
Mission-template onboarding
Quarterly readiness reviews
Channels
Direct sales to autonomy program leaders
Embedded pilots tied to field exercises
Partnerships with simulation and telemetry vendors
Government T&E organizations piloting unmanned operations
Cost structure
Engineering for simulation and analytics
Secure deployment and support
Defense sales and field operations
Compliance and air-gapped infrastructure
Revenue streams
Annual software subscriptions
Per-program onboarding fees
Usage fees for mission rehearsal runs
Section
Market
Market sizing
Market sizing overview
TAM
$0.3BEstimate: ~150 U.S./allied unmanned-autonomy programs, OEM teams, primes, and government test organizations x roughly $2.0M annual assurance spend (platform, integration, mission-package generation). Account count is anchored by visible Army procurement activity, expanding mission-autonomy programs at Applied/Shield/Anduril, and the broader drone ecosystem; spend is modeled because public prices are not disclosed.
SAM
$45.0MEstimate: ~60 U.S. Army-adjacent OEM, prime, and T&E accounts x roughly $0.75M annual beachhead spend focused on pre-field-event assurance rather than full-platform autonomy.
SOM
$9.0MEstimate: 15 reachable accounts by year 3 x ~$0.6M blended ACV after pilots, reflecting high-touch integrations and program-based land-and-expand motion.
Executive takeaways
Autonomy mission assurance is emerging as a distinct wedge because capital is pouring into soldier-facing autonomy itself while the operator problem is shifting from one robot to mixed fleets [1][2].
The best near-term buyer is the OEM or prime autonomy/T&E leader about to burn scarce range time on an Army exercise, milestone, or demo [15][16][18].
Incumbents mostly sell the brain, the platform, or the simulator; fewer sell a vendor-neutral readiness artifact that survives procurement review across mixed fleets [3][6][9][25].
The beachhead is attractive but not huge as a standalone app category; the venture case strengthens only if the product expands into runtime policy enforcement, telemetry benchmarking, and certification workflows [3][12][21].
Budget willingness exists: peers are winning large autonomy and ISR contracts while the Army is changing drone acquisition pathways, so the spend is real even if software prices are quote-based [7][8][16].
The hardest barriers are trust, security, and integration: buyers must validate behavior under EW/GPS-loss conditions and brittle comms infrastructure can fail tests late in the cycle [13][19][23].
Market definition
Category: software that rehearses, stress-tests, and documents readiness for mixed military unmanned fleets before live deployment. The buyer is the OEM/prime autonomy or T&E leader running Army-adjacent UGV/UAS programs; geography starts with U.S. programs and expands to close allies with similar doctrine and procurement patterns. It includes mission rehearsal, degraded-comms/GPS-loss scenario sweeps, readiness scoring, and evidence-package generation; it excludes the core autonomy brain, C2 system of record, and vehicle hardware itself [3][4][15][17][20].
Customer and buyer
Primary user: autonomy test directors, chief engineers, and T&E leads at defense OEMs and primes preparing field exercises or milestone reviews. Economic buyer: VP of autonomy, program manager, or business-unit leader who owns schedule risk and range-time burn. Urgent jobs are proving one operator can supervise multiple assets, showing behavior under degraded comms/EW, and producing documentation reusable across reviews and demos [2][4][15][16][21].
Buying triggers
Upcoming Army field exercise, OTA milestone, or customer demo where mixed-fleet autonomy must work on limited range time.[15][16][17]
Transition from single-vehicle autonomy to multi-asset mission autonomy or loyal-wingman concepts.[8][9][11][14]
Need to prove resilience against EW, GPS denial, or communications outages before deployment.[13][23][24]
Willingness to pay
Budget evidence is strong even without public SaaS pricing: Scout raised $100M for soldier-facing autonomy, Shield is competing for up to $800M in ISR services and has public mission-autonomy wins, Anduril is winning mission-autonomy and unmanned-vessel programs, and the Army is formalizing faster drone buying channels. That supports quote-based, program-budgeted spend for assurance when it is attached to milestone risk.[1][7][8][10][16]
Category dynamics
Growth signal Step-function adoption rather than a clean published CAGR; program tempo is being pulled by drone urgency, mission-autonomy programs, and rapid acquisition changes.
Tailwinds
Large new autonomy funding rounds and mission-autonomy contract wins validate budget gravity around unmanned warfare software.
Army and allied forces are pushing harder on drone acquisition, rapid testing, and fielded autonomy, which increases the cost of failed exercises.
Open-source and simulation toolchains make scenario generation easier, which helps an assurance layer plug into existing workflows.
Headwinds
The buyer universe is concentrated, so long procurement cycles or one canceled program can materially slow growth.
Autonomy-stack vendors can bundle adjacent assurance features, especially if customers accept single-vendor lock-in.
Field tests still depend on communications infrastructure and airspace constraints that software alone cannot control.
Validation signals
Scout AI raised a $100M Series A around soldier-facing autonomy software.
Shield AI is competing for up to $800M in ISR services with V-BAT and continues to win mission-autonomy programs.
Anduril is expanding mission-autonomy into autonomous surface vessels and collaborative combat aircraft.
Army ATEC and Army drone marketplace efforts show testing and acquisition process change is already underway.
Recent drone-test disruption from a Starlink outage highlights how brittle real-world autonomy trials can be and why pre-event scenario rehearsal matters.
Regulatory & technical constraints
AI governance and assurance expectations are rising; buyers increasingly need documented risk-management and test procedures, not just demos.
Degraded comms, EW resilience, and GPS-denied operations are core mission constraints that any assurance workflow must model explicitly.
Domestic airspace and unmanned-operations rules can still shape where and how live testing is run.
Security, data rights, and air-gapped deployments raise implementation cost and slow time-to-value.
Heterogeneous fleets require connectors into multiple autonomy stacks, vehicle buses, telemetry stores, and simulators.
Mission-assurance market map
Section
Competition
Applied Intuition is closest on simulation/T&E workflow and already has Army credibility; Shield AI and Anduril are the strongest autonomy-stack and mission-autonomy incumbents; ReSim and NVIDIA/Isaac Sim are tooling substitutes; PX4/ROS/ArduPilot/Gazebo and internal simulation teams are the low-cost fallback. The strategic gap is a vendor-neutral readiness artifact that sits above the simulator and below procurement sign-off [3][4][6][9][25][26][27].
Competitor
Stage
Wedge
Pricing
Strength
Weakness vs. us
Applied Intuition Defense
scale-up
Digital engineering, simulation, and autonomy tooling for defense programs.
Custom enterprise / government quote
Strong fit with Army vehicle programs and explicit positioning around autonomous military test and evaluation.
Broad platform orientation; less clearly focused on reusable approval packets and mixed-fleet readiness evidence as a standalone workflow.
Shield AI
scale-up
Autonomy brain and mission-autonomy software bundled with operational UAS platforms.
Contract / program-based pricing
Real mission-autonomy deployments, visible defense contracts, and credibility with contested-environment operations.
Often a direct platform competitor to OEMs; buyers may resist making their approval and telemetry corpus dependent on a rival autonomy vendor.
Anduril Mission Autonomy
incumbent
System-level mission autonomy and C2 across unmanned assets.
Contract / program-based pricing
Deep distribution, mission-autonomy branding, and growing unmanned maritime and air programs.
Highly integrated stack; third-party OEMs and program offices may want independent evidence rather than an Anduril-owned approval layer.
ReSim
scale-up
Simulation and autonomy validation infrastructure.
Custom enterprise quote
Purpose-built tooling for simulation and validation workflows.
Commercial-autonomy lineage and generic validation posture leave room for a defense-specific evidence artifact tied to Army milestones and degraded-ops scenarios.
NVIDIA Isaac Sim / open-source stack
incumbent
General robotics simulation plus open-source autonomy building blocks.
Platform licensing or self-managed open source
Massive developer familiarity and low incremental adoption cost for engineering teams.
Tooling is horizontal; it does not natively solve procurement evidence, military doctrine mapping, or mixed-fleet approval workflows.
Why incumbents do not win by default
Vertical autonomy primes.Shield AI and Anduril ship autonomy stacks and mission systems, but many OEMs and program offices will prefer an assurance layer that is vendor-neutral and does not hand telemetry or approval logic to a direct platform competitor.
Digital engineering platforms.Applied Intuition, ReSim, and NVIDIA provide simulation and testing infrastructure, but the startup can win if it turns raw test output into reusable defense-ready evidence packets tied to specific field events, not just developer workflows.
Open source and in-house stacks.PX4, ROS, Gazebo, MAVLink, and internal scripts lower build-vs-buy friction, but they do not naturally accumulate cross-program failure benchmarks, approval templates, or mixed-fleet readiness scoring that can compound into a data moat.
Government-led procurement channels.Army marketplaces and rapid acquisition pathways increase access to drones, but they do not remove the burden of proving that autonomy will behave under edge conditions; that still falls on OEM and T&E teams.
Section
Business plan
Autonomy Mission Assurance sells into Army-adjacent defense robotics teams that are about to run a field exercise, OTA milestone, or customer demo and cannot afford a mixed-fleet autonomy failure on scarce range time. The product is a vendor-neutral assurance layer that sits on top of existing simulators and telemetry, stress-tests degraded comms and GPS loss, and produces a reusable readiness packet for internal review boards and government customers. This wedge matches the researched buying trigger: a VP of Autonomy, program manager, or T&E lead with budget tied to an imminent event rather than a speculative platform evaluation. The beachhead is intentionally narrow - 50-500 person UGV/UAS OEMs and prime programs with Army-facing milestones - because that workflow has short proof cycles, visible ROI, and less direct platform conflict than selling the autonomy brain itself. Research-sized market estimates are modest as a standalone app category ($45.0M SAM and $9.0M year-3 SOM), so the venture case depends on expanding from pre-event assurance into runtime policy enforcement, telemetry benchmarking, and certification workflows after early deployments. Pricing and delivery are therefore designed as paid field-event pilots that convert into annual per-program subscriptions, aligning first revenue with the customer's real budget line and preserving gross-margin potential above 70% after integration templates mature. The main disconfirming risks are that buyers treat evidence generation as an internal responsibility, that secure deployments and integrations take too long, or that incumbents such as Applied Intuition, Shield AI, and Anduril bundle enough adjacent capability to block the wedge. Public pricing benchmarks are limited in the inputs, so ACV, conversion rates, and funding needs remain operating assumptions to validate in the first 6 months.
Problem
Mixed UGV/UAS programs still prove readiness with ad hoc simulation, limited range time, and manually assembled review decks, so failures surface too late and burn schedule.
Soldier-supervised fleet autonomy introduces degraded-comms, GPS-loss, handoff, and override edge cases that are harder to test than single-vehicle autonomy.
Program teams need evidence that survives internal and customer review, but current artifacts are rebuilt from scratch for each milestone.
Solution
Ingest mission plans, vehicle profiles, autonomy outputs, telemetry, and communications assumptions before a field event and run repeatable scenario sweeps against degraded operating conditions.
Generate a readiness score, recommended guardrails, telemetry replay, and an exportable evidence packet that can be reused across reviews, demos, and milestone gates.
Reconcile live exercise telemetry back into the scenario library so each deployment improves cross-platform failure benchmarks and future mission templates.
Why we win
The product is sold as a vendor-neutral approval layer, which is easier for OEMs and program offices to trust than relying on a direct platform competitor's autonomy stack.
The first wedge is tied to a budgeted event with visible ROI - avoiding failed range days and compressing milestone prep - rather than a long-horizon platform replacement.
Proprietary failure cases, operator-override benchmarks, and evidence templates can compound across programs faster than generic simulators or in-house scripts.
Strategic choices
Beachhead
Army-adjacent autonomy test directors, chief engineers, and program managers at 50-500 person UGV/UAS OEMs and prime-led mixed-fleet programs preparing a field exercise, OTA milestone, or customer demo within 90-120 days.
Wedge rationale
This entry point has the clearest buying trigger, the shortest proof loop, and a deliverable the customer can judge immediately - whether the software catches mission-killing issues and produces a reviewable packet before scarce range time is consumed. Broader autonomy-platform or government-wide sales would require replacing core control software or waiting through slower procurement cycles.
Sequencing
Start as an evidence layer on top of existing simulation and telemetry workflows so the company can land without displacing incumbent autonomy stacks, then add after-action reconciliation, repeatable benchmark dashboards, and finally runtime policy enforcement once it has proprietary mission data and customer trust. Hiring follows the same sequence: build product and domain credibility first, then add forward deployment and repeatable sales only after pilots convert.
Not yet
Selling a full autonomy brain, C2 system, or hardware platform. · Going directly after programs of record before proving value with OEM and prime program budgets. · Expanding into non-defense robotics or generic commercial autonomy validation. · Building runtime autonomy control before the pre-deployment evidence workflow is repeatable.
Go-to-market
Wedge
Sell a paid pre-field-event assurance pilot to Army-facing UGV/UAS programs that have a live exercise or milestone inside one quarter, then convert to an annual per-program subscription once the evidence packet is reused and telemetry reconciliation proves recurring value.
Channels
Direct founder and defense-domain expert sales into autonomy leads, T&E leads, and program managers. · Integration-led partnerships with simulation, digital-engineering, and telemetry vendors already inside test workflows. · Program-adjacent entry through Army rapid-acquisition and drone-marketplace channels after the first evidence template is accepted by customers.
Funnel targets
10-15 target accounts per quarter, discovery-to-qualified-pilot 25-35%, qualified-pilot-to-paid-pilot 40%+, paid-pilot-to-annual-production 50%+, and production-to-second-program expansion 30%+ within 12 months.
Pricing
Paid pilot priced at $150k-$300k for one field-event cycle, converting to a $600k-$1.2M annual per-program subscription with additional onboarding fees for new vehicle types or secure deployment environments. This matches the researched program-budget buying motion better than seat-based pricing.
Product roadmap
MVP
MVP covers mission-plan ingest, scenario sweeps for lost-link, GPS-denied, handoff-failure, and override-latency cases, plus a readiness report and evidence export for one UGV/UAS mixed-fleet workflow. It should integrate with one open-source autonomy stack and one existing simulation or telemetry workflow rather than trying to replace either.
6 months
Support one paid pilot workflow with after-action telemetry reconciliation, one open-source connector, one commercial simulator or telemetry export, and an air-gapped on-prem deployment option.
12 months
Add repeatable approval templates, multi-program dashboards, benchmark comparisons across missions, and support for a second vehicle family or autonomy stack.
24 months
Expand into runtime policy checks, supplier certification workflows, and a readiness system of record that spans multiple programs and allied deployments.
Key bets
Buyers will pay for evidence packaging and mission rehearsal without requiring the startup to replace their simulator or autonomy stack. · Sanitized or unclassified telemetry is sufficient to prove ROI before the company gains access to more sensitive mission data. · The same core failure library can serve both smaller OEMs on open-source stacks and larger programs using commercial simulation tooling. · Pre-event assurance data can later support higher-value modules such as runtime policy enforcement and certification.
Business model
Revenue streams
Paid pilots tied to field exercises, demos, and OTA milestones. · Annual software subscriptions priced per autonomy program. · Onboarding and integration fees for new vehicle types, telemetry sources, and secure environments. · Usage-based fees for high-volume mission rehearsal packages once a program is in production.
Unit of value
One assured autonomy program or field-event cycle.
Target gross margin
72%
Expansion levers
Add more programs, vehicle families, and mission templates within each OEM or prime account. · Upsell telemetry benchmarking, approval-workflow management, and after-action analytics after initial pilot success. · Expand into runtime policy enforcement and supplier certification once the company owns a meaningful failure corpus.
Strategy map
North-star metric
Number of production programs that use the platform across two or more consecutive milestone cycles.
Input metrics
Days from kickoff to first usable evidence packet. · Paid-pilot to annual-conversion rate. · Number of mission scenarios run per active program. · Evidence-packet reuse rate across milestones. · Time to onboard a new autonomy stack or vehicle type.
Moats to build
Cross-platform corpus of degraded-comms, EW, GPS-loss, and override failure modes. · Reusable Army-style evidence templates and approval logic. · Benchmark dataset for operator supervision and mixed-fleet readiness across programs.
Kill criteria
Fewer than 2 paid pilots after 12 months of focused Army-adjacent selling. · Pilot-to-production conversion below 30% after the first 4 paid pilots. · Average integration time stays above 6 engineer-weeks per account after the third deployment. · Customers will not allow the evidence packet to be used in any internal review or customer-facing milestone.
Milestones
0-12 months
Sign 3 design partners and close at least 2 paid pilots tied to live field events.
Support one open-source connector, one commercial simulation or telemetry connector, and one air-gapped deployment.
Convert at least 1 pilot into an annual subscription and document a repeatable ROI case based on avoided failures or faster sign-off.
12-24 months
Reach 5-7 production programs across OEM and prime accounts.
Launch benchmark dashboards, reusable approval workflows, and support for a second vehicle family or autonomy stack.
Keep target gross margin above 70% by reducing deployment effort and reusing evidence templates.
Establish at least 1 channel partnership that brings the product into an existing simulation or telemetry workflow.
24-36 months
Land 12-15 accounts and approach the researched $9.0M year-3 SOM.
Release runtime policy checks or certification workflows that create recurring usage between field events.
Expand to at least 1 allied program through existing U.S. customer or partner relationships.
Demonstrate benchmark data and approval logic as the primary moat, not bespoke services labor.
Strategy map
flowchart LR
Wedge[Army field-event assurance pilot] --> MVP[Mission rehearsal plus evidence packet]
MVP --> Proof[Paid pilots catch issues before range day]
Proof --> Expansion[Annual per-program subscriptions]
Expansion --> Moat[Telemetry benchmarks and certification workflows]
Founding team
Role
Start timing
Rationale
CEO or founder
Month 0
Own founder-led sales, design-partner recruitment, and partnerships with simulation and telemetry vendors.
Founding eng
Month 0
Build the assurance engine, first connectors, and evidence-packet generation workflow.
Defense autonomy or T&E lead
Month 1
Translate Army-adjacent milestone workflows into credible packet templates and run pilot delivery with customers.
Forward deployed engineer
Month 4
Shorten pilot onboarding, manage integrations, and prevent customer-specific work from derailing product velocity.
Security or platform engineer
Month 6
Productize on-prem, air-gapped, and audit-log requirements needed for broader defense adoption.
Account executive or BD lead
Month 9
Add repeatable pipeline generation only after the first paid-pilot playbook and reference account exist.
Experiment roadmap
Horizon
Experiment
Hypothesis
Success metric
Owner
0-90 days
Interview 12-15 autonomy test directors, chief engineers, and program managers and collect their current milestone packet artifacts.
The first buyer will pay to compress packet creation and reduce range-day failure risk, not just to run more simulations.
At least 8 interviews rank evidence generation or review delay as a top-3 pain and 3 accounts volunteer a current artifact for redlining.
CEO
0-90 days
Prototype a readiness packet from one sample mission plan and one telemetry log using manual analyst support behind the product.
Customers will judge value by whether the output is decision-ready for a real review board, even before full automation.
3 design partners agree the packet is materially better than their current slideware and commit to a paid pilot timeline.
Founding eng
90-180 days
Build connectors for one PX4 or ROS-style workflow and one commercial simulator or telemetry export already present in the pipeline.
Covering those two paths will unlock most first-year opportunities without custom platform replacement work.
More than 60% of qualified opportunities fit one of the two connector paths and onboarding stays under 20 engineer-days.
Founding eng
90-180 days
Run the first paid pilot against a live field exercise and compare pre-event findings with post-event outcomes.
The product can catch at least one material issue before range day and create a packet that is reused after the event.
Customer confirms at least one avoided failure or schedule save and requests a production proposal within 30 days.
Defense T&E lead
180-360 days
Deliver an air-gapped on-prem deployment and security review for a second paying account.
Secure deployment can be standardized enough to avoid becoming a bespoke integration project.
Second secure deployment completes in 4 weeks or less and uses the same release process as the first account.
Security or platform engineer
180-540 days
Convert the first cohort of pilots into annual subscriptions and test expansion demand for benchmarking or approval-workflow modules.
Customers will renew because the tool is reused across milestones, not because of one-off services support.
At least 2 annual production contracts signed and at least 1 customer requests an expansion module before renewal.
CEO
Risk assessment
Business plan risks — 5 mapped
Impact →
High
R1
R3
R4
R2
Medium
R5
Low
Low
Medium
High
Likelihood →
R1Defense procurement and program-budget timing may be too slow for a startup burn profile. · Mediumlikelihood / Highimpact — Start with venture-backed OEMs and prime-led pilots where field-event budgets and decision makers are easier to access.
R2Secure deployment, data rights, and classified workflow needs may raise implementation cost beyond the planned model. · Highlikelihood / Highimpact — Prove ROI first on sanitized telemetry and standardize on-prem and air-gapped deployment before broad expansion.
R3Incumbent simulation or autonomy vendors may bundle enough adjacent assurance capability to block adoption. · Mediumlikelihood / Highimpact — Stay vendor-neutral, own the reviewable evidence artifact, and focus on mixed-fleet approval workflows that incumbents do not standardize well.
R4Customers may prefer internal scripts and services over a productized evidence layer. · Mediumlikelihood / Highimpact — Narrow the ICP, show time-to-packet and range-day savings, and refuse custom work that does not improve the shared template library.
R5The product may not expand beyond a niche pre-event tool. · Mediumlikelihood / Mediumimpact — Track pull for benchmarking, workflow management, and runtime modules early and adjust company ambition if that pull does not emerge.
Risk
Likelihood
Impact
Mitigation
Defense procurement and program-budget timing may be too slow for a startup burn profile.
Medium
High
Start with venture-backed OEMs and prime-led pilots where field-event budgets and decision makers are easier to access.
Secure deployment, data rights, and classified workflow needs may raise implementation cost beyond the planned model.
High
High
Prove ROI first on sanitized telemetry and standardize on-prem and air-gapped deployment before broad expansion.
Incumbent simulation or autonomy vendors may bundle enough adjacent assurance capability to block adoption.
Medium
High
Stay vendor-neutral, own the reviewable evidence artifact, and focus on mixed-fleet approval workflows that incumbents do not standardize well.
Customers may prefer internal scripts and services over a productized evidence layer.
Medium
High
Narrow the ICP, show time-to-packet and range-day savings, and refuse custom work that does not improve the shared template library.
The product may not expand beyond a niche pre-event tool.
Medium
Medium
Track pull for benchmarking, workflow management, and runtime modules early and adjust company ambition if that pull does not emerge.
First customer
Title
Director of Test and Evaluation at an Army-facing robotics OEM
Profile
A 50-500 employee UGV or ISR-drone company running one to three Army pilots with mixed internal simulation, open-source tooling, and live range rehearsals.
Trigger
A field exercise, OTA milestone, or customer demo inside the next 90-120 days where one operator must supervise multiple autonomous assets under degraded comms.
Buyer
VP of Autonomy or robotic systems program manager
Initial contract
$150k-$300k paid pilot for one exercise, converting to a $600k-$1.2M annual per-program subscription if reused across two milestones.
What must be true
At least 5 of the first 10 target T&E or autonomy leaders confirm that failed range days or delayed approval packets are a current budgeted problem.
At least 3 design partners agree to paid pilots before the company has access to classified mission data.
One evidence packet format is accepted in an internal review or customer-facing milestone at two separate accounts.
Pilot-to-production conversion reaches at least 50% when the packet is reused in a second event.
A single open-source path plus one commercial simulator or telemetry connector covers more than 60% of the first-year pipeline.
Open diligence questions
Which exact artifact unlocks milestone progress today - readiness score, safety case, telemetry replay, or after-action packet?
How many pre-field-event budgets can actually be spent within one quarter without a full government procurement cycle?
Will customers buy a vendor-neutral assurance layer if Applied, Shield, or Anduril offer adjacent features in the same deal?
What minimum telemetry and mission data are required to prove value before secure or classified deployment is available?
Can onboarding be standardized enough to keep gross margin above 70%, or does each program become a custom integration?
Investor verdict
Call
Meet / investigate further
Conviction
Strong wedge into an urgent defense workflow, but only if early customers treat the evidence layer as a repeatable product rather than bespoke services.
Why believe
The plan targets a budgeted milestone-risk problem with clear buyer urgency, credible incumbents to displace at the workflow layer, and a plausible data moat from cross-program failure evidence.
Why doubt
The standalone beachhead is not large enough on its own, and security or integration friction could trap the company in slow, services-heavy deployments.
Next diligence
Validate with 10-15 Army-adjacent T&E and autonomy leaders that they will fund a paid pilot before classified-data access and will reuse a third-party evidence packet in a real milestone.
Section
Financial model
3-year totals
Year 1 revenue
$900KEBITDA $-1.40M · Cash EOP $2.00M
Year 2 revenue
$3.95MEBITDA $-774K · Cash EOP $1.23M
Year 3 revenue
$7.50MEBITDA $900K · Cash EOP $2.13M
Unit economics
ARPU (annual)
$600K
Gross margin
72%
CAC
$250KPayback 6.9 months
LTV / CAC
9.6xLTV $2.40M
Funding ask
Round
pre-seed · $3.2M
Runway
18 months
Milestone
Reach 10 active programs, 2+ annualized reference accounts, one repeatable air-gapped deployment, and enough proof to raise the next round with 6 months of cash buffer.
Model sanity
Revenue engine. Base case exits Y3 with 15 active programs and $9.0M ARR, driven by $600K ACV and roughly 50% pilot-to-production conversion.
Must go right. Onboarding has to stay template-based enough to hold gross margin at ~72% while the company adds programs faster than services labor.
Model breaks if. If the sales cycle stretches to 6 months or ACV falls to $500K, the downside case needs bridge financing before Y3 profitability.
Next-round proof. A credible next round is supported by reaching 10 active programs, an air-gapped reference deployment, and repeatable ROI by the end of Y2.
Revenue, cash, and EBITDA — 12-month Y1 + 8-quarter Y2/Y3
Revenue (line, area)
Cash EOP (dashed)
EBITDA (bars, gray = loss)
Use of funds — $3.2M pre-seedHeadcount build by role — peak16 FTE
Founder/CEO
Engineering
Domain/T&E
Forward Deployed / Delivery
Sales / BD
G&A / Ops
Year-3 scenarios — base / downside / upside
Y3 revenue
Y3 EBITDA
Cash low point
Description
Downside
$5.40M
-$450K
-$150K
Two quarters of slower defense budget timing extend pilot close and reduce production conversion.
Base
$7.50M
$900K
$1.23M
Founder-led pilots convert on plan, gross margin reaches target by Y3, and the company exits Y3 at 15 active programs / $9.0M ARR.
Upside
$9.30M
$1.80M
$1.35M
Shorter pilot cycles and stronger expansion lift both customer count and ACV without materially raising headcount.
Sensitivity — Y3 cash and revenue impact, sorted by magnitude
Variable
Downside
Upside
Cash impact
Revenue impact
sales cycle
6 months
3 months
-$1.10M
-$1.50M
ARPU
$500K ACV
$650K ACV
-$900K
-$1.25M
gross margin
65% in Y3
75% in Y3
-$525K
$0K
hiring pace
pull forward 2 hires into Y2
delay 2 non-critical hires by 2 quarters
-$400K
$0K
churn
2.5% monthly churn
1.0% monthly churn
-$350K
-$450K
CAC
$325K CAC
$200K CAC
-$250K
-$300K
Scenarios
Scenario
Y3 revenue
Y3 EBITDA
Cash low point
Description
Key changes
Downside
$5.40M
$-450K
$-150K
Two quarters of slower defense budget timing extend pilot close and reduce production conversion.
sales cycle extends from 4 to 6 months
pilot-to-production conversion falls from 50% to 35%
steady-state ACV falls from $600K to $500K
Base
$7.50M
$900K
$1.23M
Founder-led pilots convert on plan, gross margin reaches target by Y3, and the company exits Y3 at 15 active programs / $9.0M ARR.
sales cycle holds at 4 months
pilot-to-production conversion stays at 50%
steady-state ACV stays at $600K
Upside
$9.30M
$1.80M
$1.35M
Shorter pilot cycles and stronger expansion lift both customer count and ACV without materially raising headcount.
sales cycle compresses from 4 to 3 months
pilot-to-production conversion rises from 50% to 60%
steady-state ACV rises from $600K to $650K
Sensitivity
Variable
Downside
Base
Upside
ARPU
$500K ACV
$600K ACV
$650K ACV
CAC
$325K CAC
$250K CAC
$200K CAC
churn
2.5% monthly churn
1.5% monthly churn
1.0% monthly churn
sales cycle
6 months
4 months
3 months
gross margin
65% in Y3
72% in Y3
75% in Y3
hiring pace
pull forward 2 hires into Y2
current plan
delay 2 non-critical hires by 2 quarters
Key assumptions (23)
ID
Name
Value
Unit
Source
A1
Model start month
2026-05
month
[BP date 2026-04-30] Model starts the month after the plan date.
A2
Starting cash before round
200
USD K
Startup-finance heuristic: assume founder/angel cash covers incorporation and pre-close costs before a pre-seed round.
A3
Pre-seed closes in M1
3200
USD K
[BP fundingAsk] stage pre-seed, target range $2-4M, runway target 18 months; base case uses $3.2M midpoint-plus to fund air-gapped productization and pilots.
A4
Target accounts prospected per quarter
12
accounts per quarter
[BP gtm.funnelTargets] midpoint of 10-15 target accounts per quarter.
A5
Sales cycle
4
months
[BP operatingAssumptions + market.buyingProcess] 90-120 day budget-to-pilot motion modeled at the midpoint.
A6
Qualified opportunity to paid pilot conversion
45
percent
[BP gtm.funnelTargets] qualified-pilot-to-paid-pilot 40%+; base case uses 45%.
A7
Paid pilot to annual production conversion
50
percent
[BP gtm.funnelTargets] paid-pilot-to-annual-production 50%+; base case uses 50%.
A8
Steady-state ACV per paying program
600
USD K per year
[BP gtm.pricing, investorMemo.firstCustomer, market.som] bottom of $600k-$1.2M annual subscription range and consistent with 15 accounts ~ $9.0M exit ARR.
A9
Revenue recognition simplification
50
USD K per active paying program per month
[Derived from A8] Conservative ratable recognition; onboarding and usage fees are excluded from base case revenue to avoid double counting.
A10
Y1 active paying-program ramp
0,0,0,1,1,1,1,2,2,3,3,4
customers EOP by month
[BP milestones 0-12 months] supports 2 paid pilots, 1 conversion, and 4 active paying programs by M12.
A11
Y2 and Y3 exit customers
10 in Q4Y2; 15 in Q4Y3
customers EOP
[BP milestones] 5-7 production programs by 12-24 months and 12-15 accounts by 24-36 months; base case exits Y2 at 10 and Y3 at 15.
A12
Gross margin ramp
55% Y1; 68% Y2; 72% Y3
percent
[BP businessModel.targetGrossMarginPct 72] Early pilots are modeled below target because delivery is more services-heavy; mature margin reaches target in Y3.
A13
Hiring plan basis
Founder M0; founding eng M0; domain lead M1; FDE M4; security/platform M6; AE M9; later hires in line with milestone expansion
timing
[BP team] explicit role timing plus startup-finance heuristic for Y2-Y3 follow-on hires to support 10 then 15 programs.
Startup-finance heuristic for U.S. senior software/security talent in defense-adjacent enterprise software.
A16
Loaded domain T&E compensation
210
USD K annual per FTE
Startup-finance heuristic for defense autonomy/T&E specialist with field credibility.
A17
Loaded forward-deployed engineer compensation
190
USD K annual per FTE
Startup-finance heuristic for implementation-heavy defense software engineering roles.
A18
Loaded sales or BD compensation
230
USD K annual per FTE
Startup-finance heuristic including base salary, commission, and benefits for enterprise defense sales.
A19
Loaded G&A or ops compensation
180
USD K annual per FTE
Startup-finance heuristic for finance/ops support in a venture-backed startup.
A20
Non-payroll overhead
Meaningful cloud, travel, legal, insurance, and security-compliance spend layered on top of payroll
qualitative
[BP operations, risks, fundingAsk] air-gapped deployment, audit logs, and field deployment support make non-payroll opex materially higher than a generic SaaS startup.
A21
Steady-state CAC
250
USD K per new production customer
Startup-finance heuristic anchored to founder-led high-ACV enterprise sales with a small but targeted funnel and defense travel costs.
A22
Steady-state monthly churn
1.5
percent
Startup-finance heuristic: low logo churn for sticky program software, tempered upward for budget concentration and contract lumpiness.
A23
Next-round milestone
10 active programs, repeatable air-gapped deployment, and at least 2 annualized reference accounts by end of Y2
milestone
[BP milestones + fundingAsk] this is the proof package used to size the round and runway.
Flags: Public pricing comps are sparse, so ACV and CAC are anchored to the business plan plus clearly labeled startup-finance heuristics rather than disclosed contracts. · Revenue is modeled as ratable $600K per active program and excludes onboarding or usage upside, so actual bookings could be lumpier than this P&L. · Gross margin assumes sanitized telemetry, reusable connectors, and limited custom work; classified or bespoke deployments could push the model toward services. · The customer base is concentrated, so one slipped Army-adjacent program can move quarterly revenue and cash materially.
Section
Top risks
Procurement drag. Defense sales cycles can be slow enough to starve a young company before it reaches repeatable revenue. Mitigation: Start with venture-backed OEMs and primes before programs of record, selling into urgent field exercises and milestone prep where budgets already exist.
Sensitive-data friction. Mission data and telemetry may be restricted, limiting access to the highest-value workflows. Mitigation: Support air-gapped and on-prem deployments, and begin with unclassified test ranges plus sanitized telemetry feeds to prove ROI.
Internal-build pressure. Large primes may try to extend their own simulation stacks instead of buying an external assurance product. Mitigation: Focus on cross-platform evidence generation, mixed-fleet workflows, and benchmark data that internal tools lack and struggle to standardize.