SANDBOXAQ·bio·Scan 2026-05-18 to 2026-05-18·Run 20260519080120
Audit-ready copilot that turns Claude model sessions into reproducible experiment packets for pharma discovery teams.
Quantitative drug-discovery models are becoming easy to query through Claude, but most pharma teams still cannot use those outputs safely in real experiment planning. Once more scientists can ask for model recommendations in plain English, labs face a new failure mode: irreproducible prompts, hidden assumptions, and weak handoffs from model output into assay design or CRO instructions.
By Bizidea Research/
Overall rating3.7/ 5.0
3
Market
$360.0M TAM, 9.9% CAGR, and five mapped incumbents support a solid niche market, but not a category-scale breakout.
4
Differentiation
A model-neutral packet layer across Claude sessions, vendors, and CRO handoffs is sharper than point tools, though incumbents could copy parts.
4
Execution
Named early hires and staged milestones pair with 72% gross margin, 6.7x LTV/CAC, and 10-month payback, despite four model flags.
4
Timeliness
Four same-day signals around SandboxAQ's Claude launch create a fresh why-now moment for governed scientific AI workflows.
Section
Why now
Claude is now a live interface for quantitative scientific models rather than just a text assistant, which expands the set of scientists who can invoke them.
Independent coverage says the interface is the bottleneck for adoption, so workflow control around that interface becomes the natural next layer to buy.
Natural-language access removes the coding barrier, which raises the urgency for reproducibility and review before model outputs influence assay decisions.
SandboxAQ is already extending the same interface to AQPotency and AQCell, so the problem will compound as more model-backed decisions enter one discovery workflow.
Catalyst.SandboxAQ's Claude launch shows that quantitative models are moving from specialist tooling into a mainstream LLM interface, creating immediate demand for controls that make those outputs usable in real R&D decisions.
Section
The idea
The product connects Claude to approved internal and external quantitative models through an MCP control plane built for scientific workflows. Scientists can ask natural-language questions, but each workflow is constrained by assay-specific templates, parameter bounds, required controls, and citation of the underlying evidence. The software records prompt history, datasets, model versions, and uncertainty notes, then packages the result into an experiment brief a bench team or CRO can actually execute. It also compares outputs across model versions and flags when a recommendation relies on out-of-domain assumptions or missing validation. The initial product focuses on potency ranking and next-experiment planning for hit-to-lead programs where each bad handoff costs weeks of assay time.
What's different. SandboxAQ and similar vendors sell model access, while incumbent lab software stores downstream records; this company owns the missing layer in between: governed conversion of natural-language model sessions into executable experiment packets. Because it is model-neutral and workflow-specific, it can sit across internal models, vendor models, and future MCP endpoints instead of betting on one scientific model stack. Over time, its proprietary asset becomes the library of approved assay templates, review policies, and model-to-outcome traces that make scientific AI usable at enterprise scale.
Startup thesis
Beachhead
Small-molecule hit-to-lead teams at top-30 pharma companies that already license external quantitative models but still depend on a handful of computational specialists to turn potency and cell-model outputs into wet-lab assay plans.
Wedge
A Claude-native experiment packet builder that captures prompt, dataset, model version, assumptions, and reviewer sign-off, then converts model outputs into assay-ready recommendations for potency-ranking and next-experiment planning.
Non-obvious insight
MCP-native access does not just democratize scientific models; it shifts the bottleneck to reproducibility and trust. As soon as non-specialists can invoke frontier quantitative models from Claude, the scarce asset is no longer model access but a workflow layer that turns free-form queries into reviewable, SOP-bound experiment decisions.
Venture-scale path
Starting with potency and cell-model workflows, the company can expand into ADME, safety, formulation, materials, and catalyst programs, becoming the system of record for model-to-experiment decisions across enterprise scientific R&D.
Target user
Primary user
Translational pharmacology and medicinal chemistry scientists at large pharma companies who need model-backed next-step recommendations without writing code.
Secondary user
Discovery informatics teams responsible for approved scientific software, model access, and auditability.
Economic buyer
Head of Discovery Informatics or VP Computational Chemistry
Go-to-market seed
First customer
A top-30 pharma discovery program with 10-30 medicinal chemistry and translational pharmacology scientists running weekly hit-to-lead review meetings and outsourcing assay execution to one or more CRO partners.
Buying trigger
A team adopts Claude-accessible quantitative models and suddenly needs reproducible model outputs for weekly compound-prioritization decisions.
Current alternative
Manual analyst mediation plus PowerPoint summaries, ELN notes, and bespoke scripts maintained by computational chemistry specialists.
Switching reason
The wedge saves scarce specialist time while giving bench and informatics leaders a reviewable experiment packet they can trust, share, and audit across internal teams and CROs.
Pricing hypothesis
Annual platform subscription priced by active discovery program plus usage-based fees for governed model runs and exported experiment packets.
Jobs to be done
Job
Current alternative
Success metric
When weekly hit-to-lead reviews turn model outputs into assay requests, help translational pharmacology leads produce a reproducible recommendation packet, so they can approve the next experiment without chasing a model specialist.
PowerPoint summaries assembled by computational chemistry experts and emailed bench notes
Time from model query to approved assay request drops from days to hours
When discovery informatics teams roll out Claude-accessible scientific models, help them enforce approved templates and review trails, so they can expand usage without increasing governance risk.
Ad hoc prompt guidance documents and manual audit collection
Share of governed model-backed decisions captured with full provenance exceeds 90 percent
From Claude query to assay packet
flowchart LR
Buyer[Discovery team] --> Pain[Model outputs are hard to trust and operationalize]
Pain --> Product[Governed Claude to experiment packet layer]
Product --> Outcome[Faster and reproducible assay decisions]
Idea scorecard — average4.2 / 5 · 5axes
Signal · 4/5Multiple fetched sources confirm the Claude integration and its relevance to drug and materials workflows.
Pain · 4/5Broadening access to quantitative models creates real reproducibility and handoff pain for high-cost R&D decisions.
Wedge · 5/5The first product is a narrowly defined experiment packet layer for hit-to-lead potency and cell-model workflows.
Defense · 4/5Workflow templates, review policies, and model-to-outcome traces create switching costs beyond a generic chatbot wrapper.
Scale · 4/5The beachhead can expand from drug discovery into adjacent pharma and materials R&D workflows that share the same control problem.
Business model canvas
Key partners
Scientific model vendors such as SandboxAQ
CROs executing downstream assays
Pharma discovery informatics teams providing workflow requirements
Key activities
Building governed model orchestration and provenance capture
Validating packet outputs against historical discovery workflows
Expanding templates across potency, cell, and adjacent R&D use cases
Key resources
MCP integration layer for internal and external scientific models
Library of assay-specific workflow templates and review policies
Scientific implementation team with discovery informatics credibility
Value propositions
Convert natural-language model sessions into reproducible experiment packets
Reduce dependence on scarce computational specialists for routine model-backed decisions
Create audit and review trails before model outputs reach the lab or CRO
Customer relationships
High-touch implementation with assay-template setup
Shared validation and benchmark reviews with scientific leadership
Expansion through additional model workflows inside the same R&D organization
Channels
Direct enterprise sales to discovery informatics and platform leaders
Design-partner deployments with translational pharmacology teams
Integrations with existing scientific model vendors and CRO workflows
Discovery informatics groups standardizing scientific AI tooling across R&D
Cost structure
Scientific workflow engineering and implementation
Enterprise integration and security support
Validation studies and customer-specific template development
Revenue streams
Annual software subscription per discovery program
Usage fees for governed model runs and packet exports
Professional services for workflow validation and template deployment
Section
Market
Market sizing
Market sizing overview
TAM
$360.0MBottom-up estimate: 150 large biopharma and adjacent discovery organizations globally × est. 8 AI-exposed hit-to-lead or preclinical programs per organization × est. $300k annual program ACV; cross-check remains well below the $6.93B 2025 AI-in-drug-discovery market.
SAM
$45.0MBeachhead estimate: top-30 pharma targets × est. 5 small-molecule hit-to-lead programs per company that already use external models or heavy CRO handoffs × est. $300k annual program ACV.
SOM
$4.5MReachable year-3 case: 5 enterprise customers × 3 live programs per customer × est. $300k annual ACV after one design-partner land and several adjacent-program expansions.
Executive takeaways
Large-pharma scientists are already using copilots as a default interface, but adoption falls in the most regulated and data-messy workflows where trust breaks [5][24].
The why-now is concrete: Claude-accessible scientific models and MCP widen model access faster than informatics teams can manually govern outputs [1][2][3][4].
Incumbents are moving fast, but they mostly win when work already lives inside their own stack; the white space is model-neutral capture of external Claude sessions and conversion into assay-ready packets [6][7][13][14][15].
The product should be sold as governed workflow infrastructure, not as a general chatbot, because the pain is manual handoff, reproducibility loss, and slow CRO or wet-lab packet preparation [8][10][11].
Market definition
The relevant category is governed scientific-workflow software that captures LLM- and model-mediated discovery reasoning, links it to source data and approvals, and exports execution-ready experiment packets rather than loose chat transcripts [3][6][8][17].
Customer and buyer
Primary users are translational pharmacology and medicinal chemistry scientists who can now access advanced models through natural-language interfaces but still need discovery informatics leaders to make outputs reviewable, shareable, and executable. The economic buyer is the Head of Discovery Informatics or VP Computational Chemistry because the purchase touches validation, security, data structure, and cross-team workflow design [5][9][11].
Buying triggers
A team enables Claude- or MCP-accessible scientific models and suddenly has more people producing model-backed recommendations than it has specialists available to document and review.[1][2][3][4]
Program leaders realize weekly review decisions still require manual PowerPoint, notebook, and CRO-PDF synthesis before anyone can approve an assay request.[8][10][11]
An AI pilot reaches security, IP, or compliance review and stalls because provenance, validation, and auditability are not yet reliable enough for scaled rollout.[5][6][9][21][22]
Willingness to pay
Willingness to pay should be high when framed as program infrastructure: incumbent platforms already sell customized enterprise bundles with validation support, while users still spend hours to weeks on report prep, CRO-data wrangling, and institutional-memory recovery.[9][10][11]
Category dynamics
Growth signal 9.9% CAGR
Tailwinds
Scientists are already using copilots as a default interface, which expands the need for workflow controls beyond specialist modelers.
Claude and MCP are lowering integration friction between AI assistants and scientific systems.
Large-pharma R&D continues to seek productivity gains as AI-enabled discovery matures.
Headwinds
Existing informatics vendors are rapidly adding agentic AI, traceability, and reporting features.
Scientific data fragmentation can still make AI outputs untrustworthy even with better orchestration.
Buyers may prefer extending an existing ELN or DMTA system over adding another validation surface.
Validation signals
89% of surveyed biotech scientists already use copilots or reasoning tools as a first stop for interrogating data.
Benchling reports that scientific AI report prep can drop sharply when the workflow is structured and source-linked.
SandboxAQ explicitly says advanced quantitative models can now be accessed through plain-English prompts inside Claude.
Benchling’s Anthropic integration already markets one-click traceability and automatic audit-log carryover, proving buyers care about this layer.
Regulatory & technical constraints
Electronic records in regulated environments need validated systems, retrievable records, and secure time-stamped audit trails.
Approval workflows need signature meaning, timestamp, and non-repudiation features if they are to substitute for manual review records.
AI governance increasingly requires documented risk management, human oversight, and traceability rather than opaque model outputs.
Model recommendations are only as reliable as the structured data and contextual metadata feeding them.
Scientific AI workflow map
Section
Competition
Competition is real but fragmented. Benchling and Dotmatics own structured-system-of-record territory, Schrödinger owns DMTA collaboration, BenchSci owns evidence-grounded disease-biology copilots, and SandboxAQ owns the model-access wedge. None of them clearly owns cross-model, external-Claude session capture plus experiment-packet governance as a standalone layer [6][7][12][13][14][15].
Competitor
Stage
Wedge
Pricing
Strength
Weakness vs. us
SandboxAQ
scale-up
Physics-grounded LQMs for drug discovery exposed through MCP and enterprise licensing.
Strong model depth and a clear path to put quantitative models directly behind Claude-like interfaces.
Owns model access, not a neutral review-and-packet layer across multiple model vendors and external sessions.
Benchling
incumbent
Unified R&D data platform with agents, model hub, MCP connectors, and compliance-ready audit trails.
Custom enterprise platform pricing with add-ons, validation support, and services
Best-positioned incumbent for teams already inside a structured ELN and system-of-record workflow.
Less natural for capturing and governing model decisions that happen outside Benchling before they need to become an experiment packet.
Dotmatics Luma
incumbent
AI-native multimodal scientific platform with logged actions, approval gates, and audit-ready reporting.
Custom enterprise pricing (not publicly disclosed on fetched pages)
Strong governance narrative and breadth across scientific data and workflows.
Broad platform scope can make the external-Claude-to-packet wedge feel like one feature among many rather than the core product.
Schrödinger LiveDesign
incumbent
Cloud-native DMTA collaboration and ML co-pilot tightly integrated into computational design workflows.
Enterprise software / request-demo model
Deep fit for medicinal chemistry and explicit support for CRO-partner data sharing.
Optimized for Schrödinger-centric design workflows rather than portable governance across mixed assistants and model stacks.
BenchSci
scale-up
Disease-biology copilot built on a curated biological evidence knowledge graph.
Custom enterprise pricing (not publicly disclosed on fetched pages)
Strong evidence grounding and scientific credibility in preclinical ideation and experiment design.
More focused on the evidence and target-selection side than on cross-model provenance, approvals, and assay-packet export.
Why incumbents do not win by default
Cloud platforms.Claude and MCP increase access, but the platform itself does not become the scientific system of record or approval workflow by default.
ELN and informatics suites.Benchling-like platforms win when work already happens inside their data model, but they do not automatically capture external sessions or model decisions that occur outside their boundary.
Model vendors.SandboxAQ-class vendors differentiate on predictive models and licensing, not on neutral provenance, cross-model comparison, or reviewer sign-off across the broader workflow.
Computational design suites.Schrödinger-class tools are strong for in-platform DMTA collaboration, but their collaboration model is still coupled to their design environment rather than a portable packet layer across mixed systems.
Section
Business plan
This company should be built as a governed experiment-packet layer for large-pharma hit-to-lead teams that are starting to use Claude-accessible quantitative models in weekly compound-prioritization work. The first user is a translational pharmacology or medicinal chemistry lead who wants model-backed next-step recommendations without waiting on a computational specialist. The economic buyer is the Head of Discovery Informatics or VP Computational Chemistry because the purchase touches validation, model access policy, and CRO handoff. The MVP should not try to replace Benchling, Dotmatics, LIMS, or the upstream model vendor; it should capture external Claude sessions, enforce approved assay templates, and export reviewer-signed packets that a bench team or CRO can execute. The best wedge is U.S.-led small-molecule potency-ranking and next-experiment planning inside top-30 pharma because the workflow repeats weekly, the handoff pain is visible, and the validation surface is narrower than broader scientific AI governance. Research supports real demand for traceability and workflow control, but it does not yet prove buyers will fund a neutral layer before waiting for Benchling, Dotmatics, or SandboxAQ to extend their products. The company can win if it becomes the fastest path from free-form model query to audit-ready assay packet across mixed model stacks, then compounds into packet templates, review policies, and packet-to-outcome traces. The board-level question is whether early pilots show enough cycle-time reduction and specialist-time savings to create budget pull before incumbents close the gap.
Problem
As Claude and MCP expose quantitative models to more scientists, weekly hit-to-lead decisions create irreproducible prompts, hidden assumptions, and manual packet prep before anyone can approve an assay request.
Discovery informatics teams must govern model-backed recommendations across external sessions, multiple model vendors, and CRO handoffs, but ELNs and model vendors mostly capture records inside their own stack after the fact.
Each bad handoff can cost weeks of assay time and erode trust in scientific AI, so buyers need reviewability before they need broader automation.
Solution
Capture Claude session context, approved datasets, model version, assumptions, and reviewer sign-off in assay-specific packet templates for potency ranking and next-experiment planning.
Export an execution-ready experiment brief for internal labs or CRO partners while blocking packet generation when data, parameter bounds, or provenance are incomplete.
Compare outputs across model versions and surface out-of-domain warnings so informatics leaders can scale model access without surrendering control.
Why we win
The wedge is model-neutral governance between Claude sessions and the experiment record, not another model vendor or ELN replacement.
The buying trigger is specific: once a team enables Claude-accessible scientific models, review meetings generate more model-backed recommendations than specialists can manually document.
Wet-lab and CRO handoffs create measurable ROI through faster packet preparation, lower specialist mediation, and fewer review loops.
Reusable packet templates, reviewer policies, and packet-to-outcome traces can compound into switching costs if the startup lands early reference programs.
Strategic choices
Beachhead
U.S.-based small-molecule hit-to-lead teams inside top-30 pharma companies that already use external quantitative models and outsource meaningful assay execution to CRO partners.
Wedge rationale
This slice has a weekly decision cadence, high-cost downstream experiments, and a clear informatics buyer who feels the governance pain as soon as model access broadens beyond specialists. It creates faster proof than a broad scientific-AI governance product because one packet type, one buyer, and one ROI story can be validated first.
Sequencing
Start with potency-ranking and next-experiment packets because they sit closest to a concrete approval event and can be sold as an overlay on existing ELN and model stacks. Only after the company proves pilot-to-production conversion should it add adjacent workflows, deeper system-of-record integrations, and broader geographic rollout, keeping product scope, sales motion, hiring, and compliance burden aligned.
Not yet
Materials science and catalyst workflows before pharma proof exists · ADME, safety, and developability modules before potency and cell-model packet templates are repeatable · Full ELN, LIMS, or model-vendor replacement · EU-first expansion before the U.S. design-partner playbook and traceability controls are proven
Go-to-market
Wedge
Sell the company as the control layer that lets top-30 pharma teams use Claude-accessible quantitative models in weekly hit-to-lead decisions without adding governance debt or manual packet assembly.
Channels
Founder-led enterprise sales to Heads of Discovery Informatics, VP Computational Chemistry, and translational pharmacology leaders · Design-partner deployments inside one live hit-to-lead program with weekly review meetings · Upstream model-vendor and Claude ecosystem introductions where downstream operationalization is missing · CRO and assay-operations partner referrals tied to packet export and handoff pain
Funnel targets
Target account to qualified buyer 20%+, qualified buyer to paid pilot 25%+, paid pilot to production program 50%+, first program to second-program expansion 40%+ within 12 months
Pricing
Annual subscription priced per live discovery program with a paid implementation and validation package, plus usage-based fees for governed model runs or packet exports at higher volumes. This fits the buyer's value equation because it replaces specialist mediation and manual packet preparation at the program workflow level rather than selling another end-user seat.
Product roadmap
MVP
Concierge-assisted control plane for one hit-to-lead program that captures Claude sessions, enforces potency or cell-model packet templates, records signature-linked approvals, and exports CRO-ready packets into existing review workflows. Human approval remains mandatory for every packet and the product acts as an overlay rather than replacing ELN or LIMS systems.
6 months
Ship the first production MVP for potency-ranking and next-experiment packet generation with dataset references, model-version tracking, out-of-domain warnings, and export formats usable by internal labs and CRO partners.
12 months
Add a reusable packet-template library for potency and cell-model workflows, cross-model comparison, approval dashboards, and evidence packs that quantify prep-time reduction, packet completeness, and reviewer throughput across the first several programs.
24 months
Expand into adjacent discovery workflows such as ADME and developability, deeper ELN and LIMS integrations, and portfolio views that benchmark packet quality and downstream assay outcomes across programs.
Key bets
Potency-ranking and next-experiment planning are the narrowest workflows with enough frequency and pain to support a repeatable product. · Buyers will accept an overlay with human approval faster than they will accept a broader autonomous scientific agent. · Packet generation can remove most manual documentation burden without weakening scientific judgment or auditability. · Cross-program packet templates and outcome traces become more defensible than a one-off services workflow.
Business model
Revenue streams
Annual software subscription per governed discovery program · Implementation and validation fees for packet-template setup and workflow mapping · Usage-based fees for governed model runs and packet exports above contracted baseline volume · Premium portfolio analytics and additional workflow modules for repeat enterprise customers
Unit of value
Live discovery program with approved packet templates, governed model sessions, and production handoff workflows
Target gross margin
72%
Expansion levers
Additional hit-to-lead and adjacent preclinical programs within the same pharma account · Expansion from potency and cell-model packets into ADME, developability, and safety workflows · Premium cross-model comparison and packet-to-outcome analytics modules · Partner-led distribution through model vendors, scientific consultancies, and CRO integrations
Strategy map
North-star metric
Number of approved model-backed experiment packets that convert into executed assays with full provenance and no manual rework
Input metrics
Median hours from model query to reviewer-approved packet · Percentage of packets accepted at first review without missing provenance fields · Specialist hours removed from weekly review preparation per active program · Paid pilot to production-program conversion rate · Percentage of production customers adding a second governed program within 12 months
Moats to build
Library of approved assay templates, parameter bounds, and review policies mapped to real hit-to-lead workflows · Packet-to-outcome dataset linking prompts, models, assumptions, and downstream assay results · Embedded approval and CRO-handoff workflows that sit across mixed model vendors and record systems
Kill criteria
Fewer than 3 of the first 10 target pharma accounts agree to fund a paid pilot in the current budget cycle · The first 3 paid pilots fail to cut packet-preparation time by at least 50% versus the team's current manual process · Pilot-to-production conversion stays below 40% after the first 5 paid deployments · More than half of serious prospects insist the need should wait for an incumbent ELN or model vendor release
Milestones
0–12 months
Land 2 paid design-partner pilots inside top-30 pharma hit-to-lead programs
Ship production MVP for potency-ranking and next-experiment packet generation with approval logs and CRO-ready export
Prove at least 50% reduction in packet-preparation time in one live weekly review workflow
Establish a repeatable validation package that passes customer QA and informatics review in pilot deployments
12–24 months
Convert at least 2 pilot accounts into annual production contracts and expand one account to a second governed program
Add reusable cell-model templates, cross-model comparison, and portfolio reporting across the first customer base
Formalize one CRO or scientific-services integration path that reduces manual packet reformatting
Build a reference implementation playbook that keeps deployment under 10 weeks for new accounts
24–36 months
Expand into adjacent workflows such as ADME or developability after proving the initial potency wedge
Build packet-to-outcome benchmarks across multiple customers to strengthen retention and pricing power
Enter broader U.S. and selected EU pharma deployments once the traceability and validation posture is mature
Establish partner-led distribution through model vendors or scientific workflow consultancies
Strategy map
flowchart LR
Wedge[Hit-to-lead packet wedge] --> MVP[Governed Claude session to packet MVP]
MVP --> Proof[Faster approvals and cleaner CRO handoffs]
Proof --> Expansion[Multi-workflow scientific AI control layer]
Founding team
Role
Start timing
Rationale
Founder / CEO
Month 0
Founder-led selling is required because the first deals depend on buyer discovery, packaging, and partner trust more than scaled demand generation.
Founding eng
Month 0
The core technical risk is reliable session capture, provenance enforcement, approval logging, and repeatable export into mixed enterprise systems.
Scientific workflow lead
Month 0
The product must encode assay-specific templates and reviewer logic that match real hit-to-lead decision workflows rather than generic AI governance theory.
Solutions architect
Month 4
Enterprise pilots need someone who can translate customer record systems, CRO handoffs, and validation needs into standard deployment patterns.
Quality and compliance lead
Month 9
As pilots move into production, the company needs dedicated ownership of validation artifacts, traceability controls, and audit-readiness posture.
Experiment roadmap
Horizon
Experiment
Hypothesis
Success metric
Owner
0–90 days
Interview 15 discovery informatics, computational chemistry, and translational pharmacology leaders at top-30 pharma targets.
External-Claude packet governance is a current budgetable pain tied to model rollout and weekly review workflows.
At least 8 buyers rank the problem in their top three near-term workflow priorities and at least 3 agree to scope a paid pilot.
Founder / CEO
0–90 days
Shadow one live hit-to-lead weekly review cycle and manually generate a packet prototype from the team's current artifacts.
Packet prep time, provenance gaps, and CRO-handoff friction are visible enough to support a narrow ROI story.
Document a baseline process that takes at least one business day of manual preparation and identify 5 or more recurring packet fields that can be templatized.
Scientific workflow lead
90–180 days
Close 2 paid pilot deployments for potency-ranking and next-experiment packet generation.
Buyers will pay before full automation exists if the product fits existing approval workflows and keeps humans in the loop.
Two paid pilots at or above $100k each with a defined production conversion path and no requirement to replace incumbent ELN systems.
Founder / CEO
90–180 days
Validate one CRO export integration and one approval-signature workflow with a design partner.
The startup can own the packet layer without being forced to become the system of record.
One pilot account accepts production use of exported packets and review logs without a critical QA or audit blocker.
Founding eng
180–365 days
Measure cycle-time reduction, packet completeness, and specialist hours saved across the first paid pilots.
Operational ROI is strong enough to convert pilots into annual program subscriptions before broader outcome data accrues.
At least 50% lower packet-preparation time, at least 90% packet completeness at first review, and one pilot-to-production conversion.
Solutions architect
Risk assessment
Business plan risks — 5 mapped
Impact →
High
R3
R4
R1
R2
Medium
R5
Low
Low
Medium
High
Likelihood →
R1Incumbent ELN or model vendors add enough packetization and traceability to narrow the standalone wedge. · Highlikelihood / Highimpact — Differentiate on neutrality across external sessions and model vendors, faster deployment, and deeper workflow-specific packet templates.
R2Pharma validation and procurement cycles delay pilots long enough to starve learning and revenue. · Highlikelihood / Highimpact — Start in shadow mode, keep human approval mandatory, and sell into one live program with a narrow workflow and clear operational ROI.
R3Scientists reject the product if packet generation feels like extra documentation work. · Mediumlikelihood / Highimpact — Auto-fill packet fields from session context, measure edit time explicitly, and cut scope until the workflow is faster than the current manual process.
R4Poor source data or out-of-domain model output undermines trust in the packet regardless of the governance layer. · Mediumlikelihood / Highimpact — Require dataset references, parameter bounds, and uncertainty warnings in every packet and block export when evidence is incomplete.
R5The beachhead is too narrow to support venture-scale growth if adjacent workflows do not open after early proof. · Mediumlikelihood / Mediumimpact — Treat potency workflows as a proof wedge, but test expansion pull into cell models and adjacent preclinical packets within the first 18 months.
Risk
Likelihood
Impact
Mitigation
Incumbent ELN or model vendors add enough packetization and traceability to narrow the standalone wedge.
High
High
Differentiate on neutrality across external sessions and model vendors, faster deployment, and deeper workflow-specific packet templates.
Pharma validation and procurement cycles delay pilots long enough to starve learning and revenue.
High
High
Start in shadow mode, keep human approval mandatory, and sell into one live program with a narrow workflow and clear operational ROI.
Scientists reject the product if packet generation feels like extra documentation work.
Medium
High
Auto-fill packet fields from session context, measure edit time explicitly, and cut scope until the workflow is faster than the current manual process.
Poor source data or out-of-domain model output undermines trust in the packet regardless of the governance layer.
Medium
High
Require dataset references, parameter bounds, and uncertainty warnings in every packet and block export when evidence is incomplete.
The beachhead is too narrow to support venture-scale growth if adjacent workflows do not open after early proof.
Medium
Medium
Treat potency workflows as a proof wedge, but test expansion pull into cell models and adjacent preclinical packets within the first 18 months.
First customer
Title
Head of Discovery Informatics sponsoring one small-molecule hit-to-lead design-partner program
Profile
Top-30 pharma team with 10 to 30 medicinal chemistry and translational pharmacology scientists, active external model usage, and recurring CRO assay handoffs from weekly review meetings.
Trigger
Claude-accessible quantitative models are enabled for a live program and review meetings start producing more model-backed recommendations than specialists can package and approve.
Buyer
Head of Discovery Informatics or VP Computational Chemistry
Initial contract
$100k-$150k paid pilot for one governed hit-to-lead workflow and packet-template setup, converting to a $250k-$400k annual program subscription once the team uses production packets in weekly reviews and CRO handoffs.
What must be true
At least 3 of the first 10 target pharma accounts treat external-Claude packet governance as a budgetable problem this year.
The first 3 pilots cut packet-preparation time by at least 50% without increasing scientific rework.
More than 80% of packet fields can be auto-generated from session and model context with less than 15 minutes of reviewer edits.
At least 2 early pilots convert into annual production contracts of $250k or more within 6 months of pilot completion.
At least one production customer expands to a second program or adjacent workflow within 12 months.
Open diligence questions
Will discovery informatics leaders buy a neutral packet layer now or wait for Benchling, Dotmatics, or SandboxAQ to extend their stack?
Is potency-ranking and next-experiment planning the best first workflow, or does another packet type convert faster with buyers?
What KPI opens budget first in practice: specialist-time savings, cycle-time reduction, packet completeness, or compliance posture?
How much manual review must remain for buyers to trust the system in assay-planning decisions?
Can the company secure enough rights to build packet-to-outcome benchmarks without triggering IP or data-sharing objections?
Investor verdict
Call
Watch
Conviction
Strong workflow wedge and buyer pain, but conviction stays limited until a neutral packet layer proves separate budget pull against incumbents.
Why believe
The company targets a concrete operating failure between model output and assay execution where reviewability, provenance, and CRO handoff quality matter more than adding another frontier model.
Why doubt
Incumbent ELN, workflow, and model vendors may add enough packetization and traceability to slow adoption before the startup wins reference accounts.
Next diligence
The next proof point is 2 to 3 paid top-pharma pilots that show materially faster packet approval, lower specialist load, and at least one conversion into a production program.
Section
Financial model
3-year totals
Year 1 revenue
$400KEBITDA $-757K · Cash EOP $2.24M
Year 2 revenue
$1.28MEBITDA $-779K · Cash EOP $1.46M
Year 3 revenue
$2.34MEBITDA $-506K · Cash EOP $958K
Unit economics
ARPU (annual)
$300K
Gross margin
72%
CAC
$180KPayback 10.0 months
LTV / CAC
6.7xLTV $1.20M
Funding ask
Round
seed · $3.0M
Runway
18 months
Milestone
Reach 5 production governed programs across at least 3 pharma accounts, prove one second-program expansion, and keep new deployments under 10 weeks while preserving a 6-month cash buffer.
Model sanity
Revenue engine. Base-case revenue is driven by turning 2 early pilots into production, adding 3 more programs through Y2, and ending Y3 with 9 live governed programs at roughly $300K ACV.
Must go right. Pilot-to-production conversion must stay near the 50% business-plan target and at least one account must expand to a second program within 12 months.
Model breaks if. If buyers wait for incumbents and the cycle drifts toward 9 months, downside cash falls toward roughly $0.1M and the company needs a bridge before Y3 ends.
Next-round proof. The next financing is supported once the company exits seed with 5 production programs, one repeatable expansion motion, and proof that deployments stay under 10 weeks.
Revenue, cash, and EBITDA — 12-month Y1 + 8-quarter Y2/Y3
Revenue (line, area)
Cash EOP (dashed)
EBITDA (bars, gray = loss)
Use of funds — $3.0M seedHeadcount build by role — peak11 FTE
Founder / CEO
Engineering
Scientific workflow
Solutions architect
Quality and compliance
Sales / GTM
Customer success / ops
Year-3 scenarios — base / downside / upside
Y3 revenue
Y3 EBITDA
Cash low point
Description
Downside
$1.71M
-$1.03M
$87K
Incumbent feature catch-up and slower buyer urgency stretch the cycle toward 9 months, leaving fewer programs live by the end of year 3.
Base
$2.34M
-$506K
$958K
Two early paid pilots convert into a repeatable governed-program motion that reaches 5 production programs by the end of Y2 and 9 live programs by the end of Y3.
Upside
$3.03M
$82K
$1.95M
Cycle-time savings become obvious in the first reference accounts, shortening the sales cycle and pulling forward second-program expansion.
Sensitivity — Y3 cash and revenue impact, sorted by magnitude
Variable
Downside
Upside
Cash impact
Revenue impact
sales cycle
9 months because buyers wait for incumbent ELN or model-vendor roadmaps.
4-5 months once reference implementations and partner intros shorten diligence.
-$540K
-$638K
ARPU
$250K ACV if buyers cap usage at one narrow workflow and resist premium analytics.
$340K ACV with adjacent workflow expansion and analytics upsell.
-$310K
-$430K
hiring pace
Pull forward extra post-sale and compliance hires before production proof is visible.
Delay the second customer-success hire until partner-sourced expansions are repeatable.
-$210K
$0K
gross margin
68% if pilots stay services-heavy and validation work remains bespoke.
75% once packet templates and QA assets are reusable across accounts.
-$180K
$0K
CAC
$220K CAC if every deployment needs bespoke founder and solutions effort.
$150K CAC if one partner channel pre-qualifies buyers and implementation needs less custom work.
-$160K
$0K
churn
2.5% monthly churn if the product stays project-like and buyers delay multi-program standardization.
1.0% monthly churn once packet templates and outcome traces become embedded.
-$95K
-$120K
Scenarios
Scenario
Y3 revenue
Y3 EBITDA
Cash low point
Description
Key changes
Downside
$1.71M
$-1.03M
$87K
Incumbent feature catch-up and slower buyer urgency stretch the cycle toward 9 months, leaving fewer programs live by the end of year 3.
Production ACV falls from $300K to $250K per governed program.
Gross margin drops from 72% to 68% as implementation stays more services-heavy.
The company exits Y3 with 8 live programs instead of 9 and slower pilot-to-production timing.
Base
$2.34M
$-506K
$958K
Two early paid pilots convert into a repeatable governed-program motion that reaches 5 production programs by the end of Y2 and 9 live programs by the end of Y3.
Production ACV stays at $300K per program and pilot pricing stays at the midpoint of the business-plan range.
Gross margin holds at the 72% target from the business plan.
Pilot-to-production conversion stays near 50% and one account expands to a second program within 12 months.
Upside
$3.03M
$82K
$1.95M
Cycle-time savings become obvious in the first reference accounts, shortening the sales cycle and pulling forward second-program expansion.
Production ACV rises from $300K to $340K as buyers adopt premium analytics and adjacent workflow coverage.
Gross margin improves from 72% to 75% as packet templates and validation assets standardize delivery.
The company reaches 10 live programs by Q4Y3 because one partner channel and faster 4-5 month cycles accelerate expansion.
Sensitivity
Variable
Downside
Base
Upside
ARPU
$250K ACV if buyers cap usage at one narrow workflow and resist premium analytics.
$300K ACV per governed program.
$340K ACV with adjacent workflow expansion and analytics upsell.
CAC
$220K CAC if every deployment needs bespoke founder and solutions effort.
$180K CAC in the modeled founder-led enterprise motion.
$150K CAC if one partner channel pre-qualifies buyers and implementation needs less custom work.
churn
2.5% monthly churn if the product stays project-like and buyers delay multi-program standardization.
1.5% monthly churn.
1.0% monthly churn once packet templates and outcome traces become embedded.
sales cycle
9 months because buyers wait for incumbent ELN or model-vendor roadmaps.
6 months with founder-led selling and validation-first deployments.
4-5 months once reference implementations and partner intros shorten diligence.
gross margin
68% if pilots stay services-heavy and validation work remains bespoke.
72% target gross margin.
75% once packet templates and QA assets are reusable across accounts.
hiring pace
Pull forward extra post-sale and compliance hires before production proof is visible.
Add only one more engineer in Y2 and scale solutions and customer success after conversions.
Delay the second customer-success hire until partner-sourced expansions are repeatable.
Key assumptions (19)
ID
Name
Value
Unit
Source
A1
Model start month
2026-06
month
Starts the first full month after the 2026-05-19 business-plan date.
A2
Customer counting unit
Governed discovery program
customer_definition
[BP businessModel.unitOfValue] Pricing and expansion are per live discovery program, so modeled customers are governed programs rather than enterprise logos.
A3
Paid pilot package
$125.0K over 4 months
usdK_per_pilot
[BP investorMemo.firstCustomer.initialContract] Midpoint of the stated $100k-$150k paid pilot range for one governed workflow and packet-template setup.
A4
Production ACV
$300.0K per governed program per year
usdK_per_program_year
[BP market.som + research.market.som] Both files size the reachable year-3 case at roughly $300K annual ACV per governed program.
A5
Revenue recognition per program
$31.25K per month during the 4 pilot months, then $25.0K per month in production
usdK_per_program_month
Derived from A3 and A4 so revenue reconciles to pilot months plus annual subscriptions.
2 live programs by M8, 5 by Q4Y2, and 9 by Q4Y3 with 7 in production by the end of Y3
live_programs
[BP milestones + research.market.som] Mirrors the plan for 2 paid pilots in Y1, 2+ production conversions and one second-program expansion in Y2, then broader multi-program rollout in Y3 while staying below the full 15-program SOM.
A8
Sales cycle and conversion
6-month median enterprise cycle, 25% qualified-buyer-to-paid-pilot, and 50% pilot-to-production conversion
funnel
[BP gtm.funnelTargets + BP risks] Base case keeps the business-plan conversion targets but assumes pharma validation and procurement still stretch the cycle to about 6 months.
A9
Monthly churn
1.5%
percent
Startup-finance heuristic for sticky but still early enterprise workflow software sold into top-pharma programs.
A10
Fully loaded CAC
$180.0K per production program
usdK_per_program
[BP gtm.channels + research.reportMemo.distributionChannels] Founder-led enterprise sales, pilot delivery, and validation-heavy deployments imply a high but still manageable CAC.
Startup-finance heuristic for U.S.-based early-stage life-sciences workflow software, anchored to the business-plan team roles and enterprise hiring needs.
[BP team + strategicChoices.sequencingRationale] Follow the named Month 0, Month 4, and Month 9 hires first, then add only the minimum engineering and post-sale capacity needed after production proof.
A13
Non-payroll operating spend
Rises from $18K per month in Q1Y1 to $54K per month by Q4Y3
usdK_per_month
Startup-finance heuristic covering cloud compute, validation support, customer travel, legal/compliance work, and software tooling for a regulated enterprise deployment motion.
A14
Starting cash after seed close
$3.0M
usdM
[BP fundingAsk.targetFundingRangeUsd] Modeled at the low end of the stated $3-5M seed range to keep the plan lean and milestone-driven.
Derived from the modeled 18-month burn mix and the requirement to preserve six months of cushion after the seed milestone.
A16
Y2-Y3 opex smoothing
Quarterly opex rises gradually from $388.8K in Q1Y2 to $594.6K in Q4Y3 instead of stepping only at the required year-end snapshots
method
[Financial Modeler instructions] Salary and non-payroll costs are smoothed between snapshot columns so the quarterly opex path stays consistent with staged hiring.
A17
Downside scenario deltas
$250K ACV, 68% gross margin, 9-month cycle, and only 8 live programs by Q4Y3
scenario_inputs
Built from BP and research risks around incumbents closing the feature gap, optional budget perception, and longer pharma procurement.
A18
Upside scenario deltas
$340K ACV, 75% gross margin, 4-5 month cycle, and 10 live programs by Q4Y3
scenario_inputs
Upside assumes packet ROI is obvious early, one partner channel becomes repeatable, and second-program expansion lands faster than the base plan.
A19
Cash-flow simplification
Cash movement equals EBITDA in this operating model
method
Startup-finance heuristic for an early software company with no modeled debt, capex, or material working-capital swings in the plan horizon.
unit economics flow
flowchart LR
Targets[Target pharma programs] --> Pilots[Paid pilots]
Pilots --> Production[Production governed programs]
Production --> Expansion[Second-program expansion]
Expansion --> Revenue[Subscription and usage revenue]
Revenue --> GrossProfit[72% gross profit]
GrossProfit --> Opex[Hiring plus compliance and delivery spend]
Opex --> Cash[Ending cash]
Flags: The model assumes pharma buyers fund a neutral packet-governance layer now instead of waiting for Benchling, Dotmatics, or SandboxAQ to extend their stack. · Revenue per FTE only clears the low end of software benchmarks by Y3, so any extra services work or early hiring can meaningfully worsen burn efficiency. · The base case ends Y3 with 2 live pilots still converting, so next-round quality depends on maintaining the modeled pilot-to-production cadence. · Gross margin can slip quickly if validation packages, CRO exports, or data-mapping work remain bespoke account by account.
Section
Top risks
Platform encroachment. Scientific model vendors or Claude platform providers could add basic provenance and workflow packaging themselves. Mitigation: Start with model-neutral approvals, cross-vendor comparisons, and CRO handoff workflows that single-model vendors do not own.
Slow validation cycles. Pharma teams may require months of shadow-mode evidence before trusting a new layer in assay planning. Mitigation: Launch on one potency-ranking workflow, benchmark against historical campaigns, and prove cycle-time reduction before broader rollout.
Scientific liability. A bad recommendation or poorly bounded model output could undermine trust quickly in a high-stakes R&D environment. Mitigation: Keep a human approval gate, expose uncertainty and source provenance, and block packet export when inputs fall outside validated ranges.