Cross-modal biology OS that helps oncology biotechs rank and validate asset combinations before expensive expansion trials.
Clinical-stage oncology biotechs face a brutal combination-design problem once a lead asset shows partial efficacy but needs the right partner drug, biomarker logic, or patient subgroup to unlock the next trial. The relevant evidence sits across internal omics files, assay data, PK and safety readouts, biomarker analyses, public literature, and translational team intuition, so hypothesis generation is still run through slow meetings, siloed analysts, and consultant-heavy reviews.
Why now
- Cross-modal biology is now being treated as a distinct architecture category, which creates room for a focused software layer rather than another generic model wrapper.
- The expansion of a Qualified Access Program shows pharma and biotech teams are already willing to trial this kind of reasoning workflow inside real R&D programs.
- A live oncology engagement already turned a huge search space into 15 candidates and 3 actionable hypotheses, which is exactly the kind of decision compression buyers pay for.
- Specialist venture funding is arriving early, which means the category can be built before a few platform incumbents lock up customer data and workflow patterns.
Catalyst. Ingenix's financing, Qualified Access Program expansion, and oncology proof point show that cross-modal biology reasoning has moved from abstract platform promise to a workflow buyers can test immediately in high-stakes combination-design decisions.
The idea
The startup would sell a secure reasoning layer for translational oncology teams that need sharper, faster combination hypotheses without building a large internal platform group. Customers would connect their internal preclinical and early clinical datasets, map the relevant asset and tumor context, and receive a ranked set of partner-drug or biomarker hypotheses with transparent evidence trails across modalities. The product would not stop at scoring ideas; it would package why a hypothesis matters, what contradictory evidence exists, and which assays or retrospective analyses should validate it next. Each review packet would be designed for real portfolio and protocol meetings, so teams can move from scattered evidence to one defendable recommendation set. Over time, the moat becomes the feedback loop between predicted hypotheses, wet-lab validation, and downstream program outcomes across many oncology assets.
What's different. CROs provide analysis labor, pathway tools provide fragments of evidence, and foundation-model companies promise general biological intelligence, but none of them own the exact moment when a biotech must choose which combination hypotheses deserve scarce validation dollars. This startup wins by packaging cross-modal reasoning into a program-review workflow with transparent evidence trails and validation planning, not just another black-box score. Its defensibility compounds as more customers feed back which predicted combinations advanced, failed, or produced biomarker signals in real programs.
| Beachhead | Clinical-stage oncology biotechs with one Phase I or Phase II targeted therapy asset in solid tumors, internal translational datasets spread across multiple teams, and an active need to nominate 2-5 combination hypotheses before the next board, partnering, or protocol-expansion decision |
|---|---|
| Wedge | A cross-modal oncology combination workspace that ingests internal assay, omics, PK, biomarker, and early clinical readouts, then returns ranked partner-drug hypotheses, mechanistic rationale, and a validation-ready experiment brief for translational review meetings |
| Non-obvious insight | The newly valuable company is not another general-purpose biology foundation model. It is the program-specific reasoning layer that can fuse a biotech's messy internal datasets with external biology and turn them into a ranked, mechanistically grounded combination plan before management commits to expensive validation or expansion cohorts. |
| Venture-scale path | Start with oncology combination planning, then expand the same reasoning layer into target-indication matching, biomarker stratification, translational portfolio triage, licensing diligence, and eventually the decision backbone for biotech and pharma asset-development workflows. |
| Primary user | VP Translational Oncology or Head of Computational Biology at a clinical-stage oncology biotech planning combination-expansion studies around one lead asset |
|---|---|
| Secondary user | Translational medicine directors and biomarker leads responsible for preclinical-to-clinical hypothesis packages |
| Economic buyer | Chief Scientific Officer or SVP R&D at the biotech |
| First customer | A Series B to pre-IPO oncology biotech with one lead solid-tumor asset entering or exiting Phase II, 20-80 scientists across discovery and translational medicine, and pressure to define a combination-expansion strategy before the next financing or BD milestone |
|---|---|
| Buying trigger | Monotherapy readouts plateau, an internal portfolio review demands a combination roadmap, or a potential pharma partner asks for a mechanistically credible expansion plan before committing capital |
| Current alternative | Internal translational-biology meetings, literature reviews, consultant-led hypothesis generation, CRO bioinformatics support, and single-purpose pathway-analysis tools stitched together in slides and spreadsheets |
| Switching reason | The wedge reduces weeks of manual synthesis, surfaces non-obvious combinations grounded in the company's own data, and gives R&D leadership an auditable rationale for which experiments or cohorts to fund next |
| Pricing hypothesis | Annual platform subscription priced by active asset program plus onboarding fees for data integration, with premium fees for major portfolio reviews and validated hypothesis packages |
Jobs to be done
| Job | Current alternative | Success metric |
|---|---|---|
| When a lead oncology asset needs a combination-expansion strategy, help the translational team narrow hundreds of plausible options into a few mechanistically strong hypotheses, so they can fund the right validation experiments and protocol work. | Manual translational review meetings, consultant analyses, and slide-based literature synthesis | Time from data freeze to approved combination shortlist and percent of shortlisted hypotheses advanced to validation |
| When the CSO prepares for a board or partnering discussion, help the team produce a defendable evidence packet for why a specific combination path should be prioritized, so external stakeholders can trust the expansion plan. | Ad hoc slide decks assembled by scientists and bioinformatics teams | Time to produce review materials and number of revision cycles before approval |
flowchart LR Buyer[Translational oncology lead] --> Pain[Too many weak combination hypotheses across siloed biology data] Pain --> Product[Cross-modal combo reasoning OS] Product --> Outcome[Faster and more defensible expansion decisions]
- Signal · 5/5The cluster contains a differentiated architectural claim, a concrete oncology proof point, and a clear partner-access signal from two verified sources.
- Pain · 4/5Combination-selection mistakes burn time, assay budget, and strategic optionality for oncology biotechs with only a few shots on goal.
- Wedge · 5/5The first use case targets a narrow buyer, one asset-level workflow, and a clear output: ranked combination hypotheses with validation plans.
- Defense · 4/5Outcome-linked feedback on which cross-modal hypotheses converted into real experiments and programs can become proprietary workflow data that is hard to copy.
- Scale · 5/5The same reasoning layer can expand from oncology combinations into broader translational, portfolio, and drug-development decision workflows across biopharma.
- Translational oncology design partners
- CROs and assay labs executing downstream validation
- Specialist life-science investors and KOL networks
- Data infrastructure vendors used by biotech R&D teams
- Normalizing internal and external biology evidence
- Ranking and explaining combination hypotheses
- Producing validation briefs for translational review
- Learning from customer experiment outcomes to improve ranking quality
- Cross-modal biology reasoning engine
- Secure data connectors for assay, omics, PK, and biomarker data
- Outcome dataset linking predicted hypotheses to experimental results
- Scientific workflow templates for translational oncology reviews
- Rank combination hypotheses across internal and external biology evidence
- Shorten time from data review to validation-ready experiment plan
- Produce board- and partner-ready rationale for expansion decisions
- Build a reusable evidence graph around each oncology asset
- High-touch onboarding around one lead asset
- Joint translational review sessions with customer scientists
- Expansion from one asset program into portfolio-wide reasoning workflows
- Founder-led sales to CSOs and translational oncology leaders
- Investor and scientific-advisor introductions into venture-backed biotechs
- Design-partner programs tied to active portfolio or protocol reviews
- Clinical-stage oncology biotechs
- Translational medicine teams preparing combination-expansion programs
- Mid-size pharma oncology units evaluating external and internal asset combinations
- Scientific software and model development
- Customer-specific data integration and implementation
- Scientific success and workflow support
- Business development into venture-backed biotech accounts
- Annual software subscription per active asset program
- Data integration and implementation fees
- Premium workflow packages for major portfolio and partnering reviews
Market
| TAM | $259.5M Bottom-up estimate: 2,162 oncology trial starts in 2024 × 48% Phase II share = 1,038 programs [2]; apply $250k annualized platform-plus-integration value per active program using biotech AI build-vs-buy budget proxies [30] => about $259.5M. Cross-check: this sits below the adjacent $407.2M 2026 AI-in-oncology-drug-discovery estimate [19]. |
|---|---|
| SAM | $104.0M Apply a beachhead constraint of ~40% of Phase II oncology starts to targeted solid-tumor, clinical-stage biotech programs where combination and biomarker reasoning is most acute; 1,038 × 40% × $250k ≈ $104.0M. |
| SOM | $5.0M Year-3 reachable share assumes ~20 active asset programs at roughly $250k annualized value each, which is ambitious but plausible through founder-led design-partner sales in a narrow biotech segment. |
Executive takeaways
- The beachhead is real because oncology combination and biomarker decisions still depend on manual, cross-functional evidence synthesis, and that workflow breaks under rising modality and trial complexity.
- The most defensible opening is not broad “AI for biology,” but one asset-level workflow: turning internal assay, omics, PK, biomarker, and early clinical signals into a ranked combination shortlist for translational review.
- Competition is active but fragmented across preclinical evidence graphs, multimodal patient-data platforms, and full-stack TechBio companies; none obviously owns the exact protocol-expansion review moment for mid-size oncology biotechs.
- The main adoption risk is trust, not raw model capability: buyers need auditable evidence trails, narrow deployment scope, and fast time-to-value before they will displace consultants, CRO analyses, and slide-driven review meetings.
Market definition
Software and data products that help oncology drug developers convert multimodal translational evidence into asset-level target, combination, and biomarker decisions before expensive validation or protocol-expansion spend.
Customer and buyer
Daily champion is a VP Translational Oncology or Head of Computational Biology running one lead program; the economic buyer is usually the CSO or SVP R&D, with biomarker, clinical-development, and IT/compliance stakeholders acting as veto points.
Buying triggers
- Monotherapy efficacy plateaus and the translational team needs a defensible combination shortlist before expansion-cohort or board decisions. [13][21][22]
- Biomarker testing and MTB preparation become coordination bottlenecks, making manual synthesis too slow for live portfolio review cycles. [4][31][32]
- A partner or investor asks for stronger mechanistic rationale and patient-segmentation logic before funding the next oncology step. [3][13][15]
Willingness to pay
Buyer behavior already supports meaningful budget for this category: Tempus, AstraZeneca, and Pathos structured a $200M multimodal oncology model deal, Recursion discloses $80M–$150M upfront-style platform partnerships with large milestone pools, and biotech build-vs-buy guidance frames commercial AI platforms as materially cheaper than building internal systems but still worth buying when they shorten time-to-value. [15][16][17][30][46]
Category dynamics
Tailwinds
- Oncology continues to command a huge share of clinical development, with more novel modalities and combinations moving into active trials.
- Multimodal oncology model deals show large buyers already believe richer data fusion can improve discovery and development decisions.
- Biomarker-driven enrollment and subgroup logic make structured translational reasoning more valuable than broad literature search alone.
Headwinds
- Data privacy, AI credibility, and regulated-record obligations can widen implementation scope for any tool touching sensitive patient or development data.
- Internal data heterogeneity and MTB-style manual work still create onboarding and change-management drag.
- Broader platforms with larger data assets can make buyers ask why they need a separate workflow-specific product.
Validation signals
- Ingenix claims it reduced a dual-payload ADC search problem to 15 candidate combinations and surfaced 3 actionable novel hypotheses in an oncology engagement.
- BenchSci reported early-adopter outcomes such as new indications/targets in 40% of projects and earlier identification of safety or efficacy risks in 33% of projects.
- Tempus, AstraZeneca, and Pathos structured a $200M multimodal oncology foundation-model deal, showing large buyers are already funding the underlying category.
- ConcertAI and Foundation Medicine positioned a nearly half-million-patient clinically linked dataset specifically for translational research and drug-development decisions.
Regulatory & technical constraints
- If the system is used to support regulatory decision-making for safety, efficacy, or quality, sponsors should expect FDA-style risk-based credibility plans, model documentation, and lifecycle maintenance requirements.
- Any deployment that creates or modifies regulated electronic records should assume validated systems, audit trails, access controls, and linked electronic signatures are required.
- EU personal-data handling immediately raises GDPR obligations, and customer security reviews will be stricter when multimodal patient data is in scope.
- Trial-informatics systems already rely on secure cloud, role-based access, interoperability, and chain-of-custody controls, which sets the bar for enterprise deployment.
Competition
The practical alternatives split into four groups: internal translational meetings and consultants; preclinical evidence platforms such as BenchSci; multimodal oncology data networks such as Tempus and ConcertAI; and broader AI-first biology platforms such as Owkin, Recursion, and Isomorphic Labs. The proposed startup should look less like another general “foundation model” and more like a narrow, evidence-traceable decision layer for one live asset program.
| Competitor | Stage | Wedge | Pricing | Strength | Weakness vs. us |
|---|---|---|---|---|---|
| Ingenix | seed | Cross-modal biology reasoning engine for translational and clinical R&D decisions. | Not public; Qualified Access Program / enterprise engagement | Closest strategic analogue to the proposed wedge and already demonstrates an oncology combination-style proof point. | Still positioned as a broad reasoning layer, leaving room for a narrower productized workflow around one biotech asset review. |
| BenchSci | scale-up | Neuro-symbolic disease-biology platform for target due diligence, literature synthesis, and experiment planning. | Enterprise SaaS; not publicly itemized | Deep adoption in large pharma and strong story around evidence grounding and internal-data ingestion. | Center of gravity is preclinical biology and experiment planning, not the clinical-stage combination-expansion packet. |
| Owkin | scale-up | Agentic biological reasoning built on multimodal patient data and oncology-focused validation. | Enterprise platform / licensing; not publicly itemized | Rich patient-data network and explicit product ambition across discovery-to-development decision making. | Broader and heavier platform scope may be overbuilt for a single-program biotech buyer with one urgent review need. |
| Tempus | scale-up | Multimodal oncology data network and foundation-model strategy for drug discovery and care decisions. | Custom data / platform / model-development agreements | Massive oncology dataset and clear willingness-to-pay validation from AstraZeneca and Pathos. | Optimized for data-network scale and large partnerships rather than a lightweight biotech-native translational meeting workflow. |
| ConcertAI | incumbent | Clinically linked oncology data, translational research, and clinical-trial decision tools for life sciences. | Custom enterprise contracts | Strong life-sciences penetration and credible translational / trial-support positioning. | Broader RWE and trial-operations focus leaves space for a more opinionated product that ranks combination hypotheses inside one asset team. |
Why incumbents do not win by default
- Preclinical evidence SaaS. BenchSci is strong at surfacing disease-biology evidence, but its center of gravity is earlier preclinical research rather than a clinical-stage combination-expansion decision packet.
- Multimodal patient-data platforms. Tempus and ConcertAI win on data scale and real-world evidence, but their products are broader platform sales motions that do not automatically solve the biotech-specific translational review workflow.
- Agentic biology copilots. Owkin is pushing agentic biological reasoning, but its scope spans discovery through development and may be heavier than what a single-asset biotech needs for one urgent review cycle.
- Full-stack TechBio platforms. Recursion and Isomorphic Labs demonstrate that pharma will spend heavily on AI-first biology platforms, but they are partnership-heavy platform builders rather than obvious plug-in tools for a mid-size oncology biotech team.
Business plan
Clinical-stage oncology biotechs regularly hit a high-stakes moment when a lead asset shows partial efficacy and management needs a defensible combination or biomarker expansion plan before the next board, partnering, or protocol decision. Today that decision is still assembled through manual translational meetings, consultant work, siloed bioinformatics, and slide decks built from assay, omics, PK, biomarker, and early clinical data. The first customer should be a Series B to pre-IPO oncology biotech with one Phase I or II targeted solid-tumor asset and enough internal data complexity that combination review is already slowing an imminent funding or BD milestone. The wedge is a secure cross-modal reasoning workspace that ingests the minimum useful internal dataset, returns a ranked shortlist of partner-drug or biomarker hypotheses, and packages an auditable validation brief for translational review. The deliberate choice is to start as bounded decision support for one asset program rather than as a broad biology foundation model, a regulated submission system, or a full multimodal data network. Research supports a modeled TAM, SAM, and year-3 SOM of about $259.5M, $104.0M, and $5.0M respectively, which is enough for a wedge only if the company expands from combination planning into adjacent asset-development workflows after it proves repeatable pilot conversion. The main execution risk is trust and onboarding speed: buyers will not replace existing meetings, CRO analyses, and broader platforms unless the product shows transparent evidence trails and reaches a useful shortlist inside one live decision cycle. Public buyer budget ranges and the minimum dataset needed for sub-six-week time to value remain partially unproven in the source material, so the first 12 months should optimize for paid pilots, production conversion, and budget-path clarity rather than aggressive market expansion.
Problem
- Clinical-stage oncology biotechs still synthesize internal assay, omics, PK, biomarker, and early clinical evidence manually when deciding which combinations or patient subgroups deserve scarce validation dollars.
- Monotherapy plateaus, board reviews, and pharma partnering discussions create time-bound pressure, but the current process is slow, biased toward familiar mechanisms, and hard to defend with one coherent evidence trail.
- Existing alternatives split across consultants, CRO bioinformatics, pathway tools, and broad data platforms, so no product cleanly owns the asset-level translational review moment for a mid-size biotech team.
Solution
- Ingest the minimum useful package of internal assay, omics, PK, biomarker, and early clinical data for one oncology asset and return a ranked shortlist of partner-drug or biomarker hypotheses with mechanistic rationale.
- Show evidence traces, contradictory signals, and next-step validation recommendations in a review packet built for translational, portfolio, and protocol-expansion meetings rather than for generic scientific search.
- Launch as decision support for one live program, with analyst-assisted onboarding and validation planning, before expanding into broader portfolio workflows or regulated evidence support.
Why we win
- The product is scoped to a specific urgent workflow that broad multimodal platforms and biology copilots do not obviously own by default: the combination-expansion review for one clinical-stage oncology asset.
- Transparent evidence trails and contradiction handling directly address the trust barrier that blocks adoption of black-box model outputs in translational oncology.
- A proprietary dataset linking ranked hypotheses to validation outcomes, rejected options, and downstream program decisions can compound into a workflow moat that consultants and generic model vendors do not see.
| Beachhead | Clinical-stage oncology biotechs with one Phase I or II targeted solid-tumor asset, fragmented translational datasets, and an active need to nominate 2-5 combination hypotheses before a board, partnering, or protocol-expansion decision. |
|---|---|
| Wedge rationale | This beachhead produces faster proof than a broad pharma or pan-oncology sale because the buyer pain is concentrated around one asset, one review cycle, and one measurable output: a funded shortlist of hypotheses that management can act on immediately. |
| Sequencing | Start with one-asset decision support, narrow ingestion templates, and founder-led design-partner sales so the company can prove trust, onboarding speed, and budget ownership before investing in portfolio analytics, deeper enterprise deployment, or broader therapeutic coverage. Only after paid pilots convert should the company add repeatable implementation hires, CRO and biomarker-lab partnerships, and adjacent workflows such as biomarker stratification or licensing diligence. |
| Not yet | Broad target-discovery or preclinical research workflows · Full multimodal patient-data network ambitions that mimic Tempus or ConcertAI · Autonomous recommendations for regulated submission support · Non-oncology indications or mid-size pharma portfolio sales before the biotech playbook is repeatable |
| Wedge | Sell a paid combination-review pilot tied to one imminent board, partnering, or protocol-expansion decision, and win by reducing manual synthesis time while improving the defensibility of which experiments or cohorts get funded next. |
|---|---|
| Channels | Founder-led sales to CSOs, SVPs R&D, VPs Translational Oncology, and Heads of Computational Biology at venture-backed oncology biotechs · Investor, advisor, and KOL introductions into design-partner biotechs already under milestone pressure · Design-partner offers anchored to live translational or portfolio review cycles rather than broad platform pitches · CRO, assay-lab, and biomarker-network referrals that can attach validation execution after the shortlist is approved |
| Funnel targets | Target account→qualified design-partner discussion 25-35%, qualified discussion→paid pilot 20-30%, paid pilot→annual production contract 50%+, first production asset→second asset or adjacent workflow expansion 30%+ within 12 months |
| Pricing | Asset-program subscription with upfront onboarding and optional premium review-packet services, priced around the value of one active program decision rather than seats or generic usage. Research supports a working assumption of about $250k annualized value per active program, but the exact pilot budget and contracting owner must be validated in the first 3-5 deals. |
| MVP | MVP is a secure oncology combination-review workspace for one active asset that ingests common assay, omics, PK, biomarker, and early clinical files, produces a ranked shortlist of hypotheses, and packages each recommendation with evidence traces, contradictory signals, and a validation brief. Version 1 should support retrospective back-testing, analyst-assisted review, and exportable meeting packets without claiming autonomous scientific judgment or full submission-system compliance. |
|---|---|
| 6 months | Ship narrow ingestion templates for the most common oncology data types, retrospective back-testing on design-partner assets, review-packet exports, audit logging, and a customer-hosted or VPC deployment path for security sensitive accounts. |
| 12 months | Run 3-5 paid pilots on live assets, add collaboration workflows for translational review meetings, benchmark shortlist quality and time saved against the old process, and support handoff into CRO or assay-partner validation planning. |
| 24 months | Expand from one-asset combination planning into biomarker stratification, target-indication matching, and portfolio triage for customers that already converted, while keeping the core product focused on auditable decision-support rather than becoming a general-purpose biology platform. |
| Key bets | A minimum useful internal dataset is sufficient to produce a decision-grade shortlist within one live review cycle. · Translational teams will trust evidence-traceable ranked outputs more than black-box scores or generic literature copilots. · Buyers will purchase an asset-level workflow faster than they will buy a broad biology platform or fund an internal build. · Early design-partner outcome data will create a defensible learning loop before broader platforms narrow into the same workflow. |
| Revenue streams | Annual subscription per active asset program · One-time onboarding and data-normalization fees · Premium scientific review-packet and portfolio-review services · Enterprise deployment, security, and compliance add-ons for larger accounts |
|---|---|
| Unit of value | Active oncology asset program under translational review |
| Target gross margin | 70% |
| Expansion levers | Expand from one asset program to additional assets within the same biotech · Add biomarker stratification, target-indication matching, and portfolio-triage modules for converted accounts · Layer in enterprise deployment and governance features for pharma or larger biotech buyers · Build an outcomes dataset that improves ranking quality and supports premium validation and benchmarking products |
| North-star metric | Active asset programs where the platform-generated shortlist is used to fund a validation plan or protocol-expansion decision. |
|---|---|
| Input metrics | Qualified biotech design-partner meetings per quarter · Median time from raw data handoff to review-ready shortlist · Share of ranked hypotheses discussed in live translational review meetings · Pilot to production conversion rate · Production accounts expanding to a second asset or workflow · Percentage of shortlisted hypotheses advanced to validation · Customer-rated trust score for evidence transparency |
| Moats to build | Outcome-linked dataset connecting ranked hypotheses to validation results, rejected options, and downstream program decisions · Oncology-specific evidence graph and contradiction-handling layer built around real translational review packets · Security, deployment, and onboarding playbooks that reduce time to first value for messy biotech datasets · Embedded workflow templates that mirror how translational, biomarker, and portfolio teams already make decisions |
| Kill criteria | Fewer than 3 of the first 10 qualified biotech accounts agree to a paid pilot tied to a live combination or biomarker review · Median onboarding time stays above 6 weeks for the minimum useful dataset across the first 4 pilots · Fewer than 2 of the first 4 paid pilots convert to production contracts at or above roughly $200k annualized value · Translational teams fail to use the shortlist in real review meetings or advance any recommended hypothesis to validation in most pilots |
Milestones
- Sign 3-5 design-partner pilots tied to live oncology asset decisions
- Prove median onboarding time of 6 weeks or less for the minimum useful dataset
- Convert at least 2 paid pilots into production contracts
- Launch customer-hosted or VPC deployment for security-sensitive accounts
- Expand at least 2 customers from one asset to multiple assets or adjacent translational workflows
- Build the first outcome-linked dataset connecting ranked hypotheses to validation results
- Standardize a repeatable onboarding and review-packet playbook that reduces services effort per account
- Reach about 20 active asset programs consistent with the modeled year-3 SOM
- Add biomarker stratification and portfolio-triage modules for converted customers
- Begin selective up-market expansion into mid-size pharma oncology units with enterprise deployment requirements
flowchart LR Wedge[One-asset oncology review wedge] --> MVP[Evidence-traceable reasoning workspace] MVP --> Proof[Paid pilots and funded validation plans] Proof --> Expansion[Portfolio workflows and second-asset expansion]
Founding team
| Role | Start timing | Rationale |
|---|---|---|
| Founding eng | Month 0 | Build the core reasoning, ingestion, and evidence-traceability system fast enough to support retrospective pilots without a large platform team. |
| Computational biology lead | Month 0 | Own ontology choices, shortlist quality, and credibility with translational scientists during early pilots. |
| Product/scientific solutions lead | Month 3 | Translate customer datasets and review workflows into repeatable pilot implementations and review packets. |
| Platform/data engineer | Month 6 | Reduce onboarding friction, support customer-hosted deployment, and harden security and auditability for production accounts. |
| Founder-led GTM | Month 0 | Early sales depend on scientific credibility, investor-network access, and close involvement in each pilot trigger and conversion path. |
Experiment roadmap
| Horizon | Experiment | Hypothesis | Success metric | Owner |
|---|---|---|---|---|
| 0–90 days | Run 15 structured buyer interviews with translational oncology leaders, computational biology heads, and CSOs at qualified biotechs. | At least 5 accounts have an imminent combination or biomarker review where the current process is slow enough to justify a paid pilot. | 10 qualified meetings, 5 documented buying triggers, and 3 pilot-scoping follow-ups. | Founder/CEO |
| 0–90 days | Define the minimum useful data package and test ingestion on 2 retrospective asset datasets. | Common oncology assay, omics, PK, biomarker, and early clinical files are enough to generate a review-ready shortlist without full custom integration. | 2 retrospective pilots completed in 6 weeks or less with customer-rated usefulness above 7 out of 10. | Founding eng |
| 90–180 days | Run 3 analyst-assisted design-partner pilots tied to live board, protocol-expansion, or partnering decisions. | Teams will use auditable review packets in real decision meetings if the shortlist includes contradictory evidence and validation next steps. | 3 paid pilots, 2 live review meetings using the product output, and at least 1 recommended hypothesis advanced to funded validation per converted pilot. | Computational biology lead |
| 90–180 days | Test two commercial offers: analyst-assisted pilot versus lighter-touch software workflow. | The market initially buys a software-plus-services motion faster than a self-serve product, but usage data will show which path can scale. | Side-by-side conversion data from at least 4 opportunities and a documented preferred packaging decision. | Founder/CEO |
| 180–365 days | Add customer-hosted or VPC deployment and run 2 security reviews with qualified accounts. | Security and privacy objections can be reduced enough to keep pilots inside a normal biotech procurement cycle. | 2 security approvals without custom one-off architecture and no pilot lost solely to deployment concerns. | Platform/data engineer |
| 180–365 days | Expand the first production account from one asset to a second asset or adjacent biomarker workflow. | The same review system can land-and-expand before the company sells into broader pharma accounts. | 1 second-asset expansion and 1 adjacent workflow expansion from the first 3 production customers. | Product/scientific solutions lead |
Risk assessment
- R1Scientists may distrust model-generated hypotheses even if the rankings are directionally useful. — Keep the product explanation-first, show contradictory evidence beside each recommendation, and measure adoption in live review meetings before expanding scope.
- R2Messy internal biotech data may make onboarding too slow for an urgent decision cycle. — Start with narrow ingestion templates, require a minimum useful data package, and use services-assisted pilots until the implementation path is repeatable.
- R3Broader incumbents or service providers may bundle adjacent functionality once the wedge proves valuable. — Win on workflow specificity, outcome-linked learning data, and fast trusted deployment around one asset review rather than on generic model capability.
- R4Customer legal or compliance teams may classify the workflow as regulated evidence support earlier than planned. — Keep the initial posture as decision support, add audit and governance controls early, and avoid submission-system claims until the product and budget can support that scope.
| Risk | Likelihood | Impact | Mitigation |
|---|---|---|---|
| Scientists may distrust model-generated hypotheses even if the rankings are directionally useful. | High | High | Keep the product explanation-first, show contradictory evidence beside each recommendation, and measure adoption in live review meetings before expanding scope. |
| Messy internal biotech data may make onboarding too slow for an urgent decision cycle. | High | High | Start with narrow ingestion templates, require a minimum useful data package, and use services-assisted pilots until the implementation path is repeatable. |
| Broader incumbents or service providers may bundle adjacent functionality once the wedge proves valuable. | Medium | High | Win on workflow specificity, outcome-linked learning data, and fast trusted deployment around one asset review rather than on generic model capability. |
| Customer legal or compliance teams may classify the workflow as regulated evidence support earlier than planned. | Medium | Medium | Keep the initial posture as decision support, add audit and governance controls early, and avoid submission-system claims until the product and budget can support that scope. |
| Title | VP Translational Oncology at a Phase I or II solid-tumor biotech |
|---|---|
| Profile | A Series B to pre-IPO oncology biotech with one lead targeted therapy asset, 20-80 scientists across translational and computational biology, and an upcoming expansion or partnering decision that needs a mechanistically grounded combination plan. |
| Trigger | Monotherapy efficacy plateaus, the board demands a combination roadmap, or a pharma partner asks for stronger mechanistic rationale before committing capital. |
| Buyer | CSO or SVP R&D |
| Initial contract | Paid 8-12 week design-partner pilot in the $75k-$150k range for one active asset, converting to about $200k-$300k annualized per-program subscription plus onboarding once the shortlist is used to fund validation work or protocol-expansion planning. |
What must be true
- At least 3 of the first 10 qualified oncology biotech accounts will fund a paid pilot for one live asset decision instead of staying with internal meetings, CRO support, or consultant decks.
- The MVP will ingest the minimum useful dataset and deliver a review-ready shortlist inside 6 weeks for the majority of early pilots.
- In at least half of early pilots, translational teams will discuss the ranked output in a real review meeting and advance one or more recommended hypotheses to validation.
- A CSO, SVP R&D, or equivalent budget owner will approve production contracts around the modeled per-program value without requiring a full enterprise software cycle.
- At least half of the first production customers will expand to a second asset or adjacent workflow within 12 months, supporting the venture-scale expansion case.
Open diligence questions
- Which exact budget line pays for the first pilot if the sale is framed as decision support rather than infrastructure?
- What minimum file package is enough to produce a trustworthy shortlist without long custom integration?
- How often do buyers require analyst-assisted review packets rather than software-only output in the first year?
- Why would a biotech buy this workflow instead of using BenchSci, Tempus, ConcertAI, or its current CRO and consultant mix?
- What evidence threshold makes a ranked hypothesis credible enough for translational and biomarker leads to act on?
| Call | Meet / investigate further |
|---|---|
| Conviction | Sharp workflow wedge and credible category timing, but conviction depends on proving a repeatable budget owner and fast trusted onboarding. |
| Why believe | Oncology biotechs already feel acute combination and biomarker decision pain, and the proposed product is narrowly aligned to the exact review moment that broader platforms and service providers do not fully own. |
| Why doubt | The company can fail if buyers still prefer consultant-heavy reviews or if messy internal data and trust friction prevent sub-six-week time to value. |
| Next diligence | Confirm 3-5 live design-partner pilots, map the first contracting path, and test whether teams actually use the shortlist in board or protocol review decisions. |
Financial model
| Year 1 revenue | $230K EBITDA $-1.33M · Cash EOP $2.67M |
|---|---|
| Year 2 revenue | $1.74M EBITDA $-1.14M · Cash EOP $1.53M |
| Year 3 revenue | $4.64M EBITDA $108K · Cash EOP $1.64M |
| ARPU (annual) | $250K |
|---|---|
| Gross margin | 70% |
| CAC | $150K Payback 10.3 months |
| LTV / CAC | 4.9x LTV $729K |
| Round | seed · $4.0M |
|---|---|
| Runway | 24 months |
| Milestone | Reach 12 active asset programs, prove repeatable <=6 week onboarding, ship VPC deployment, and show first second-asset expansion before a Series A raise. |
Model sanity
- Revenue engine. Base-case revenue is driven by four Y1 paid pilots compounding into 20 active asset programs by Q4Y3 at a blended $290K annual program value.
- Must go right. Onboarding has to stay at or below six weeks so pilots convert into production and at least two customers expand to second assets on schedule.
- Model breaks if. If sales cycles stretch and Y3 ends closer to 16 active programs at $250K value, cash falls to roughly breakeven and EBITDA remains deeply negative.
- Next-round proof. A Series A story appears once the company shows repeatable pilot-to-production conversion, VPC deployment, and real second-asset expansion rather than pilot revenue alone.
- Revenue (line, area)
- Cash EOP (dashed)
- EBITDA (bars, gray = loss)
- Founder / CEO
- Founding engineer
- Computational biology lead
- Product / scientific solutions
- Platform / data engineer
- Founder-led GTM / AE
- Implementation scientist
- ML / research engineer
- G&A / ops
- Customer success / partnerships
| Y3 revenue | Y3 EBITDA | Cash low point | Description | |
|---|---|---|---|---|
| Downside | Pilot conversion slips, land-and-expand is slower, and blended program value stays closer to the core contract price. | |||
| Base | Founder-led sales closes four paid pilots in Y1, converts into repeatable production programs in Y2, and reaches the modeled 20 active programs by Q4Y3. | |||
| Upside | Pilot proof arrives faster, second-asset expansion shows up earlier, and the company sells more premium review services into converted accounts. |
| Variable | Downside | Upside | Cash impact | Revenue impact |
|---|---|---|---|---|
| sales cycle | 9-month average cycle with slower procurement and security review | 4-month average cycle around urgent live decisions | ||
| churn | 3.5% monthly churn because budgets reset after pilots | 1.0% monthly churn after the workflow embeds | ||
| ARPU | $250K blended Y3 program value | $320K blended Y3 program value | ||
| hiring pace | Add later Y2-Y3 hires two quarters earlier than revenue proof | Hold two later hires until expansion proof is visible | ||
| gross margin | 65-68% Y3 gross margin because analyst-assisted work persists | 71-73% Y3 gross margin after services leverage improves | ||
| CAC | $190K blended CAC because enterprise selling stays custom | $120K blended CAC with stronger referrals and repeatability |
Scenarios
| Scenario | Y3 revenue | Y3 EBITDA | Cash low point | Description | Key changes |
|---|---|---|---|---|---|
| Downside | $3.03M | $-1.09M | $36K | Pilot conversion slips, land-and-expand is slower, and blended program value stays closer to the core contract price. |
|
| Base | $4.64M | $108K | $1.47M | Founder-led sales closes four paid pilots in Y1, converts into repeatable production programs in Y2, and reaches the modeled 20 active programs by Q4Y3. |
|
| Upside | $5.89M | $1.10M | $1.93M | Pilot proof arrives faster, second-asset expansion shows up earlier, and the company sells more premium review services into converted accounts. |
|
Sensitivity
| Variable | Downside | Base | Upside |
|---|---|---|---|
| ARPU | $250K blended Y3 program value | $290K | $320K blended Y3 program value |
| CAC | $190K blended CAC because enterprise selling stays custom | $150K | $120K blended CAC with stronger referrals and repeatability |
| churn | 3.5% monthly churn because budgets reset after pilots | 2.0% monthly churn | 1.0% monthly churn after the workflow embeds |
| sales cycle | 9-month average cycle with slower procurement and security review | 6-month average cycle | 4-month average cycle around urgent live decisions |
| gross margin | 65-68% Y3 gross margin because analyst-assisted work persists | 69-70% Y3 gross margin | 71-73% Y3 gross margin after services leverage improves |
| hiring pace | Add later Y2-Y3 hires two quarters earlier than revenue proof | Current ramp | Hold two later hires until expansion proof is visible |
Key assumptions (19)
| ID | Name | Value | Unit | Source |
|---|---|---|---|---|
| A1 | Model start month | 2026-06 | month | [BP date 2026-06-04; model starts in the same month as the plan] |
| A2 | Customer unit in the model | active paid asset program | definition | [BP businessModel.unitOfValue and BP market.som; each customer equals one paid oncology asset program] |
| A3 | Opening cash from seed round | 4000.0 | USDK | [BP fundingAsk targetFundingRangeUsd $3–5M; base case uses a $4.0M seed close near the middle of the range] |
| A4 | Year 1 customer adds by month | [0,0,1,0,1,0,0,1,0,0,1,0] | new customers | [BP milestones 0-12 months: sign 3-5 design-partner pilots; model assumes 4 paid asset-program starts in Y1] |
| A5 | Year 2 customer endpoints | [5,7,9,12] | customers EOP by quarter | [BP milestones 12-24 months: convert pilots, standardize onboarding, and expand at least 2 customers into more programs or adjacent workflows] |
| A6 | Year 3 customer endpoints | [14,16,18,20] | customers EOP by quarter | [BP milestones 24-36 months and BP/RS market.som: reach about 20 active asset programs by year 3] |
| A7 | Year 1 blended annualized program value | 120.0 | USDK annual | [BP investorMemo.firstCustomer initialContract $75k-$150k for an 8-12 week pilot; base case uses a $120k annualized pilot/services blend] |
| A8 | Year 2 blended annualized program value | 240.0 | USDK annual | [BP gtm.pricing and investorMemo.firstCustomer conversion to about $200k-$300k annualized per program plus onboarding; base case uses $240k] |
| A9 | Year 3 blended annualized program value | 290.0 | USDK annual | [BP businessModel.revenueStreams and expansionLevers; second-asset expansion and premium review services lift blended value above the $250k core program assumption] |
| A10 | Revenue recognition method | average active programs in period x period ARPU | formula | [Startup-finance heuristic named source: Financial Modeler mid-period go-live rule; revenue uses average beginning and ending active programs each month or quarter] |
| A11 | Gross margin ramp | Y1 40%-58%; Y2 60%,63%,66%,68%; Y3 69%,70%,70%,70% | percent | [BP businessModel.targetGrossMarginPct 70 and BP operatingAssumptions on services-assisted delivery; model reaches target only after onboarding becomes repeatable] |
| A12 | Loaded annual salaries by role | Founder 180; Founding eng 210; Computational biology lead 220; Product/scientific solutions 180; Platform/data eng 190; Implementation scientist 160; GTM 180; ML/research eng 200; G&A 120; Customer success/partnerships 150 | USDK annual per FTE | [BP team and operations] plus [startup-finance heuristic for lean US biotech-software compensation including payroll burden] |
| A13 | Hiring sequence | Founder, founding eng, and computational biology lead at start; product/scientific solutions in Q2Y1; platform/data engineer in Q3Y1; first GTM in Q4Y1; implementation scientist in Q1Y2; ML/research engineer in Q2Y2; G&A in Q3Y2; second platform/data engineer in Q1Y3; customer success/partnerships in Q2Y3; second GTM in Q3Y3 | timing | [BP team startTiming and BP strategicChoices.sequencingRationale] with later hires only after pilot conversion and security/onboarding proof] |
| A14 | Non-payroll operating spend ramp | Y1 S&M 8-20/month, R&D 18-24/month, G&A 10-16/month; Y2 quarterly opex 175,195,215,240; Y3 quarterly opex 265,285,305,325 | USDK | [BP gtm channels, operations, and fundingAsk.useOfFundsSummary] plus [startup-finance heuristic for cloud, travel, legal, security, and implementation tooling] |
| A15 | Steady-state monthly churn | 2.0 | percent | [Startup-finance heuristic: high-ACV workflow software is sticky once embedded, but biotech budget resets and trust risk create non-trivial early churn] |
| A16 | Blended CAC per active program | 150.0 | USDK per customer | [Model-derived from founder-led GTM payroll plus S&M spend through the first 12 active programs, rounded up for conservative enterprise selling] |
| A17 | Funding sizing rule | capital sized to the Q4Y2 proof milestone plus 6 months of buffer | policy | [Developer instruction] anchored to [BP fundingAsk runway target and milestone sequencing] |
| A18 | Next financing milestone | 12 active programs, repeatable <=6 week onboarding, VPC deployment, and first second-asset expansions before a Series A process | milestone | [BP milestones 0-24 months and BP investorMemo.nextDiligence] |
| A19 | Cash flow simplification | cash approximates EBITDA with no debt, capex, taxes, or working-capital timing modeled | heuristic | [Startup-finance heuristic named source: early-stage planning model simplification] |
flowchart LR Buyers[Qualified biotech accounts] --> Pilots[Paid pilots] Pilots --> Programs[Active asset programs] Programs --> Revenue[Subscription and onboarding revenue] Revenue --> GrossProfit[Gross profit after delivery and cloud costs] GrossProfit --> Cash[Cash after payroll and operating spend]
Flags: The base case still depends on founder-led selling and only a modest paid demand-gen motion, so CAC could rise once the company hires beyond the first GTM rep. · Y3 EBITDA is only modestly positive, which means a next round still depends on proving services leverage and expansion quality rather than headline growth alone. · Cash is modeled as EBITDA and excludes enterprise collection timing or deferred revenue, so real quarterly cash swings could be lumpier than shown here.
Top risks
- Evidence credibility gap. Translational teams may distrust model-generated hypotheses unless the evidence chain is transparent and maps cleanly to known biology. Mitigation: Start with explanation-first review packets, expose contradictory evidence and source provenance, and keep the product focused on decision support rather than autonomous recommendations.
- Data integration friction. Small biotech teams often have messy internal assay and biomarker data, which can slow time to first value. Mitigation: Launch with a narrow ingestion template for the most common oncology datasets and offer a services-assisted onboarding path for the first asset program.
- Platform incumbent pull-in. Large model vendors or existing R&D software providers could add lightweight combination-ranking features once the wedge proves valuable. Mitigation: Win on program-specific review workflow, cross-modal explanation quality, and a proprietary outcomes dataset tied to real validation and expansion decisions.
Evidence
Cited sources (37)
- Sofinnova Partners. Ingenix raises €13m from Sofinnova Partners-led syndicate to scale Modality Fusion, a novel architecture for drug development · https://sofinnovapartners.com/news/ingenix-raises-eur13m-from-sofinnova-partners-led-syndicate-to-scale-modality-fusion-a-novel-architecture-for-drug-development
- IQVIA. Global Oncology Trends 2025 · https://www.iqvia.com/insights/the-iqvia-institute/reports-and-publications/reports/global-oncology-trends-2025
- National Cancer Institute. About NCI-MATCH · https://www.cancer.gov/about-nci/organization/cbiit/projects/match
- PubMed Central. ASCP explores the cancer biomarker testing navigator as a novel role to improve cancer biomarker testing workflow efficiency and care quality · https://pmc.ncbi.nlm.nih.gov/articles/PMC12159526/
- PubMed Central. Challenges, complexities, and considerations in the design and interpretation of late-phase oncology randomized controlled trials · https://pmc.ncbi.nlm.nih.gov/articles/PMC10917127/
- BenchSci. Unlocking the Black Box of Disease Biology · https://www.benchsci.com/
- Owkin. Building Biological Artificial Superintelligence · https://www.owkin.com/
- Recursion. Pioneering AI Drug Discovery · https://www.recursion.com/
- gdpr-info.eu. General Data Protection Regulation (EU) 2016/679 · https://gdpr-info.eu/
- AACR. Navigating the Meandering Path of Oncology Drug Development · https://www.aacr.org/blog/2025/05/28/navigating-the-meandering-path-of-oncology-drug-development/
- BioSpace. BenchSci Announces Launch of ASCEND™ – a First-of-its-Kind Map of All Disease Biology That Aims To Transform Pharmaceutical Research · https://www.biospace.com/benchsci-announces-launch-of-ascend-a-first-of-its-kind-map-of-all-disease-biology-that-aims-to-transform-pharmaceutical-research
- Pharmaceutical Technology. AstraZeneca enters $200m AI cancer pact with Tempus and Pathos · https://www.pharmaceutical-technology.com/news/astrazeneca-enters-200m-ai-cancer-pact-with-tempus-and-pathos/
- Fierce Biotech. Tempus AI in line for $200M from AstraZeneca cancer model deal · https://www.fiercebiotech.com/biotech/tempus-ai-line-200m-astrazeneca-pathos-deal-develop-cancer-model
- Recursion. Partners · https://www.recursion.com/partners
- MarketsandMarkets. AI in Oncology Market: Growth, Size, Share, and Trends · https://www.marketsandmarkets.com/Market-Reports/ai-in-oncology-market-256715534.html
- Knowledge Sourcing Intelligence. AI in Oncology Drug Discovery Market · https://www.knowledge-sourcing.com/report/ai-in-oncology-drug-discovery-market
- Precision for Medicine. The State of Antibody-Drug Conjugate Clinical Trials in 2025 · https://www.precisionformedicine.com/blog/clinical-trial-trends-antibody-drug-conjugates
- Crown Bioscience. The Oncology Drug Development Landscape: Regulatory Trends, Modalities, and Translational Priorities for 2026 · https://blog.crownbio.com/the-oncology-drug-development-landscape
- Foley & Lardner. AI Drug Development: FDA Releases Draft Guidance · https://www.foley.com/insights/publications/2025/01/ai-drug-development-fda-releases-draft-guidance/
- Domino Data Lab. AI for drug development: Ensure FDA compliance · https://domino.ai/blog/ai-for-drug-development-a-roadmap-for-fda-compliance
- PubMed Central. Reimagining drug regulation in the age of AI: a framework for the AI-enabled ecosystem for therapeutics · https://pmc.ncbi.nlm.nih.gov/articles/PMC12571717/
- Clinical Leader. FDA Issues Draft Guidance On The Use Of AI To Support Regulatory Decision-Making For Drug And Biological Products · https://www.clinicalleader.com/doc/fda-issues-draft-guidance-on-the-use-of-ai-to-support-regulatory-decision-making-for-drug-and-biological-products-0001
- Nature Reviews Drug Discovery. Two dual-payload antibody–drug conjugates are now in the clinic for cancer, with many more on the way. · https://www.nature.com/articles/d41573-025-00121-y
- Intuition Labs. AI for Biotech: A Build vs. Buy Decision Framework · https://intuitionlabs.ai/articles/build-vs-buy-ai-biotech
- PubMed Central. The Molecular Tumor Board Portal supports clinical decisions and automated reporting for precision oncology · https://pmc.ncbi.nlm.nih.gov/articles/PMC8882467/
- Frontiers in Oncology. Data-driven support to decision-making in molecular tumour boards for lymphoma: A design science approach · https://www.frontiersin.org/journals/oncology/articles/10.3389/fonc.2022.984021/full
- arXiv. Multimodal Data Integration for Precision Oncology: Challenges and Future Directions · https://arxiv.org/abs/2406.19611
- Foundation Medicine. ConcertAI and Foundation Medicine Integrate Genomic and Clinical Data to Transform Translational Research and Efficient Drug Development · https://www.foundationmedicine.com/press-release/concertai
- BenchSci. From Coffee to Collaboration: Bringing ASCEND to Life with AMGEN · https://blog.benchsci.com/bringing-ascend-to-life-with-amgen
- Isomorphic Labs. Partnerships · https://www.isomorphiclabs.com/partnerships
- Neo4j. BenchSci Decodes Disease Biology with Neo4j to Accelerate Drug Discovery · https://neo4j.com/customer-stories/benchsci/
- USDM. What Is 21 CFR Part 11 Compliance? · https://www.usdm.com/resources/blogs/usdms-guide-to-21-cfr-part-11
- PSC Software. 21 CFR Part 11 Compliance · https://pscsoftware.com/resource-center/article/21-cfr-part-11-compliance/
- Pathos AI. Pathos signs strategic agreements with AstraZeneca and Tempus to develop the largest multimodal foundation model in oncology · https://www.pathos.com/pathos-signs-strategic-agreements-with-astrazeneca-and-tempus-to-develop-the-largest-multimodal-foundation-model-in-oncology
- ConcertAI. Life Sciences and Pharma · https://www.concertai.com/life-sciences-and-pharma
- Owkin. AI Agent for Biopharma Decision Making | Owkin K Pro · https://www.owkin.com/k-os/k-pro
- Isomorphic Labs. Solve all disease · https://isomorphiclabs.com/