Jio Arogya · AI Cost Explorer
This site is access-restricted.
Please enter the password to continue.
Incorrect password. Try again.
← Home
Jio Arogya · AI Cost Model · Operating Plan (2B)

JioCare AI Investment Decision Explorer — Rent baseline (2B)

Interactive cost analysis: Rent vs Build · RENT baseline from 2B_AI_Light_Rent_Lean.md (operating-plan rent posture) | Confidential — Revised 7 May 2026 (member ramp pulled in to 25M Y5, R&D ramp from xlsx, new Omics/Pharma reference panel)

Member Scale (actual user count per year)

Slider-driven
Manual edits override the growth slider for that year. Touch the growth slider to reset.

Engagement Multiplier by Year

Slider-driven
Manual edits override the global engagement slider for that year. Touch the engagement slider to reset.

Scenario Controls

Stress-test assumptions against the 2B operating-plan rent baseline (2B_AI_Light_Rent_Lean.md). Sliders apply scenario overrides only and do not rewrite source definitions in the Reference section.

Usage Growth SpeedMultiplier on the baseline member ramp (10K → 25M by Y5, hold at 25M through Y7). 1.0x = baseline forecast (260507 memo / xlsx). 1.0x
Slow (0.5x)Fast (2.0x)
API Price Decline Rate (per year)Annual percentage drop applied on top of the embedded trajectory for this 2B baseline (§6.1 rates already step down year-over-year in-source). At 0%, no extra decline is stacked. Non-zero values model additional sensitivity. 0%
0% (no decline)70%/year
For this page’s 2B rent baseline, API price declines are already embedded in the inference series (§6.1 trajectory). A non-zero slider stacks as extra sensitivity on top.
Build Start YearThe year proprietary model training begins. Before this year, the Build posture runs on rented APIs and accumulates consented, de-identified usage data that can lower later annotation and corpus construction costs. Y3
Y1 (day-zero anchor)Y5
Engagement IntensityMultiplier on baseline AI interactions per member per year (496 events a year at 1.0x). An event is a single AI-mediated action — user-initiated (chat, voice query, document scan) or system-initiated on behalf of the member (CHW prep briefs, telehealth pre-consult summaries, clinic follow-up triggers). 1.0x · 496 ev/yr
Low (0.25x · ~124 ev/yr)High (3.0x · ~1,488 ev/yr)

Scenario Presets

Annual Cost Comparison

Total cost per year across Rent vs dynamic Build ($M)

Cumulative Cost Over Time

Running total spend — crossover points highlighted ($M)

Per-Member Inference Cost

Unit economics: $/member/year by posture

CapexRENT capex (2B operating plan): PII gate + orchestration implementation; Indian clinical eval & safety harness refresh; scaled AI/product/platform team; AI tooling + non-production LLM dev tokens. BUILD capex adds training compute, GPU cluster, corpus, annotation, distillation, eval infra. vs OpEx Breakdown

Investment structure by year — Rent vs Build grouped stacked view ($M)

Data Asset Accumulation

Shows data generated by member activity. Not priced as an asset — shown to illustrate the compounding data flywheel that underpins the Build thesis.

Cumulative Platform Data Over Time

Raw data exhaust vs training-usable corpus — cumulative tokens across all members

Reference: Total publicly available Indian medical corpus is estimated at ~70–80B tokens. The platform surpasses this by Y2 in cumulative raw exhaust.

Data Composition Breakdown

Annual token volume by source — scaled by member count and engagement

Agent conversations dominate volume (~46%), but clinical encounters (doctor, specialist, hospital) carry highest per-token diagnostic value for model training.

Detailed Year-by-Year Breakdown ($M)

Decision Insights

Reference: Definitions, Assumptions & Methodology

These panels describe the baseline assumptions behind the model. Controls above test alternate scenarios without changing the underlying source definitions. RENT cost loads from 2B_AI_Light_Rent_Lean.md (operating-plan rent baseline); Build posture panels still trace 3_AI_Full_Build.md / context doc. All engagement figures are Y2+ steady-state unless noted.

Source documents: 2B_AI_Light_Rent_Lean.md (RENT baseline for this page) · 1_Usage_and_Inference_Demand.md · 2_AI_Light_Rent.md (lean-floor bookend) · 3_AI_Full_Build.md (incl. Data Dividend for Delayed Build Starts) · 4_AI-Build-Posture_Full-Context.md — AI Cost Model, May 2026.

1

What Drives These Costs: Per-Member AI Engagement Breakdown

← 1_Usage_and_Inference_Demand.md
▼

Every cost figure in this model traces back to a concrete set of AI engagement events per member per year. The baseline is ~496 personal agent events/member/year (Y2+ steady-state), plus clinical encounter events layered on top.

A. Personal Agent Events (496/member/year)
Feature Category Events/member/yr LLM calls/event Tokens/call Routing (Frontier / Mid / Flash) Voice & OCR per event
Passive / proactive
Daily briefings, wearable alerts, weekly patterns
247 1 500 0% / 10% / 90% None
Admin / reminders
Medication adherence, appointments
109 1 500 0% / 20% / 80% 30% voice: 1 min ASR + 700 chars TTS
Wellness / diet / medication coaching 113 2 1,500 0% / 30% / 70% 50% voice: 1.5 min ASR + 1K chars TTS; 20% meal photo: 1 OCR page
Symptom triage / care navigation 14 5 3,000 20% / 60% / 20% 50% voice: 2 min ASR + 1.2K chars TTS
Document / lab / prescription help 13 3 6,000 20% / 70% / 10% 2 OCR pages / event
Total personal agent events 496
B. Clinical Encounter Events
Encounter Type Rate/member/yr (Y5) LLM calls/encounter Key Workflows
CHW visits 5.0 6 Pre-visit prep, screening, escalation summary, follow-up, voice capture
Doctor consults
Telehealth + clinic
13.0 5 Pre-consult summary, differential + safety check, care plan + documentation
Specialist referrals 2.0 6 Referral package, post-visit reconciliation, patient summary
Hospital admissions 0.15 7 Admission brief, discharge reconciliation, discharge instructions
Pharmacy events 12.0 3 Drug interaction, substitution, adherence plan
Post-encounter reconciliation fires after CHW + doctor 1 Longitudinal record update
C. Y5 Per-Member Demand Summary
Modality Annual Volume per Member
LLM calls (all tiers) 855
LLM tokens (all tiers, incl. 1.4x overhead) 2.56M
   — frontier tokens 329K
   — mid-tier tokens 1,395K
   — flash tokens 834K
ASR minutes 213
TTS characters 113,373
OCR pages 64
2

User Segments: Who Uses What

← 1_Usage_and_Inference_Demand.md
▼

Not all members engage equally. The 496 events/year is a blended average across three engagement segments. This distribution is critical — it means 20% of users drive disproportionate cost, and 40% barely use the personal agent at all.

Segment Share Events/year Profile
Power users 20% 1,383 Chronic disease (diabetes, cardiac, pregnancy), daily wearable, highly motivated
Moderate 40% 399 Engaged 2–3x/week, responds to nudges, logs meals occasionally
Light 40% 150 Enrolled via insurance, minimal self-initiated use, engages around CHW visits
Blended 100% ~496 Weighted average across segments

Verification: (0.20 × 1,383) + (0.40 × 399) + (0.40 × 150) = 276.6 + 159.6 + 60.0 = 496.2 ≈ 496

3

Pricing Assumptions: API & Self-Hosted Rates

← 2B_AI_Light_Rent_Lean.md §6.1 + §6.2 · 3_AI_Full_Build.md
▼
Operating-plan multi-year API rates (2B §6.1)

This explorer’s rent inference baseline embeds the year-over-year price trajectory below (already reflected in §6.2 totals loaded into BASE_DATA.rent). FX held at ₹92/$ throughout.

Service Y1 Y2 Y3 Y4 Y5 Use in model
Frontier LLM ($/M tokens) $10.00 $9.00 $8.10 $7.29 $6.56 Differential diagnosis, safety checks, specialist referral briefs
Mid-tier LLM ($/M tokens) $4.38 $2.63 $1.58 $0.95 $0.57 Pre-consult summaries, care plans, coaching, reconciliation
Flash LLM ($/M tokens) $0.260 $0.160 $0.090 $0.060 $0.030 Reminders, admin, passive briefings, low-risk follow-ups
STT ($/minute) $0.0060 $0.0042 $0.0029 $0.0021 $0.0014 Voice input (transcript gated on-device)
TTS ($/minute) $0.0150 $0.0105 $0.0074 $0.0051 $0.0036 Voice responses (1K chars = 1 min)
Vision + document extraction ($/page) $0.0300 $0.0180 $0.0108 $0.0065 $0.0039 Meal photos, prescriptions, labs, notes, discharge summaries
API Prices (May 2026 list snapshot — lean-floor contrast)
Service Unit Price Use
Frontier LLM (Opus / GPT-4.5) $10.00 / M tokens Differential diagnosis, safety checks
Mid-tier LLM (GPT-4o / Sonnet) $4.38 / M tokens Pre-consult summaries, care plans
Flash LLM (Gemini Flash) $0.26 / M tokens Reminders, admin, low-risk follow-ups
ASR (Indic + English) $0.006 / minute Voice input
TTS (Indic + English) $0.015 / 1K chars Voice responses
OCR / document extraction $0.03 / page Prescriptions, lab reports
Self-Hosted Prices (Build posture, by model phase)
Model $ / M tokens What It Serves
Phase 1 (8B active, MoE 30B) $0.30 First proprietary model
Phase 2 (60B active, MoE 300B) $0.50 Mid-scale clinical workloads
Phase 3 / GP-level (100B active, MoE ~900B) $0.80 Full GP model
Cloud distilled clinical $0.12 Mid-tier equivalent
Cloud distilled flash $0.02 Routine / admin
Local / JORO / on-device $0.00 Eligible workloads on user hardware

2B embeds annual API price declines in the §6.1 trajectory (see table above); the “API Price Decline Rate” slider stacks extra sensitivity. Self-hosted inference cost assumes amortized GPU cluster (768× H100/H200) over 5 years.

4

Build Investment Breakdown: Where the Capex Goes

← 3_AI_Full_Build.md
▼
Phased Model Training — Kill Gates

Each phase has a binary kill-gate: if the answer is "no", stop investing and revert to Rent. This is the core risk-management mechanism of the Build posture.

Phase Timeline Model Cost Kill-Gate Question
0 Months 1–6 Fine-tune OSS 70B $0.3M Does fine-tuning improve Indian clinical performance?
1 Months 6–12 30B MoE (8B active) $7.4M Beat frontier APIs on Indian clinical evals?
2 Months 12–18 300B MoE (60B active) $39.7M Achieve task-shift multipliers in pilot?
3 Months 18–24 800B–1T MoE (100B active) $199M Full production deployment
Full 5-Year Capex — Y1 Build Anchor
Component Cost ($M)
Training compute (Phases 0–3 + retraining) $288M
Corpus construction $22M
Manual annotation (1,776 peak FTE) $30M
Synthetic data pipeline $13M
Eval & safety infrastructure $11M
Distillation + local modalities $18M
GPU cluster (768 H100/H200) + clinical validation $66M
PII gate + orchestration $4M
R&D team (22→79 headcount, $225K → $300K avg all-in) $94.6M
Total 5-yr capex ~$546M

Component sum with revised R&D ($94.6M) is $546.6M; rounded to $546M to match the underlying BASE_DATA.buildY1.capex JS series [116.9, 286.9, 51.2, 46.1, 44.9] which sums exactly to $546.0M. The +$0.6M delta sits inside year-by-year rounding and is not redistributed.

R&D Team Ramp by Year (260507 xlsx · Product & Engineering)

Per-year build of the R&D org from the 260507 unit-economics xlsx (Fixed Costs > Product and Engineering). Headcount mix: AI/ML, Platform, Product Eng, Eval/Safety/Data, Leadership. Salary all-in averages step from $225K (Y1) to $300K (Y5). Overhead included.

Year Headcount Payroll ($K) Total R&D ($M) Total R&D (₹ Cr)
Y1224,950$9.06M₹83.4
Y2358,750$13.12M₹120.7
Y34711,750$16.71M₹153.7
Y46317,325$23.82M₹219.2
Y57923,700$31.92M₹293.7
5-yr—66,475$94.6M₹870.6

FX ₹92/$. The rent-path capex series in BASE_DATA.rent.capex ([9.1, 13.1, 16.7, 23.8, 31.9, …]) already matches this xlsx ramp to rounding; no JS array edit required, only the headcount label and the build-capex total were updated. Caveat: the xlsx Fixed Costs sheet labels overhead "30%" but values resolve to a 20% multiplier (matching Assumptions); the row label is a known source-doc inconsistency and does not affect the totals shown.

Data Dividend for Delayed Build Starts

When Build starts after Y1, the platform has already accumulated consented usage data. The model credits only the portion that survives consent, de-ID, clinical relevance, and quality filters, reducing annotation and corpus construction costs while leaving training compute, GPU cluster, synthetic data, safety, distillation, and team costs unchanged.

Build Start Clean Data Clinical Annotation Pool Annotation Discount Annotation Cost Corpus Discount Corpus Cost Data-Creation Savings
Y10.0B0.0B0.0%$30.1M0.0%$22.0M$0.0M
Y26.4B1.2B2.0%$29.5M1.6%$21.6M$1.0M
Y360.1B10.8B18.6%$24.5M15.0%$18.7M$8.9M
Y41,824.1B328.3B55.0%$13.5M25.0%$16.5M$22.1M
Y520,548.1B3,698.7B55.0%$13.5M25.0%$16.5M$22.1M

Calibration note (post-260507 ramp revision): Y4/Y5 clean-token and clinical-annotation pools above were sized against the prior 15M / 50M member ramp. With the revised 8M / 25M ramp, those token columns scale down ~50%. Annotation and corpus discount %s and the resulting savings columns are governed by the discount caps (55% / 25%) and so remain materially unchanged for Y4/Y5; only Y2/Y3 (uncapped) would shift modestly. Full recompute pending.

4B

Data Platform — Omics / Pharma Monetization

← 260507_JioCare Memo v4 + Unit Economics xlsx
▼

Reference-only panel. The Rent-vs-Build cost charts above scope only AI infrastructure spend (rent inference vs. build capex); they do not ingest any of the data-platform revenue or cost lines below. This panel captures the parallel data-platform P&L introduced by the 7 May 2026 sent memo + unit-economics xlsx, where consented multi-omics + biobank data is licensed to pharma and AI/biotech buyers. Y5 platform licensing is the largest single revenue line in the consolidated model (~40% of combined revenue) and the principal driver of combined EBITDA crossover.

Source-of-truth note: all numbers below are from the xlsx Dashboard sheet (the 25M-Y5 primary path) and the memo §06 Financials, both pegged to the same revised member ramp shown in Panel 6. The xlsx also contains an Analysis_JioCare Deck sensitivity sheet that runs a 50M-Y5 path with different cost build-ups; that sensitivity is not the source for any number on this page.

Consent & Licensed-Record Build-Up
Metric Y1 Y2 Y3 Y4 Y5
% Members consenting to omics use10%20%30%40%50%
Licensing partnerships (#)001210
Avg dataset size (longitudinal records)——8,00040,00080,000
Licensed complete records (#)005,33353,333533,333
Licensed records as % of members0%0%0.36%0.67%2.13%
Platform Revenue (₹ Cr · xlsx Dashboard)
Line Y1 Y2 Y3 Y4 Y5
Platform Licensing Revenue00404008,000
Annual value per linked record (₹)——50,00050,000100,000

Memo Exec Summary headline: "~₹8,000 Cr Y5 platform licensing". Pharma-readiness caveat applies — Y5 figure assumes signed multi-buyer cohort licensing tranches.

Direct Platform Costs & Data EBITDA (₹ Cr)
Line Y1 Y2 Y3 Y4 Y5
Biobanking0.080.7835.2250.2977.5
Omics prep + sequencing0021.3160.0960.0
Partner licensing & rev-share (30% of partner data)009.696.01,920.0
Fixed Costs — Bio & Data ops10203060100
Data EBITDA (₹ Cr)−10.1−20.8−56.1−166.2+4,042.5
Unit Costs (xlsx)
Metric Y1 Y2 Y3 Y4 Y5
Blended omics unit cost (₹)67,00052,00040,00030,00018,000
Biobank holding (₹/member/yr)782782782782782

Cold-chain holding ~₹500/sample-set/year; multi-aliquot blend gives ₹782/member/yr. Memo §Connector: "~₹4 Cr/yr holding at 250K-member scale".

Combined Revenue Mix — Care + Data (₹ Cr · xlsx Dashboard)
Line Y1 Y2 Y3 Y4 Y5
Care revenue (insurer fee + shared savings)3.016.8575.03,433.411,792.9
Data Platform revenue0040.0400.08,000.0
Combined revenue3.016.8615.03,833.419,792.9
% from Data Platform0%0%6.5%10.4%40.4%

Combined EBITDA crosses positive in Y5 at ~₹4,926 Cr (~$535M, FX ₹92/$), driven by the data-platform line. Care EBITDA alone turns positive at ~₹883 Cr Y5; Data EBITDA contributes the remaining ~₹4,043 Cr. Methodology caveat: the xlsx Y5 AI inference cost of $2.36/member/yr (heavy edge + custom-SLM routing) is more aggressive than this page's RENT baseline of $3.94/member/yr from 2B_AI_Light_Rent_Lean.md. The two views are reconcilable but use different source documents: this page's cost charts are the conservative rent posture; the xlsx care EBITDA above already books the more aggressive routing assumption.

External Deal Comparables (Memo §Data Strategy + Appendix B)
Comparable Headline value Per-participant / per-record
23andMe (consumer SNP)—~$60 / participant
deCODE / Amgen (deep WGS)$415M~$2,600 / participant
23andMe — GSK~₹2,500 Cr—
Tempus — AstraZeneca$200M—
Roche — Flatiron$1.9B~$860 / record
UK Biobank — pharma tranches₹40–120 Cr each$4,000–6,000 / participant
GSK — Tempus (upfront)$70M—
Valo — Novo Nordisk (upside)up to $4.6B—

23andMe vs deCODE shows the 43× per-participant premium for deeply-phenotyped, multi-omics, longitudinally-linked cohorts — the positioning JioCare's biobank+EHR linkage targets. Y1 commercial target per Appendix B (signed-deal value across the four asset families): ₹29–65 Cr. The ₹40 Cr Y3 modeled licensing line in the xlsx Dashboard is broadly consistent with these LOI ranges; revenue recognition lags signed deal value, so Y1 LOIs feed Y2/Y3 booked revenue.

Sensitivity Anchors (Memo §Sensitivities)
  • First pharma LOI Y2 vs Y3 ⇒ +₹5,500 Cr Y5 EBITDA
  • JHA capacity halved (250 → 125 HH/mo) ⇒ −₹2,700 Cr Y5 EBITDA
  • CAC stuck at ₹350 (DTC path) vs glide to ₹100 ⇒ −₹800 Cr Y5 EBITDA
  • Loss-ratio glide 75 → 68 (vs 75 → 60) ⇒ insurance-fee path collapses (~50% per-member fee reduction)
5

Overhead Multiplier: Why Raw Demand × 1.4

← 1_Usage_and_Inference_Demand.md
▼

The per-member demand figures (855 LLM calls, 2.56M tokens) already include a 1.4x overhead multiplier applied to raw engagement volumes. This multiplier captures the production reality that every user-facing AI call triggers additional system calls.

Overhead What It Covers Multiplier
Safety pipeline Input guardrails, output hallucination check, clinical audit log 1.15×
PII filtration model Separate NER scrub per cloud call +2% calls
Data validation model Input quality checks on health data +1% calls
Real-world overhead Retries, cache misses, multilingual expansion, A/B testing 1.20×
Combined effective multiplier ~1.4×
6

Member Ramp with Context

← 4_AI-Build-Posture_Full-Context.md
▼

The member ramp is the single largest driver of total cost. The 2,500× growth from Y1 to Y5 (10K → 25M) assumes Jio distribution activation — the same channel that scaled JioPhone to 100M+ users. Updated 7 May 2026: Y4/Y5 ramp pulled in from 15M/50M to 8M/25M to match the 260507 sent memo + unit-economics xlsx.

Year Members What's Happening
Y1 10,000 Jamnagar pilot — Reliance employee families
Y2 50,000 Expanded pilot — Gujarat / Mumbai metro
Y3 1,500,000 National launch — first mass enrollment
Y4 8,000,000 Rapid scale — Jio distribution activated
Y5 25,000,000 Full national scale (revised down from 50M per 260507 memo)

Y6–Y7 hold at 25M (steady state). The "Usage Growth Speed" slider above scales all member counts linearly — at 2.0x, Y5 = 50M members.