Pricing

Simple, transparent pricing

Building in-house costs $300K–$500K/year and takes 6–12 months. We cost a fraction of that and are ready on day one.

Explorer
Free

For individuals evaluating datasets and exploring the catalog.

Full catalog access
5 sample records per dataset
Schema & provenance preview
Not included: full dataset downloads, API access, commercial license, provenance PDF export
Start free
Most popular
Builder
$149/month
Save 17% with annual ($1,490/yr)

For ML engineers and data scientists building biomedical AI models.

Everything in Explorer
Full dataset downloads (CSV, JSON, Parquet)
100,000 API calls / month
Commercial license for all datasets
Provenance PDF export
Audit log for compliance documentation
Custom curation requests
Start Builder trial
Enterprise
Custom

For pharma, biotech, and health systems with bespoke data needs.

Everything in Builder
Unlimited API calls
Custom dataset curation
FDA-grade provenance documentation
Population-stratified datasets
SLA & dedicated support
Private data catalog integration
Contact sales

Common questions

Can I use these datasets in commercial AI products?

Yes. Every dataset tagged "Commercial" comes with a cleared commercial license. Research-only datasets are limited to academic or non-commercial use.

What formats are datasets delivered in?

Parquet (default, ML-ready), JSON/JSONL, and CSV. Hugging Face Datasets format is available on Builder and Enterprise.
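
As a rough illustration of how the text formats differ once loaded, here is a minimal Python sketch using only the standard library. The sample records and the field names (gene, score) are made up for this example and are not the catalog's actual schema.

```python
import csv
import io
import json

# Hypothetical sample content; the field names and values are illustrative
# only and do not reflect the catalog's real schema.
JSONL_TEXT = '{"gene": "CYP2D6", "score": 0.9}\n{"gene": "TPMT", "score": 0.7}\n'
CSV_TEXT = "gene,score\nCYP2D6,0.9\nTPMT,0.7\n"


def read_jsonl(text):
    """Parse one JSON object per non-empty line."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def read_csv(text):
    """Parse CSV rows into dicts; every value arrives as a string."""
    return list(csv.DictReader(io.StringIO(text)))


jsonl_records = read_jsonl(JSONL_TEXT)
csv_records = read_csv(CSV_TEXT)

# CSV carries no type information, so numeric columns need an explicit cast.
csv_typed = [{"gene": r["gene"], "score": float(r["score"])} for r in csv_records]
```

JSONL keeps numbers as numbers, while CSV values need casting after load. Parquet stores column types in the file itself, which avoids that step.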

Can I use provenance PDFs in an FDA submission?

Yes. Provenance reports include the source institution, curation methodology, known limitations, and an audit trail, meeting FDA data governance requirements for AI/ML submissions.

What's the difference from scraping PharmGKB or ClinVar myself?

Cost and time. We cost 3–10× less than hiring in-house ($300K–$500K/yr) and get you started 6–12 months sooner. We also handle the license clearing, schema normalization, quality scoring, and provenance documentation that are hard to produce internally.

Ready to stop rebuilding data pipelines?

Start with the free Explorer plan. Upgrade when you're ready to ship.