Contextual AI Pricing
Pricing that scales with your workload, from demo to production
Platform Pricing Overview
The Contextual AI Platform offers two pricing models depending on your needs: (1) on-demand pricing based on platform usage and (2) provisioned throughput pricing, which guarantees capacity in exchange for a monthly commitment.
On-Demand
Pay-as-you-go pricing that scales with your usage of the platform
Get started: claim your $25 in free credits
Query:
- Approx. $0.05 per query
(Actual price depends on total tokens processed)
Document ingestion:
- $48.50 per 1,000 pages
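As a rough illustration of how the two on-demand rates combine, the sketch below estimates a bill from a query count and a page count. The function name and the example workload are hypothetical. The per-query rate is only approximate, since the actual price depends on total tokens processed, and pro-rata billing for partial blocks of 1,000 pages is an assumption.

```python
from decimal import Decimal

# Rates from the On-Demand list above. The per-query figure is approximate:
# the actual price depends on total tokens processed.
APPROX_PRICE_PER_QUERY = Decimal("0.05")
PRICE_PER_1000_PAGES = Decimal("48.50")


def estimate_on_demand_cost(queries: int, pages_ingested: int) -> Decimal:
    """Estimate on-demand spend in USD (hypothetical helper).

    Assumes ingestion is billed pro rata rather than in whole
    blocks of 1,000 pages.
    """
    query_cost = APPROX_PRICE_PER_QUERY * queries
    ingestion_cost = PRICE_PER_1000_PAGES * pages_ingested / 1000
    return query_cost + ingestion_cost


# Example workload (made up): 10,000 queries and 2,000 ingested pages.
estimate = estimate_on_demand_cost(queries=10_000, pages_ingested=2_000)
print(f"Estimated cost: ${estimate:.2f}")  # Estimated cost: $597.00
```

Here the queries contribute about $500 and ingestion $97. The $25 in free credits is not modeled.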
Provisioned Throughput
Guaranteed capacity for predictable throughput performance
Contact sales
Model Unit:
- Each Model Unit provides a guaranteed minimum throughput and can be purchased with a monthly minimum commitment
Platform Management | On-Demand | Provisioned Throughput |
Number of Users | Unlimited | Unlimited |
Number of Admins | Unlimited | Unlimited |
Number of Workspaces | Unlimited | Unlimited |
Number of Agents | Unlimited | Unlimited |
Number of Datastores | Unlimited | Unlimited |
User Roles and Admin Permissions | ✓ | ✓ |
Document Access Entitlements | ✓ | ✓ |
SOC2 Type II Compliance | ✓ | ✓ |
HIPAA Compliance | ✓ | ✓ |
SAML / SSO | ✓ | ✓ |
Role-based Access Control (RBAC) | ✓ | |
Usage Analytics | ✓ | |
Pipeline Observability | ✓ |
Data Ingestion | On-Demand | Provisioned Throughput |
Support for simple text documents | ✓ | ✓ |
Support for complex docs, charts, and images | ✓ | ✓ |
Support for unstructured data | ✓ | ✓ |
Support for structured data | Contact sales | Contact sales |
UI-based ingestion | ✓ | ✓ |
Continuous data ingestion | Contact sales | Contact sales |
Standard Data Integrations | ✓ | ✓ |
Custom Data Integrations | Contact sales | Contact sales |
Standard Data Retention | ✓ | ✓ |
Custom Data Retention | Contact sales |
Contextual AI Agents | On-Demand | Provisioned Throughput |
Query Optimization: Reformulates and decomposes query for better retrieval | ✓ | ✓ |
Retrieve: Gets relevant docs from knowledge base | ✓ | ✓ |
Rerank: Reorders relevant docs by relevance | ✓ | ✓ |
Filtering: Selects top-k most relevant docs | ✓ | ✓ |
Generate: Produces response using selected docs | ✓ | ✓ |
Groundedness & Safety: Evaluates whether generated response is supported by retrieved docs | ✓ | ✓ |
Tokens Per Second (TPS) Throughput Commitment | ✓ |
Deployment | On-Demand | Provisioned Throughput |
Contextual SaaS | ✓ | ✓ |
Dedicated Contextual VPC | Contact sales | |
Customer VPC | Contact sales |
Component APIs Pricing Overview
For AI teams needing more flexibility to work with their existing RAG architecture, Contextual AI provides powerful platform primitives as component APIs with usage-based pricing.

Our multi-stage document understanding pipeline for converting unstructured content into AI-ready formats
Price
- Basic (text only): $3 / 1,000 pages
- Standard (multimodal): $40 / 1,000 pages

The first instruction-following reranker, providing greater control over how retrieved knowledge is prioritized
Price
- Input: $0.12 / 1M tokens

The most grounded large language model in the world, engineered specifically to minimize hallucinations
Price
- Input: $3 / 1M tokens
- Output: $15 / 1M tokens

Our evaluation-optimized model for preference, direct scoring, and natural language unit test evaluation
Price
- Input: $3 / 1M tokens
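
To make the usage-based rates above concrete, the following sketch computes costs for the token-priced components and for document parsing. The dictionary keys (`reranker`, `grounded_llm`, `evaluation_model`, `basic`, `standard`) and both function names are hypothetical labels chosen for illustration, not official product or API names. Pro-rata billing is also an assumption.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)

# USD per 1M tokens, from the price lists above.
# Keys are hypothetical labels, not official API names.
TOKEN_RATES = {
    "reranker": {"input": Decimal("0.12")},
    "grounded_llm": {"input": Decimal("3"), "output": Decimal("15")},
    "evaluation_model": {"input": Decimal("3")},
}

# USD per 1,000 pages for the document understanding pipeline.
PAGE_RATES = {"basic": Decimal("3"), "standard": Decimal("40")}


def token_cost(component: str, input_tokens: int = 0, output_tokens: int = 0) -> Decimal:
    """Cost in USD for a token-priced component (assumes pro-rata billing)."""
    rates = TOKEN_RATES[component]
    if output_tokens and "output" not in rates:
        # Only an input price is listed for this component.
        raise ValueError(f"no output price listed for {component!r}")
    cost = rates["input"] * input_tokens / MILLION
    cost += rates.get("output", Decimal(0)) * output_tokens / MILLION
    return cost


def parsing_cost(tier: str, pages: int) -> Decimal:
    """Cost in USD to parse `pages` pages at the given tier (assumes pro-rata billing)."""
    return PAGE_RATES[tier] * pages / 1000


# Example: 2M input tokens and 500K output tokens on the grounded LLM.
print(f"${token_cost('grounded_llm', 2_000_000, 500_000):.2f}")  # $13.50
# Example: 5,000 pages at each parsing tier.
print(f"${parsing_cost('basic', 5_000):.2f}")     # $15.00
print(f"${parsing_cost('standard', 5_000):.2f}")  # $200.00
```

The parsing example shows how much the tier choice matters: the same 5,000 pages cost $15 as text only and $200 as multimodal.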