Contextual AI Pricing

Pricing that scales with your workload, from demo to production

Platform Pricing Overview

The Contextual AI Platform offers two pricing models depending on your needs: (1) on-demand pricing based on platform usage and (2) provisioned throughput pricing to meet robust performance requirements with monthly commitments.

On-Demand

Pay-as-you-go pricing that scales with your usage of the platform

Get started

Claim your $25 in free credits


Query:

  • Approx. $0.05 per query
    (Actual price depends on total tokens processed)

Document ingestion:

  • $48.50 per 1,000 pages
Provisioned Throughput

Guaranteed capacity for predictable throughput performance

Contact sales

 


Model Unit:

  • Each Model Unit provides a guaranteed minimum throughput and can be purchased with a monthly minimum commitment
Platform Management On-Demand Provisioned Throughput



Number of Users Unlimited Unlimited
Number of Admins Unlimited Unlimited
Number of Workspaces Unlimited Unlimited
Number of Agents Unlimited Unlimited
Number of Datastores Unlimited Unlimited
User Roles and Admin Permissions
Document Access Entitlements
SOC2 Type II Compliance
HIPAA Compliance
SAML / SSO
Role-based Access Control (RBAC)
Usage Analytics
Pipeline Observability

 

Data Ingestion On-Demand Provisioned Throughput



Support for simple Text documents
Support for complex docs, charts, and images
Support for unstructured data
Support for structured data Contact sales Contact sales
UI-based ingestion
Continuous data ingestion Contact sales Contact sales
Standard Data Integrations
Custom Data Integrations Contact sales Contact sales
Standard Data Retention
Custom Data Retention Contact sales

 

Contextual AI Agents On-Demand Provisioned Throughput



Query Optimization
Reformulates and decomposes query for better retrieval
Retrieve
Gets relevant docs from knowledge base
Rerank
Reorders relevant docs by relevance
Filtering
Selects top-k most relevant docs
Generate
Produces response using selected docs
Groundedness & Safety
Evaluates whether generated response is supported by retrieved docs
Tokens Per Second (TPS) Throughput Commitment

 

Deployment On-Demand Provisioned Throughput



Contextual SaaS
Dedicated Contextual VPC Contact sales
Customer VPC Contact sales

Component APIs Pricing Overview

For AI teams needing more flexibility to work with their existing RAG architecture, Contextual AI provides powerful platform primitives as component APIs with usage-based pricing.

 

Parse

Our multi-stage document understanding pipeline for converting unstructured content into AI-ready formats

Price

  • Basic (text only): $3 / 1,000 pages
  • Standard (multimodal): $40 / 1,000 pages
Rerank

The first instruction-following reranker, providing greater control over how retrieved knowledge is prioritized

Price

  • Input: $0.12 / 1M tokens
Generate

The most grounded large language model in the world, engineered specifically to minimize hallucinations

Price

  • Input: $3 / 1M tokens
  • Output: $15 / 1M tokens
LMUnit

Our evaluation-optimized model for preference, direct scoring, and natural language unit test evaluation

Price

  • Input: $3 / 1M tokens