Contextual AI Pricing

Pricing that scales with your workload, from demo to production

 

Platform Pricing Overview

The Contextual AI Platform offers flexible sizing for RAG agents and retrieval datastores that you can seamlessly upgrade in place as your workload scales.

 

Contextual RAG Agents

Agents optimized end-to-end for RAG that are customizable to your domain-specific use case

Contextual Datastores

Scalable storage for agents to actively retrieve your enterprise knowledge

Contextual RAG Agents

Contextual Datastores

Standalone APIs Pricing Overview

For users looking to enhance the performance of their existing RAG architectures, Contextual AI provides powerful standalone components with token-based pricing.

 

Reranker

The first instruction-following reranker, providing greater control over how retrievals are prioritized

Price

  • Input: $0.12 / 1M tokens
GLM

The most grounded language model in the world, engineered specifically to minimize hallucinations

Price

  • Input: $3 / 1M tokens
  • Output: $15 / 1M tokens
LMUnit

An evaluation model optimized for assessing LLM responses with fine-grained unit tests

Price

  • Input: $3 / 1M tokens