Embedding Cost Calculator

Estimate the cost of vectorizing your document corpus for RAG or semantic search. Compare OpenAI, Gemini and Mistral embedding prices.

Price per 1K tokens.
We estimate tokens from this example.

Detected: 0 words ≈ 0 tokens (approx. per document)

How many documents will you embed?
For ongoing monthly cost.

Frequently asked questions

How much does it cost to embed 1 million documents?

With text-embedding-3-small ($0.02/1M tokens), embedding 1 million 500-word documents (~375 tokens each) costs about $7.50 as a one-time cost. With text-embedding-3-large ($0.13/1M tokens), the same corpus costs ~$48.75. You only pay again when documents are updated or added.

Which embedding model is most cost-effective?

text-embedding-3-small is the best default for most RAG applications — it's cheap ($0.02/1M tokens), fast and performs well for English text. Only upgrade to large if retrieval quality is measurably poor. Gemini Embedding 2 and Mistral Embed are similarly priced and good alternatives.

Is it cheaper to use RAG embeddings or pass documents as context?

Embeddings are a one-time cost. Passing the full document as LLM context costs every query and scales with volume. For more than a few hundred queries per month, a RAG architecture is almost always cheaper. Use the File analyzer to compare the per-query cost of context-passing vs. one-time embedding.