Embedding Cost Calculator
Estimate the cost of vectorizing your document corpus for RAG or semantic search. Compare OpenAI, Gemini and Mistral embedding prices.
Detected: 0 words ≈ 0 tokens (approx. per document)
Frequently asked questions
How much does it cost to embed 1 million documents?
With text-embedding-3-small ($0.02/1M tokens), embedding 1 million 500-word documents (~375 tokens each) costs about $7.50 as a one-time cost. With text-embedding-3-large ($0.13/1M tokens), the same corpus costs ~$48.75. You only pay again when documents are updated or added.
Which embedding model is most cost-effective?
text-embedding-3-small is the best default for most RAG applications — it's cheap ($0.02/1M tokens), fast and performs well for English text. Only upgrade to large if retrieval quality is measurably poor. Gemini Embedding 2 and Mistral Embed are similarly priced and good alternatives.
Is it cheaper to use RAG embeddings or pass documents as context?
Embeddings are a one-time cost. Passing the full document as LLM context costs every query and scales with volume. For more than a few hundred queries per month, a RAG architecture is almost always cheaper. Use the File analyzer to compare the per-query cost of context-passing vs. one-time embedding.