GenAI Platform Pricing Public Preview

DigitalOcean GenAI Platform lets you build GPU-powered AI agents with fully-managed deployment. Agents can use pre-built or custom foundation models, incorporate function and agent routes, and implement RAG pipelines with knowledge bases.


The GenAI Platform follows a usage-based pricing model, charging based on the models you select, their frequency of use, and additional features like knowledge bases (KBs) and guardrails. This pricing model ensures that costs adjust according to your usage, helping you manage spending as your needs evolve.

Note
We display pricing per million tokens but bill usage per thousand tokens to provide more accurate charges and avoid overcharging.

An agent’s token usage depends on several factors, including the length of the user’s input, complexity of agent instructions, knowledge base data, and other configurations. To optimize your token usage, test your agents and adjust their settings accordingly.

Open Source Models

We charge you for all input and output tokens to the model when using an agent. We offer the Model Playground for free, but limit the amount of tokens (input and output combined) you can use for each model per day per team.

Here are the token rates for each model:

Model Price
DeepSeek-R1-distill-llama-70B $0.99 per 1,000,000 input tokens
$0.99 per 1,000,000 output tokens
Llama 3.1 8B $0.198 per 1,000,000 input tokens
$0.198 per 1,000,000 output tokens
Llama 3.1 70B $0.70 per 1,000,000 input tokens
$0.70 per 1,000,000 output tokens
Llama 3.3 70B $0.65 per 1,000,000 input tokens
$0.65 per 1,000,000 output tokens
Mistral NeMo $0.30 per 1,000,000 input tokens
$0.30 per 1,000,000 output tokens

Model costs are per token, not for creation.

Commercial Models

Pricing for input and output tokens for commercial models follows the provider’s standard rates. Your model provider bills your account with them directly since you use your own API tokens.

Standard rates for commercial models include:

Knowledge Bases

Creating a KB involves two actions:

  1. Transforming the provided data into vector embeddings (indexing). You are charged based on the number of tokens indexed into the KB.
  2. The storage of these vector embeddings is dictated by OpenSearch pricing.

Here are the indexing token rates for embedding models by size:

Model Price
all-mini-lm-l6-v2 $0.009 per 1,000,000 input tokens
multi-qa-mpnet-base-dot-v1 $0.009 per 1,000,000 input tokens
gte-large-en-v1.5 $0.09 per 1,000,000 input tokens

Guardrails

We charge for all input and output tokens to the model based on the guardrail:

Guardrail Price
Content Moderation $3.00 per 1,000,000 tokens
Jailbreak Detection $3.00 per 1,000,000 tokens
Sensitive Data Detection $0.34 per 1,000,000 tokens

Guardrail costs are per token, not for creation. Editing or duplicating these guardrails does not change their price.

Functions

When you add DigitalOcean Functions to your agent, you are charged based on Functions pricing.

In this article...