DigitalOcean Gradient™ AI Platform Pricing

Validated on 29 Aug 2025 • Last edited on 2 Sep 2025

DigitalOcean Gradient™ AI Platform lets you build fully-managed AI agents with knowledge bases for retrieval-augmented generation, multi-agent routing, guardrails, and more, or use serverless inference to make direct requests to popular foundation models.

Gradient AI Platform has a usage-based pricing model, so costs scale with your actual usage. We charge for model usage (input and output tokens) and for additional features like knowledge bases, guardrails, and log stream insights.

We display prices per million tokens. We charge per thousand tokens for accuracy.

Token usage depends on factors such as the length of user input, complexity of agent instructions, attached knowledge bases, and configuration settings. To optimize usage, test your agents and adjust their parameters.

Open Source Models (Agent Usage)

You are charged for all input and output tokens processed by an agent. Agent creation is free.

Model Price
DeepSeek-R1 distill-llama-70B $0.99 per 1,000,000 input tokens
$0.99 per 1,000,000 output tokens
Llama 3.1 8B $0.198 per 1,000,000 input tokens
$0.198 per 1,000,000 output tokens
Llama 3.3 70B $0.65 per 1,000,000 input tokens
$0.65 per 1,000,000 output tokens
Mistral NeMo $0.30 per 1,000,000 input tokens
$0.30 per 1,000,000 output tokens
OpenAI gpt-oss-120b $0.10 per 1,000,000 input tokens
$0.70 per 1,000,000 output tokens
OpenAI gpt-oss-20b $0.05 per 1,000,000 input tokens
$0.45 per 1,000,000 output tokens

Serverless Inference (Direct API Usage)

Serverless inference lets you call models directly through the API without creating an agent.

All usage is billed per input and output token.

Commercial Models

When using commercial models, billing is handled directly by the provider at the provider’s rates through your API key with them:

Knowledge Bases

Knowledge base pricing includes:

  • Indexing tokens: You are charged for the tokens required to generate embeddings.

    For example, a 10 MB dataset takes about 3 million tokens, and a 1 GB dataset takes about 250 million tokens.

    Actual costs depend on the embedding model:

    Model Price
    all-mini-lm-l6-v2 $0.009 per 1,000,000 input tokens
    multi-qa-mpnet-base-dot-v1 $0.009 per 1,000,000 input tokens
    gte-large-en-v1.5 $0.09 per 1,000,000 input tokens

    One token is roughly four characters, or about 75 words per 100 tokens. Non-Latin scripts, emojis, or binary data may increase token counts.

  • Storage: Embeddings are stored in OpenSearch. See OpenSearch pricing.

Guardrails

Charges apply for all tokens processed through guardrails:

Guardrail Price
Content Moderation $0.20 per 1,000,000 tokens
Jailbreak Detection $0.20 per 1,000,000 tokens
Sensitive Data Detection $0.34 per 1,000,000 tokens

Costs are per token. Creating, editing, or duplicating guardrails has no additional cost.

Functions

If you attach DigitalOcean Functions to your agent, you are billed at functions pricing.

Agent Evaluations

Agent evaluations are charged by token usage at the same rates as model usage. If you evaluate with a commercial model, billing is handled directly by the provider.

Log Stream Insights

Log stream insights uses a third-party model to analyze agent trace data. You are charged per token:

Tokens Price
Input $4.00 per 1,000,000 tokens
Output $20.00 per 1,000,000 tokens

We can't find any results for your search.

Try using different keywords or simplifying your search terms.