DigitalOcean Gradient™ AI Platform Pricing
Validated on 29 Aug 2025 • Last edited on 2 Sep 2025
DigitalOcean Gradient™ AI Platform lets you build fully-managed AI agents with knowledge bases for retrieval-augmented generation, multi-agent routing, guardrails, and more, or use serverless inference to make direct requests to popular foundation models.
Gradient AI Platform has a usage-based pricing model, so costs scale with your actual usage. We charge for model usage (input and output tokens) and for additional features like knowledge bases, guardrails, and log stream insights.
We display prices per million tokens. We charge per thousand tokens for accuracy.
Token usage depends on factors such as the length of user input, complexity of agent instructions, attached knowledge bases, and configuration settings. To optimize usage, test your agents and adjust their parameters.
Open Source Models (Agent Usage)
You are charged for all input and output tokens processed by an agent. Agent creation is free.
Model | Price |
---|---|
DeepSeek-R1 distill-llama-70B | $0.99 per 1,000,000 input tokens $0.99 per 1,000,000 output tokens |
Llama 3.1 8B | $0.198 per 1,000,000 input tokens $0.198 per 1,000,000 output tokens |
Llama 3.3 70B | $0.65 per 1,000,000 input tokens $0.65 per 1,000,000 output tokens |
Mistral NeMo | $0.30 per 1,000,000 input tokens $0.30 per 1,000,000 output tokens |
OpenAI gpt-oss-120b | $0.10 per 1,000,000 input tokens $0.70 per 1,000,000 output tokens |
OpenAI gpt-oss-20b | $0.05 per 1,000,000 input tokens $0.45 per 1,000,000 output tokens |
Serverless Inference (Direct API Usage)
Serverless inference lets you call models directly through the API without creating an agent.
-
For open source models, pricing matches agent usage.
-
For commercial models, pricing follows the provider’s published rates:
All usage is billed per input and output token.
Commercial Models
When using commercial models, billing is handled directly by the provider at the provider’s rates through your API key with them:
Knowledge Bases
Knowledge base pricing includes:
-
Indexing tokens: You are charged for the tokens required to generate embeddings.
For example, a 10 MB dataset takes about 3 million tokens, and a 1 GB dataset takes about 250 million tokens.
Actual costs depend on the embedding model:
Model Price all-mini-lm-l6-v2
$0.009 per 1,000,000 input tokens multi-qa-mpnet-base-dot-v1
$0.009 per 1,000,000 input tokens gte-large-en-v1.5
$0.09 per 1,000,000 input tokens One token is roughly four characters, or about 75 words per 100 tokens. Non-Latin scripts, emojis, or binary data may increase token counts.
-
Storage: Embeddings are stored in OpenSearch. See OpenSearch pricing.
Guardrails
Charges apply for all tokens processed through guardrails:
Guardrail | Price |
---|---|
Content Moderation | $0.20 per 1,000,000 tokens |
Jailbreak Detection | $0.20 per 1,000,000 tokens |
Sensitive Data Detection | $0.34 per 1,000,000 tokens |
Costs are per token. Creating, editing, or duplicating guardrails has no additional cost.
Functions
If you attach DigitalOcean Functions to your agent, you are billed at functions pricing.
Agent Evaluations
Agent evaluations are charged by token usage at the same rates as model usage. If you evaluate with a commercial model, billing is handled directly by the provider.
Log Stream Insights
Log stream insights uses a third-party model to analyze agent trace data. You are charged per token:
Tokens | Price |
---|---|
Input | $4.00 per 1,000,000 tokens |
Output | $20.00 per 1,000,000 tokens |