Identify the right model for your use case by filtering available foundation models by capabilities and price.
Inference
Last verified 22 Jun 2026
Inference provides a single control plane for managing inference workflows. Its Model Catalog lists the available foundation models, both DigitalOcean-hosted and third-party commercial models, so you can compare their capabilities and pricing. You can also use routing to match inference requests to the best-fit model, and run inference using serverless or dedicated deployments.
Test and compare foundation models in the Model Playground.
Send API requests directly to foundation models without creating an AI agent or managing infrastructure.
Deploy open-source and commercial LLMs on dedicated GPUs as an inference endpoint.
Route serverless inference requests to foundation models using rules.
Determine which model best fits your specific use case.
Batch Inference lets you run large collections of LLM requests as a single asynchronous job.
Build fully managed AI agents with knowledge bases for retrieval-augmented generation, multi-agent routing, guardrails, and more.
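To make the rule-based routing entry above more concrete, here is a minimal Python sketch of first-match routing. It is an illustration only and does not use DigitalOcean's routing API. The `Request`, `Rule`, and `route` names, the `task` label, and every model identifier are hypothetical placeholders.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Request:
    prompt: str
    task: str  # hypothetical label attached by the caller, e.g. "chat" or "code"


@dataclass
class Rule:
    name: str
    matches: Callable[[Request], bool]  # predicate deciding whether the rule applies
    model: str  # placeholder identifier, not a real catalog entry


def route(request: Request, rules: List[Rule], default_model: str) -> str:
    """Return the model of the first matching rule, or the default model."""
    for rule in rules:
        if rule.matches(request):
            return rule.model
    return default_model


# Hypothetical rule set: rules are checked in order, so earlier rules win.
RULES = [
    Rule("code-tasks", lambda r: r.task == "code", "example-code-model"),
    Rule("long-prompts", lambda r: len(r.prompt) > 2000, "example-long-context-model"),
]

if __name__ == "__main__":
    print(route(Request("Write a sort function", "code"), RULES, "example-default-model"))
```

Evaluating rules in order keeps the behavior easy to reason about: the first match decides the model, and anything unmatched falls through to the default.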
Latest Updates
30 June 2026
- Model Evaluations has been renamed to DigitalOcean Evaluations.
- Presets are now available for DigitalOcean Evaluations. You can save and reuse evaluation configurations, including the candidate model, system prompt, hyperparameters, judge model, and metrics.
- DigitalOcean Evaluations is now generally available. Use Evaluations to create test cases, run evaluation datasets, and measure model performance against selected metrics.
- Custom metrics are now available for DigitalOcean Evaluations. You can define your own metrics to evaluate model behavior against criteria specific to your use case.
- The Agent Evaluations MCP server tool has been renamed to Evaluations.
- Insights, agent tracing, and conversation logs are deprecated for all agents, including agents created through the Control Panel, CLI, API, and Agent Development Kit (ADK). To monitor deployed agent behavior, use Agent Metrics and Runtime Logs instead.
- Agent evaluations support for the Agent Development Kit (ADK), previously in preview, has been removed. To evaluate agents, use agent evaluations in the DigitalOcean Control Panel for supported agent types. To monitor ADK agent behavior, use Agent Metrics and Runtime Logs.
- The following agent evaluation metrics are deprecated and should no longer be used:
  - Tone
  - Retrieved Chunk Usage
  - Prompt Perplexity

  Use the currently supported metrics listed in Agent Evaluation Metrics instead. To monitor deployed agent behavior outside of evaluations, use Agent Metrics and Runtime Logs.
29 June 2026
- Serverless Inference now requires a positive prepaid account balance before you can send inference requests. Usage charges are deducted from this balance, and access is suspended if it reaches $0. You can add a prepayment manually or enable auto-reload to replenish your balance automatically. For more information, see Manage Serverless Inference Prepayment.
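The prepayment behavior described above can be pictured with a small Python model. This is a rough sketch under stated assumptions, not DigitalOcean's billing logic: the `PrepaidAccount` class, its fields, the clamping of the balance at zero, and the choice to trigger auto-reload exactly when the balance reaches zero are all hypothetical.

```python
from dataclasses import dataclass
from decimal import Decimal

ZERO = Decimal("0")


@dataclass
class PrepaidAccount:
    balance: Decimal
    auto_reload: bool = False
    reload_amount: Decimal = ZERO  # hypothetical top-up size

    def can_send(self) -> bool:
        # Requests are allowed only while the balance is positive.
        return self.balance > ZERO

    def charge(self, cost: Decimal) -> None:
        """Deduct a usage charge, refusing it if access is suspended."""
        if not self.can_send():
            raise PermissionError("access suspended: balance is not positive")
        # Assumption: the balance is clamped at zero rather than going negative.
        self.balance = max(self.balance - cost, ZERO)
        # Assumption: auto-reload tops up as soon as the balance hits zero.
        if self.balance == ZERO and self.auto_reload and self.reload_amount > ZERO:
            self.balance += self.reload_amount


if __name__ == "__main__":
    account = PrepaidAccount(balance=Decimal("1.00"))
    account.charge(Decimal("0.40"))
    print(account.balance, account.can_send())
```

`Decimal` is used instead of `float` so that currency amounts are not subject to binary rounding error.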
24 June 2026
- The following Z.ai model is now available on DigitalOcean Inference for serverless inference, dedicated inference, Agent Development Kit, and agents:
  For more information, see the Available Models page.
For more information, see the full release notes.