Identify the right model for your use case by filtering available foundation models by capabilities and price.
Inference
Validated on 28 Apr 2026 • Last edited on 11 May 2026
Inference provides a single control plane for managing inference workflows. It includes a Model Catalog where you can view the available foundation models, both DigitalOcean-hosted and third-party commercial, and compare their capabilities and pricing. You can also use routing to match inference requests to the best-fit model, and run inference using serverless or dedicated deployments.
Test and compare foundation models in the Model Playground.
Send API requests directly to foundation models without creating an AI agent or managing infrastructure.
Deploy open-source and commercial LLMs on dedicated GPUs as inference endpoints.
Route serverless inference requests to foundation models using rules.
Determine which model best fits your specific use case.
Batch Inference lets you run large collections of LLM requests as a single asynchronous job.
Build fully managed AI agents with knowledge bases for retrieval-augmented generation, multi-agent routing, guardrails, and more.
Latest Updates
27 May 2026
The following DeepSeek model is now available on DigitalOcean Inference for serverless inference, dedicated inference, Agent Development Kit, and agents:
For more information, see the Available Models page.
5 May 2026
The following Moonshot AI model is now available on DigitalOcean Inference for serverless inference, Agent Development Kit, and agents:
For more information, see the Available Models page.
1 May 2026
The following DeepSeek model is now available on DigitalOcean Inference for serverless inference, Agent Development Kit, and agents:
For more information, see the Available Models page.
For more information, see the full release notes.