Inference

Validated on 28 Apr 2026 • Last edited on 11 May 2026

Inference provides a single control plane for managing inference workflows. It includes a Model Catalog where you can view available foundation models, including both DigitalOcean-hosted and third-party commercial models, compare model capabilities and pricing, use routing to match inference requests to the best-fit model, and run inference using serverless or dedicated deployments.

Browse Models in Model Catalog

Identify the right model for your use case by filtering available foundation models by capabilities and price.

Use Model Playground

Test and compare foundation models in the Model Playground.

Use Serverless Inference

Send API requests directly to foundation models without creating an AI agent or managing infrastructure.

Deploy to Dedicated Inference Endpoints

Deploy open-source and commercial LLMs on dedicated GPUs as an inference endpoint.

Use Inference Router

Route serverless inference requests to foundation models using rules.

Evaluate Models

Determine which model best fits your specific use case.

Use Batch Inference

Batch Inference lets you run large collections of LLM requests as a single asynchronous job.

Use Agent Platform

Use to build fully-managed AI agents with knowledge bases for retrieval-augmented generation, multi-agent routing, guardrails, and more.

Latest Updates

17 June 2026

  • DigitalOcean Inference supports server-side tools on serverless inference, dedicated inference, and inference routers. You can add the following tools:

    • Web search, web fetch, knowledge base retrieval, and remote MCP server tools to your requests in the Chat Completions and Responses APIs.
    • Provider-native tools such as bash, text editor, computer use, and web fetch for Anthropic models with the Messages API.
    • Function calling and tool search for OpenAI models on the Responses API, and Anthropic models on the Messages API.

    Web search and web fetch are in public preview. For more information, see Use Server-Side Tools.

12 June 2026

10 June 2026

  • We support passthrough tool search on the Messages API for Anthropic models and the Responses API for OpenAI models, enabling deferred loading of tools in agentic workflows. There is no additional cost to using tool search. For more information, see Use Server-Side Tools.

For more information, see the full release notes.

We can't find any results for your search.

Try using different keywords or simplifying your search terms.