Inference Support

Generated on 18 Jun 2026

Inference provides a single control plane for managing inference workflows. It includes a Model Catalog where you can view available foundation models, including both DigitalOcean-hosted and third-party commercial models, compare model capabilities and pricing, use routing to match inference requests to the best-fit model, and run inference using serverless or dedicated deployments.

What retry or backoff behavior should I follow for 429 responses from serverless inference?

Steps to follow for retry and backoff behavior for HTTP 429 responses from serverless inference.

How do I schedule automatic reindexing for my knowledge bases?

Create a scheduled function to automatically reindex a knowledge base.

We can't find any results for your search.

Try using different keywords or simplifying your search terms.