Available Foundation and Embedding Modelspublic

Validated on 7 Feb 2025 • Last edited on 16 Apr 2025

DigitalOcean GenAI Platform lets you build GPU-powered AI agents with fully-managed deployment. Agents can use pre-built or custom foundation models, incorporate function and agent routes, and implement RAG pipelines with knowledge bases.

Foundation Models

A foundation model is a large-scale model pre-trained on a large corpus of data and adaptable to various tasks.

You can experiment with the following models in the Model Playground:

Anthropic Models

Anthropic models use Constitutional AI to guide responses and refine their self-improvement loop, ensuring more reliable and less biased outputs.

We support all Anthropic models, with access determined by your Anthropic API key.

Note
Anthropic does not publish the number of parameters their models use.
Model Parameters Max Tokens Description Use Cases
Claude 3.7 Sonnet Not published 1024 • Handles both small tasks and deeper reasoning
• Follows instructions well and works with text, code, and images
• Generates detailed outputs when needed
• Software development
• Chatbots
• Q&A systems
• Customer support agents
• Content creation
Claude 3.5 Sonnet Not published 1024 • Balanced model for multilingual dialogue and general tasks
• Efficient for business and content generation
• Business workflows
• Content creation
• Coding assistance
Claude 3.5 Haiku Not published 1024 • Optimized for real-time responsiveness
• Suitable for quick, accurate outputs
• Chatbots
• Real-time data extraction
• Content classification
Claude 3 Opus Not published 1024 • Optimized for complex and long-form content
• Excels in reasoning, analysis, and multilingual tasks
• Research
• Strategic analysis
• Advanced problem-solving

DeepSeek Models

DeepSeek models use Chain of Thought (CoT) and reinforcement learning to produce reasoned outputs. Since the model explains its reasoning, it may use significantly more tokens than others.

Warning
If you are using a DeepSeek model in a user-facing agent, we strongly recommend adding all available guardrails for a safe conversational experience.
Model Version Parameters Max Tokens Description Use Cases
DeepSeek-R1 distill-llama-70B model 70B 8K • Strong Mixture-of-Experts (MoE) language model
• Excels in reasoning, analysis, and multilingual tasks
• Chatbots
• Content creation
• Coding assistance

Meta Models

Meta models are open-source, multilingual, and lightweight models that provide a balance between performance and efficiency.

Model Version Parameters Max Tokens Description Use Cases
Llama 3.3 Instruct-70B 70B 2048 • Improved instruction-following and reasoning capabilities
• Optimized for multilingual and long-context tasks
• Complex dialogue systems
• Long-form content generation
Llama 3.1 Instruct-70B 70B 2048 • Handles multilingual dialogue and content generation
• Trained for human-like responses in multiple languages
• Translation
• Summarization
• Data analysis
Llama 3.1 Instruct-8B 8B 512 • Handles multilingual dialogue and instruction-following tasks
• Supports multiple languages
• Chatbots
• Translation
• Natural language generation

Mistral Models

Mistral models focus on making advanced LLM capabilities accessible with fewer parameters, aiming for faster inference and lower computational costs, while maintaining competitive quality.

Model Version Parameters Max Tokens Description Use Cases
Mistral NeMo 12B 512 • Handles multilingual applications, coding, and reasoning tasks
• Processes large and complex documents
• Supports multi-turn conversations
• Problem-solving
• Advanced coding tasks
• Instruction-based interactions

OpenAI Models

OpenAI models provide reasoning and versatility, making them suitable for a wide range of tasks, including code generation, summarization, and content creation.

We support the following OpenAI models, with access determined by your OpenAI key.

Note
OpenAI does not publish the number of parameters their models use.
Model Version Parameters Max Tokens Description Use Cases
OpenAI GPT-4o Not published 16384 • Advanced reasoning
• High computational power
• Accepts both text and image inputs
• Creative projects
• Advanced chatbots
• Content creation
• Code generation
OpenAI GPT-4o mini Not published 16384 • Compact and resource efficient
• Good balance between performance and cost
• Basic chatbots
• Basic content creation
• Summarization
• Content classification
OpenAI o1 Not published 100k • Trained with reinforcement learning
• Advanced reasoning
• Uses chain of thought
• Complex tasks
• Advanced problem-solving
• Content creation
• Code generation
OpenAI 03-mini Not published 100k • Good balance of reasoning and efficiency
• Optimized for speed
• Straight forward answer responses
• Code generation

Embedding Models

An embedding model converts data into vector embeddings, which are stored in an OpenSearch database cluster.

You can use these embedding models to generate embeddings for your knowledge base:

Alibaba Group

Alibaba’s models integrate with its cloud services and e-commerce platforms, focusing on business applications like customer service chatbots and automated product descriptions.

Model Type Version Parameters Description Use Cases
Text Embeddings Alibaba-NLP/gte-large-en-v1.5 434M • Handles long-form text and tasks with extensive context
• Excels in understanding semantic relationships within content
• Semantic search
• Text summarization
• Cross-lingual applications

SBERT Models

SBERT is an open-source Python library for generating sentence embeddings, and can be used for semantic search, semantic textual similarity, and paraphrase mining.

Model Type Version Parameters Description Use Cases
Sentence-Transformers sentence-transformers/all-MiniLM-L6-v2 22.7M • Processes data quickly with minimal resources • Semantic search
• Clustering
• Information retrieval
Sentence-Transformers sentence-transformers/multi-qa-mpnet-base-dot-v1 109M • Performs well in production environments and scales efficiently • Semantic search
• Information retrieval
• Question-answering systems

For more details, read the GenAI Platform pricing page.

We can't find any results for your search.

Try using different keywords or simplifying your search terms.