Chunking Best Practices for Knowledge Base Indexing in DigitalOcean Gradient™ AI Platform
Validated on 8 Dec 2025 • Last edited on 17 Dec 2025
DigitalOcean Gradient™ AI Platform lets you build fully managed AI agents with knowledge bases for retrieval-augmented generation, multi-agent routing, guardrails, and more, or use serverless inference to make direct requests to popular foundation models.
Chunking splits your documents into smaller, retrievable units before indexing. The chunking strategy you choose affects retrieval accuracy, indexing cost, and how much context your agent receives during inference. Gradient AI Platform supports several chunking strategies, each configurable per data source.
This guide explains how to choose and tune chunking strategies. For parameter definitions and model-specific ranges and recommendations, see the chunking strategy parameters reference and the embedding model catalog. To understand pricing implications, see the knowledge base pricing page.
General Best Practices
We recommend the following:
- Start with the default chunking settings, which work well for most documents.
- Configure chunking per data source and mix strategies within the same knowledge base.
- Consider indexing and storage costs when choosing a strategy, as different chunking methods consume tokens differently.
Choosing a Chunking Strategy
Chunking strategies differ significantly in indexing and retrieval cost. For example, semantic chunking increases indexing cost, while hierarchical chunking increases retrieval cost because parent and child chunks are returned together. For parameter recommendations, see the parameters reference.
The sections below describe when to use each strategy and how they behave during indexing.
Section-Based Chunking
Uses structural elements such as headings, paragraphs, lists, tables, and callouts as natural boundaries. Adjacent sections are merged or split based on the maximum chunk size (max_chunk_size). Section-based chunking produces predictable, readable chunks.
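To make the merge-and-split behavior concrete, here is a minimal sketch in Python. It is illustrative only, not the platform's implementation: it splits on Markdown headings and uses word counts as a rough stand-in for the token-based max_chunk_size.

```python
import re

def section_chunks(markdown_text: str, max_chunk_size: int = 200) -> list[str]:
    # Split at heading lines ("#", "##", ...), keeping each heading
    # attached to the section content that follows it.
    sections = [s.strip()
                for s in re.split(r"(?m)^(?=#{1,6} )", markdown_text)
                if s.strip()]

    chunks: list[str] = []
    current = ""
    for section in sections:
        candidate = (current + "\n\n" + section).strip() if current else section
        if len(candidate.split()) <= max_chunk_size:
            current = candidate          # merge small adjacent sections
        else:
            if current:
                chunks.append(current)   # flush the merged buffer
            current = section            # an oversized single section would
                                         # be split further in practice
    if current:
        chunks.append(current)
    return chunks
```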
Works best for:
- Product documentation
- Policies and SOPs
- FAQs
- Blogs
- Structured web content
- Markdown files
Choose this strategy if:
- Your document is already structured and has natural boundaries such as headings, paragraphs, lists, or tables.
- You need predictable, readable chunks.
- You want a fast, low-cost option.
- You want a strong baseline for structured content.
For more information, see the section-based chunking reference and the pricing page.
Semantic Chunking
Groups text by meaning using the chosen embedding model. It performs two embedding passes:
- Detects semantic boundaries (semantic_threshold).
- Embeds the final chunks (max_chunk_size).
Semantic chunking produces more semantically aligned chunks, especially for documents without strong formatting.
Use when meaning matters more than formatting.
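The two-pass flow can be sketched as follows. This is a conceptual illustration, not the platform's code: embed is a hypothetical placeholder for your embedding model client, the input is assumed to be a non-empty list of sentences, and vectors are assumed unit-normalized so a dot product gives cosine similarity.

```python
import numpy as np

def embed(texts: list[str]) -> np.ndarray:
    """Hypothetical embedding call: returns one unit-normalized vector per text."""
    raise NotImplementedError("replace with your embedding model client")

def semantic_chunks(sentences: list[str],
                    semantic_threshold: float = 0.75) -> list[str]:
    # Pass 1: embed each sentence and start a new chunk wherever cosine
    # similarity to the previous sentence falls below the threshold,
    # i.e., at a topical shift.
    vectors = embed(sentences)
    chunks, current = [], [sentences[0]]
    for prev_vec, next_vec, sentence in zip(vectors, vectors[1:], sentences[1:]):
        if float(np.dot(prev_vec, next_vec)) < semantic_threshold:
            chunks.append(" ".join(current))
            current = [sentence]
        else:
            current.append(sentence)
    chunks.append(" ".join(current))

    # Pass 2: embed the finished chunks for the index (bounded by
    # max_chunk_size on the platform). The second pass is why indexing
    # costs more than single-pass strategies.
    _index_vectors = embed(chunks)
    return chunks
```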
Works best for:
- Academic writing
- Research notes
- Long-form prose
- Dense or inconsistently structured content
Choose this strategy if:
- You want content grouped by semantic similarity rather than by formatting.
- You need to detect topical shifts even when formatting is poor.
- You need more accurate boundaries that reflect shifts in meaning.
- You can accept higher indexing cost; semantic chunking may increase cost by 1.5 to 3 times compared to other strategies.
For more information, see the semantic chunking reference and the pricing page.
Hierarchical Chunking
Creates a two-level structure consisting of:
- Parent chunks for broad context (parent_chunk_size).
- Child chunks for precise retrieval (child_chunk_size).
When a child chunk is retrieved, the system automatically includes its parent chunk to improve grounding.
Use when both broad context and precise retrieval are required.
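The parent/child relationship can be sketched as a simple data structure. This is illustrative only: sizes are counted in words rather than tokens, and the retrieval helper mirrors the behavior described above, not the platform's internals.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Chunk:
    text: str
    parent: Optional[str] = None  # child chunks keep a reference to parent text

def hierarchical_chunks(tokens: list[str],
                        parent_chunk_size: int = 400,
                        child_chunk_size: int = 100) -> list[Chunk]:
    children: list[Chunk] = []
    for p in range(0, len(tokens), parent_chunk_size):
        parent_tokens = tokens[p:p + parent_chunk_size]
        parent_text = " ".join(parent_tokens)
        # Each parent is subdivided into small, precisely retrievable children.
        for c in range(0, len(parent_tokens), child_chunk_size):
            child_text = " ".join(parent_tokens[c:c + child_chunk_size])
            children.append(Chunk(text=child_text, parent=parent_text))
    return children

def retrieved_context(match: Chunk) -> str:
    # A matched child is returned together with its parent chunk, which is
    # why retrieval consumes more tokens than the child alone.
    return f"{match.parent}\n---\n{match.text}"
```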
Works best for:
- API reference documentation
- Legal contracts
- Product manuals
- Highly structured technical content
- Documents requiring long-context reasoning
Choose this strategy if:
- You need both precise retrieval and broader contextual grounding.
Hierarchical chunking has indexing costs similar to section-based chunking, but retrieval costs are higher because parent and child chunks are returned together.
For more information, see the hierarchical chunking reference and the pricing page.
Fixed-Length Chunking
Splits text strictly by token count, ignoring formatting or meaning. This produces uniform chunk sizes and predictable indexing behavior.
Use when the document has unreliable formatting or when simplicity is preferred.
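Because the rule is purely positional, a sketch is short. Words stand in for model tokens here, and the overlap parameter is shown only for illustration; see the parameters reference for the options the platform actually exposes.

```python
def fixed_length_chunks(text: str, chunk_size: int = 256,
                        overlap: int = 32) -> list[str]:
    # Cut strictly every chunk_size tokens; overlapping windows carry a
    # little context across cut points. Assumes chunk_size > overlap.
    tokens = text.split()
    step = chunk_size - overlap
    return [" ".join(tokens[i:i + chunk_size])
            for i in range(0, len(tokens), step)]
```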
Works best for:
- Logs
- IoT telemetry
- OCR text
- Time-series or streaming text
- Machine-generated content
- Code
- Highly structured or repetitive data
Choose this strategy if:
- You want chunking based solely on token count.
- You can ignore document formatting and semantics.
- You need fast, predictable behavior.
- You are indexing large-scale, unstructured, or repetitive content.
For more information, see the fixed-length chunking reference and the pricing page.
Improve Chunking Performance
Chunking performance depends heavily on document clarity and formatting. Use the following evaluation loop to optimize your configuration:
- Index the data source with the default settings.
- Run an Agent Evaluation to measure retrieval accuracy.
- Inspect key retrieval metrics, such as retrieved context relevance, response-context completeness, context adherence, and retrieved chunk usage. For more details, see the Agent Evaluation metrics reference.
- Modify the chunking strategy or parameters.
- Re-index the data source and repeat as needed.
Re-indexing consumes tokens, so plan adjustments strategically.
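As a hypothetical sketch, the loop might look like the following. reindex and run_agent_evaluation are placeholders for the console or API steps above, not real Gradient SDK calls, and the metric key name is assumed for illustration.

```python
from typing import Optional

def reindex(data_source: str, chunking_config: dict) -> None:
    """Placeholder: update the data source's chunking config and re-index."""
    raise NotImplementedError

def run_agent_evaluation(data_source: str) -> dict:
    """Placeholder: run an Agent Evaluation and return its metrics."""
    raise NotImplementedError

def tune_chunking(data_source: str, configs: list[dict],
                  target: float = 0.8) -> Optional[dict]:
    best_config, best_score = None, float("-inf")
    for config in configs:                 # each pass re-indexes and costs
        reindex(data_source, config)       # tokens, so keep the list short
        metrics = run_agent_evaluation(data_source)
        score = metrics["retrieved_context_relevance"]  # assumed metric key
        if score > best_score:
            best_config, best_score = config, score
        if score >= target:
            break                          # good enough; stop spending tokens
    return best_config
```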