Enable pgvector and Load Embeddings

Validated on 27 Apr 2026 • Last edited on 27 Apr 2026

DigitalOcean Managed PostgreSQL supports vector search through the pgvector and pgvectorscale extensions, which let you store and query vector embeddings alongside relational data in the same managed PostgreSQL engine available under Managed Databases.

This how-to shows you how to enable the pgvector extension on a DigitalOcean Managed PostgreSQL cluster, design a table for embeddings, and ingest vectors from your application, with examples in psql, Python, Node.js, and Go.

Prerequisites

  • A running PostgreSQL cluster on DigitalOcean. The vector extension is supported on PostgreSQL 13 and later.
  • The cluster’s connection string, retrievable from the Overview tab under Connection Details.
  • At least one trusted source (IP address, Droplet, App Platform app, Kubernetes cluster, or tag) added on the Network Access tab.
  • A role allowed to run CREATE EXTENSION in the target database. The doadmin user has this by default.
  • An embedding provider such as DigitalOcean AI Platform, OpenAI, Cohere, or a self-hosted sentence-transformer.

If you have not created a cluster yet, follow the PostgreSQL Quickstart. For connection string retrieval and SSL setup, see Connect to a PostgreSQL Cluster.

Enable the Vector Extension

Connect to the target database with psql. Each database in a cluster has its own extension registry, so enable pgvector in every database where you intend to store vectors.

psql "postgresql://doadmin:PASSWORD@HOST:25060/defaultdb?sslmode=require"
CREATE EXTENSION IF NOT EXISTS vector;
Note
The project is named pgvector, but the extension registers under the name vector. CREATE EXTENSION pgvector; fails with ERROR: could not open extension control file. Always use CREATE EXTENSION vector;.

If you are on a PostgreSQL version that does not support the pgvector release you need, upgrade the cluster in place. On the cluster’s Overview page, scroll to Current Version and click Upgrade Now.

Design Your Table

Choose the vector type that fits your model output and storage budget. The dimension must exactly match the output size of the embedding model you use; mixing dimensions in a single column is not allowed.

  • vector(n): Single-precision floats, four bytes per element. The default choice. Supports dimensions up to 16,000.
  • halfvec(n): Half-precision floats, two bytes per element. Roughly halves storage and index size with a small accuracy trade-off.
  • sparsevec(n): Sparse vectors stored as index/value pairs. Useful for high-dimensional vectors that are mostly zero, such as SPLADE outputs.
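As a rough illustration of the storage trade-off, the per-element sizes above translate into column sizes like this. The sketch below assumes a small fixed per-value header of about 8 bytes (an approximation) and counts only the embedding column; real on-disk size also includes row and page overhead.

```python
# Back-of-envelope size of the embedding column alone, using the
# per-element sizes listed above: 4 bytes for vector, 2 for halfvec.
HEADER_BYTES = 8  # pgvector's per-value header; approximate

def embedding_bytes(dims: int, element_bytes: int) -> int:
    return HEADER_BYTES + dims * element_bytes

rows = 1_000_000
for name, elem in [("vector", 4), ("halfvec", 2)]:
    per_row = embedding_bytes(1536, elem)
    print(f"{name}(1536): {per_row} B/row, ~{per_row * rows / 1e9:.1f} GB per 1M rows")
```

For 1,536-dimension embeddings, switching from vector to halfvec roughly halves both the column and any index built on it, which is often the difference between an index that fits in memory and one that does not.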

Add the columns your application actually needs: identifiers, the original text, metadata used for filtering, and timestamps. Keep the embedding separate from any large payloads you do not need on the hot path.

CREATE TABLE documents (
    id         bigserial PRIMARY KEY,
    source     text         NOT NULL,
    content    text         NOT NULL,
    tenant_id  uuid         NOT NULL,
    embedding  vector(1536) NOT NULL,
    created_at timestamptz  NOT NULL DEFAULT now()
);

CREATE INDEX documents_tenant_created_idx
    ON documents (tenant_id, created_at DESC);

Keep a metadata column such as tenant_id or source so you can scope vector searches with an ordinary WHERE clause. Combining filters with similarity search is one of the main reasons to store vectors in PostgreSQL rather than a separate vector store.
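A scoped similarity query then combines an ordinary WHERE filter with a pgvector distance operator. The sketch below uses psycopg-style named parameters and the `<=>` cosine-distance operator; swap in `<->` (L2 distance) or `<#>` (inner product) to match your embedding model.

```python
# Sketch: similarity search scoped to one tenant. The query vector is
# bound as a parameter like any other value; with register_vector a
# Python list binds directly, otherwise format it as a '[...]' literal.
SCOPED_SEARCH = """
    SELECT id, source, left(content, 60) AS snippet,
           embedding <=> %(query)s AS distance
    FROM   documents
    WHERE  tenant_id = %(tenant)s
    ORDER  BY embedding <=> %(query)s
    LIMIT  %(k)s
"""

def search_params(tenant_id: str, query_vec: list, k: int = 5) -> dict:
    return {"tenant": tenant_id, "query": query_vec, "k": k}

# Usage with a psycopg cursor:
#   cur.execute(SCOPED_SEARCH, search_params(tenant_id, query_vector))
```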

Generate Embeddings in Your Application

Generate embeddings in your ingest pipeline and write them to PostgreSQL. The examples below use OpenAI-compatible APIs, but the shape is the same for any provider: the output is a list of floats that you bind as a parameter to the INSERT statement.

Option A: DigitalOcean AI Platform

DigitalOcean AI Platform hosts embedding models including GTE Large v1.5, Qwen3 Embedding 0.6B, All-MiniLM-L6-v2, and Multi-QA-mpnet-base-dot-v1. Consult the DigitalOcean AI Platform documentation for the current set of models and the API surface available to your application.

Option B: OpenAI or Any OpenAI-Compatible Provider

from openai import OpenAI
import os

client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])
resp = client.embeddings.create(
    model="text-embedding-3-small",
    input=["DigitalOcean Managed PostgreSQL supports pgvector."],
)
vector = resp.data[0].embedding  # list of 1536 floats

Option C: Self-Hosted Sentence-Transformers

from sentence_transformers import SentenceTransformer

model = SentenceTransformer("BAAI/bge-base-en-v1.5")
vector = model.encode("DigitalOcean Managed PostgreSQL supports pgvector").tolist()
Note
DigitalOcean Managed PostgreSQL does not ship HTTP-calling extensions such as pg_net or http, so you cannot call an external embedding API from inside a SQL function. Treat embedding generation as part of your ingest pipeline.
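In practice this means your ingest code calls the embedding provider first and hands the result to the INSERT, as in this sketch. The `embed` callable is a stand-in for whichever provider you chose above.

```python
# Sketch: embedding generation as an explicit ingest step, since the
# database cannot call the embedding API itself. `embed` stands in
# for any provider that maps text to a list of floats.
from typing import Callable, List

def ingest(cur, embed: Callable[[str], List[float]],
           source: str, tenant_id: str, content: str) -> None:
    cur.execute(
        """
        INSERT INTO documents (source, tenant_id, content, embedding)
        VALUES (%s, %s, %s, %s)
        """,
        (source, tenant_id, content, embed(content)),
    )
```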

Insert Embeddings

pgvector accepts a vector as a string of the form [v1, v2, ..., vn] bound as a text or array parameter. The canonical drivers for each language handle this automatically when you register the pgvector adapter.
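If your driver has no pgvector adapter, you can format the literal yourself; a minimal sketch:

```python
# Minimal fallback when no pgvector adapter is available: format the
# embedding as the '[v1,v2,...]' text literal that pgvector parses,
# and bind it as an ordinary string parameter.
def to_vector_literal(values: list) -> str:
    return "[" + ",".join(repr(float(v)) for v in values) + "]"

# Bound as e.g.  cur.execute("... VALUES (%s::vector)", (to_vector_literal(vec),))
print(to_vector_literal([0.013, -0.024, 0.117]))  # [0.013,-0.024,0.117]
```

The adapter-based examples below are preferable when available, since they also decode vectors returned by SELECT.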

psql

For ad-hoc loads, supply the vector literal directly:

INSERT INTO documents (source, tenant_id, content, embedding) VALUES (
    'docs',
    '11111111-1111-1111-1111-111111111111',
    'DigitalOcean Managed PostgreSQL supports pgvector.',
    '[0.013, -0.024, 0.117, ...]'::vector
);

Python (psycopg 3)

import os
import psycopg
from pgvector.psycopg import register_vector

with psycopg.connect(os.environ["DATABASE_URL"]) as conn:
    register_vector(conn)
    with conn.cursor() as cur:
        cur.execute(
            """
            INSERT INTO documents (source, tenant_id, content, embedding)
            VALUES (%s, %s, %s, %s)
            """,
            ("docs", tenant_id, content, vector),
        )
    conn.commit()

Node.js (pg plus pgvector/pg)

import pg from "pg";
import pgvector from "pgvector/pg";

const pool = new pg.Pool({
  connectionString: process.env.DATABASE_URL,
  ssl: { rejectUnauthorized: true },
});

pool.on("connect", async (client) => {
  await pgvector.registerTypes(client);
});

await pool.query(
  `INSERT INTO documents (source, tenant_id, content, embedding)
   VALUES ($1, $2, $3, $4)`,
  ["docs", tenantId, content, pgvector.toSql(vector)],
);

Go (pgx plus pgvector-go)

import (
    "context"
    "log"
    "os"

    "github.com/jackc/pgx/v5"
    "github.com/jackc/pgx/v5/pgxpool"
    "github.com/pgvector/pgvector-go"
    pgxvec "github.com/pgvector/pgvector-go/pgx"
)

ctx := context.Background()

cfg, err := pgxpool.ParseConfig(os.Getenv("DATABASE_URL"))
if err != nil {
    log.Fatal(err)
}
cfg.AfterConnect = func(ctx context.Context, conn *pgx.Conn) error {
    return pgxvec.RegisterTypes(ctx, conn)
}
pool, err := pgxpool.NewWithConfig(ctx, cfg)
if err != nil {
    log.Fatal(err)
}

_, err = pool.Exec(ctx, `
    INSERT INTO documents (source, tenant_id, content, embedding)
    VALUES ($1, $2, $3, $4)`,
    "docs", tenantID, content, pgvector.NewVector(vec),
)

For bulk ingest, prefer COPY or multi-row INSERT inside a single transaction. Batch a few hundred to a few thousand rows per round trip to cut network overhead. Create the HNSW index after the initial load; building an index on a populated table is much faster than maintaining it during a bulk insert.
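Batching the rows before the write is a small pure function; a sketch, to pair with psycopg 3's executemany() or its COPY support:

```python
# Sketch: split an iterable of rows into fixed-size batches so each
# round trip to the database carries many rows instead of one.
from itertools import islice
from typing import Iterable, Iterator, List

def chunked(rows: Iterable, size: int = 1000) -> Iterator[List]:
    it = iter(rows)
    while batch := list(islice(it, size)):
        yield batch

# Usage with the psycopg example above:
#   for batch in chunked(generate_rows(), 1000):
#       cur.executemany(INSERT_SQL, batch)

print([len(b) for b in chunked(range(2500), 1000)])  # [1000, 1000, 500]
```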

Verify and Inspect

-- Extension status
\dx vector

-- Sample a few rows (strip the embedding for readability)
SELECT id, source, tenant_id, left(content, 60) AS snippet
FROM   documents
ORDER  BY created_at DESC
LIMIT  5;

-- Confirm dimension matches the table definition
SELECT vector_dims(embedding) AS dims
FROM   documents
LIMIT  1;

If vector_dims returns a value that does not match your model's output size, resolve it before indexing. Declaring the column with an explicit dimension, such as vector(1536), makes PostgreSQL reject mismatched inserts at write time; with an undimensioned vector column, mixed sizes can accumulate silently and only surface when you build an index or compare vectors.
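It also helps to guard the dimension in the ingest pipeline itself, so a model change surfaces as a clear application error rather than a database one; a sketch:

```python
# Sketch: validate the embedding dimension before any write is
# attempted. EXPECTED_DIMS must match the vector(n) column definition.
EXPECTED_DIMS = 1536

def check_dims(embedding: list, expected: int = EXPECTED_DIMS) -> list:
    if len(embedding) != expected:
        raise ValueError(
            f"expected {expected}-dim embedding, got {len(embedding)}"
        )
    return embedding
```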
