doctl serverless-inference responses create

Generated on 3 Jun 2026 from doctl version v1.160.1

Usage

doctl serverless-inference responses create [flags]

Description

Creates a response using the Responses API. Use –model and –input for quick requests, or –request for a full JSON body. Use –stream to receive output as it is generated.

Example

doctl inference responses create --model openai-gpt-oss-20b --input "Hello"

Flags

Option Description
--help, -h Help for this command
--input Input text (required unless –request is set)
--instructions Optional instructions
--model, -m Model ID (required unless –request is set)
--request Path to JSON request body. Use “-” for stdin.
--stream Stream using server-sent events
Default: false
Command Description
doctl serverless-inference responses Display commands for creating model responses

Global Flags

Option Description
--access-token, -t API V2 access token
--api-url, -u Override default API endpoint
--config, -c Specify a custom config file
Default:
    --context Specify a custom authentication context name
    --http-retry-max Set maximum number of retries for requests that fail with a 429 or 500-level error
    Default: 5
    --http-retry-wait-max Set the minimum number of seconds to wait before retrying a failed request
    Default: 30
    --http-retry-wait-min Set the maximum number of seconds to wait before retrying a failed request
    Default: 1
    --interactive Enable interactive behavior. Defaults to true if the terminal supports it (default false)
    Default: false
    --output, -o Desired output format [text|json]
    Default: text
    --trace Show a log of network activity while performing a command
    Default: false
    --verbose, -v Enable verbose output
    Default: false

    We can't find any results for your search.

    Try using different keywords or simplifying your search terms.