Skip to main content
POST
/
v1
/
models
Create a model in the catalogue
curl --request POST \
  --url https://{host}/v2/workspaces/slug:llm-gateway/webhooks/v1/models \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "model_id": "<string>",
  "type": "completion"
}
'
{
  "model_id": "<string>",
  "type": "completion",
  "display": {
    "name": "<string>",
    "brand": "<string>",
    "hidden": true
  },
  "capabilities": {
    "vision": true,
    "audio": true,
    "text": true,
    "image": true
  },
  "limits": {},
  "failover": "<string>",
  "region": "<string>",
  "dimensions": "<unknown>",
  "supported_dimensions": [
    123
  ],
  "metrics": "<unknown>",
  "provider_config": {},
  "pricing": {
    "input_per_1m_tokens": 123,
    "output_per_1m_tokens": 123
  },
  "tags": [
    "<string>"
  ],
  "org_slugs": [
    "<string>"
  ],
  "enabled": true
}

Documentation Index

Fetch the complete documentation index at: https://docs.prisme.ai/llms.txt

Use this file to discover all available pages before exploring further.

Authorizations

Authorization
string
header
required

User session JWT or instance API key (iak_*). Send as Authorization: Bearer <token>.

Body

application/json

Write-shape for POST /v1/models and each item of PUT /v1/models. model_id and type are required.

model_id
string
required
Maximum string length: 128
type
enum<string>
required
Available options:
completion,
embeddings,
image_generation
Maximum string length: 64
display
object
capabilities
object
limits
object
failover
string
Maximum string length: 128
region
string
Maximum string length: 64
dimensions
any
supported_dimensions
any
metrics
any
provider_config
any
pricing
any
tags
any
org_slugs
any
enabled
any

Response

Model created. Returns the inserted document.

Model catalogue document. The catalogue is the source of truth for provider routing, capabilities, pricing, and per-org availability.

model_id
string
required

Stable identifier (matches the value sent in request.model).

Maximum string length: 128
type
enum<string>
required

Model family.

Available options:
completion,
embeddings,
image_generation
Maximum string length: 64
display
object

Display metadata (label, brand, hidden flag).

capabilities
object

Flags advertising what the model supports.

limits
object

Provider-side limits (e.g. context window, max tokens).

failover
string

Optional model_id to route to when the primary fails.

Maximum string length: 128
region
string

Hosting region (free-text).

Maximum string length: 64
dimensions
any

Default embedding dimensionality (embeddings models only).

supported_dimensions
number[]

Allowed values for the request dimensions parameter.

metrics
any

Free-form metrics block (latency, throughput hints, …).

provider_config
object

Provider-specific configuration (batch size, parallelism, …).

pricing
object

Cost configuration used to compute usage.cost.

tags
string[]

Free-form tags used by the search/filter UI.

org_slugs
string[]

When set and non-empty, restricts the model to the listed organizations. Empty / missing means the model is available to all orgs (subject to governance overlays).

enabled
boolean

When false, the model is hidden from routing.