OpenAI-compatible embeddings endpoint. Accepts a single string or an
array of strings as input and routes through the gateway’s provider
layer. Returns the standard OpenAI EmbeddingResponse. When the input
is a single string, the response also exposes a flat embedding field
(Prisme.ai convenience extension) in addition to data[0].embedding.
analytics_context is a Prisme.ai extension used to enrich analytics
events.
Governance overlays may reject the call (forwarded as 403 / 429 with
the upstream result.status, e.g. MODEL_NOT_ALLOWED,
INVALID_DIMENSIONS).
Documentation Index
Fetch the complete documentation index at: https://docs.prisme.ai/llms.txt
Use this file to discover all available pages before exploring further.
User session JWT or instance API key (iak_*). Send as
Authorization: Bearer <token>.
Embedding model id from the catalogue.
256Either a single string or an array of strings to embed.
Requested output dimensionality. Must be one of the model's
supported_dimensions when set; otherwise rejected with
INVALID_DIMENSIONS.
Prisme.ai extension. Caller-supplied analytics context.
Successful embedding response.
OpenAI-compatible embeddings response. When input was a single
string, the response also exposes a flat root-level embedding field
(Prisme.ai convenience extension) in addition to data[0].embedding.