Chat Completion

Authorizations

Authorization

string

header

required

Authenticate using Bearer token. Get your API Key from the WeryAI Console.

Example: Authorization: Bearer sk-xxxxxxxxxxxxxxxx

Body

application/json

model

string

required

Chat model key. Use the /v1/chat/models endpoint to get available models.

Model Name	Model Key
GPT-5.5	GPT_5_5
GPT-5.4	GPT_5_4
Claude-Fable-5	CLAUDE_FABLE_5
Claude-4.8-Opus	CLAUDE_4_8_OPUS
Claude-4.6-Opus	CLAUDE_4_6_OPUS
Gemini-3.5-Flash	GEMINI_3_5_FLASH
Gemini-3.1-Pro	GEMINI_3_1_PRO
GPT-5.1	GPT_5_1
Claude-4.5-Opus	CLAUDE_4_5_OPUS
GPT-5	GPT_5
DeepSeek-R1	DEEPSEEK_R1
Kimi K2 Thinking	KIMI_K2_THINKING
QwQ 32B	QWEN_QWQ_32B
Grok-4	GROK_4
Claude-Sonnet-4.6	CLAUDE_SONNET_4_6
Gemini-3.1-Flash-Lite	GEMINI_3_1_FLASH_LITE
Qwen3.5 Plus	QWEN_3_5_PLUS
GLM 5	GLM_5
Kimi K2.5	KIMI_K2_5
Claude-Sonnet-4.5	CLAUDE_SONNET_4_5
GPT-4o	GPT_4O
GPT-4.1	GPT_4_1
Gemini-2.5-Pro	GEMINI_25_PRO
GLM 4.7 Flash	GLM_4_7_FLASH
Gemini-2.5-Flash	GEMINI_25_FLASH
Seed-2.0-Mini	SEED_2_0_MINI
Claude-4-Opus	CLAUDE_4_OPUS
Claude-4-Sonnet	CLAUDE_4_SONNET

Example:

"GEMINI_25_FLASH"

messages

object[]

required

Message list for the conversation. Supports multi-turn by including history.

Required array length: 1 - 50 elements

Show child attributes

Example:

[
  {
    "role": "user",
    "content": "What is artificial intelligence?"
  }
]

max_tokens

integer

default:1024

Maximum number of tokens to generate. Default 1024. The upper limit depends on the model (use the model list endpoint to check).

Required range: x >= 1

Example:

1024

temperature

number

default:1

Controls randomness of the output. Higher values produce more diverse results.

Required range: 0 <= x <= 2

Example:

1

top_p

number

default:1

Nucleus sampling parameter. Limits cumulative probability of candidate tokens.

Required range: 0 <= x <= 1

Example:

1

presence_penalty

number

Penalizes new topics to reduce repetition

Required range: -2 <= x <= 2

frequency_penalty

number

Penalizes frequent tokens to reduce repetition

Required range: -2 <= x <= 2

seed

integer

Random seed. Same seed with same input produces deterministic results.

integer

default:1

Number of responses to generate

Required range: x >= 1

Example:

1

stream

boolean

default:false

Whether to stream the response

plugins

object[]

Optional plugins for the chat request.

Web Search

Some models support web search. Whether web search takes effect depends on the selected model and upstream provider capabilities. Gemini models are currently integrated with Google Search; for other models, the plugins parameter is passed through and upstream support determines whether it works.

Enable web search with:

{
  "plugins": [
    { "id": "web" }
  ]
}

Notes:

plugins[].id = "web" requests web search.
When Gemini models use web search, response_format.type = "json_schema" cannot be used at the same time.

Show child attributes

Example:

[{ "id": "web" }]

Response

Chat completed successfully

OpenAI-compatible Chat Completion response

string

Unique identifier for the chat completion

Example:

"chatcmpl-abc123def456"

object

string

Object type, always "chat.completion"

Example:

"chat.completion"

created

integer<int64>

Unix timestamp (in seconds) of when the completion was created

Example:

1711929600

model

string

The model used for this completion

Example:

"GEMINI_25_FLASH"

choices

object[]

List of completion choices

Show child attributes

usage

object

Show child attributes