Create a message
/v1/messages specification, routed through the AIsa gateway at https://api.aisa.one/v1/messages.
Use this endpoint when you want to call Claude models (claude-opus-4-7, claude-sonnet-4-6, claude-haiku-4-5-20251001, etc.) with Anthropic’s native request format — including extended thinking, tool use, and prompt caching. If you prefer OpenAI-style chat completions, the same Claude models are also available via the OpenAI Chat endpoint.
Authentication uses your AIsa API key as a Bearer token. See the model catalog for the full list of supported Claude variants and context windows, and pricing for per-token rates.Your AIsa API key as a Bearer token.
Claude model identifier. Examples: claude-opus-4-7, claude-sonnet-4-6, claude-haiku-4-5-20251001.
"claude-sonnet-4-6"
Maximum number of tokens to generate before stopping.
1024
Conversation turns.
[
{
"role": "user",
"content": "Hello, Claude"
}
]System prompt. String or array of text blocks.
Randomness. 0 = deterministic, 1 = creative.
0 <= x <= 1Nucleus sampling. Use instead of temperature.
Only sample from the top K options for each token.
Custom sequences that stop generation.
Enable SSE streaming.
Tools the model may use.
How the model uses tools.
Enable extended thinking for complex reasoning.
auto, standard_only Successful response
"msg_024e7cf42f5d47cfa6982b5ff8b55642"
"message"
"assistant"
Response content blocks (text, tool_use, thinking, etc.).
"claude-sonnet-4-6"
end_turn, stop_sequence, max_tokens, tool_use