Llama 4 Scout
OpenAI-compatible chat completions endpoint for Llama-4 Scout 17B-16E Instruct model.
Headers
Authorization
string · requiredThe
Authorization
header is used to authenticate with the API using your API key. Value is of the formatBearer YOUR_KEY_HERE
.
Request Body
messages
object[] · requiredArray of messages in OpenAI chat format
model
stringModel name (defaults to llama-4-scout)
Default: llama-4-scoutmax_tokens
integerMaximum number of tokens to generate
temperature
numberSampling temperature (0-2)
top_p
numberNucleus sampling parameter
presence_penalty
numberPresence penalty (-2 to 2)
frequency_penalty
numberFrequency penalty (-2 to 2)
stop
string[]Stop sequences
stream
booleanWhether to stream the response
Default: false
Responses
Chat completion response in OpenAI format
id
stringUnique identifier for the completion
object
stringObject type (chat.completion)
created
integerUnix timestamp when the completion was created
model
stringModel used for the completion
choices
object[]usage
object
Kimi K2 Instruct
OpenAI-compatible chat completions endpoint for Kimi K2 Instruct model.
Headers
Authorization
string · requiredThe
Authorization
header is used to authenticate with the API using your API key. Value is of the formatBearer YOUR_KEY_HERE
.
Request Body
messages
object[] · requiredArray of messages in OpenAI chat format
model
stringModel name (defaults to kimi-k2)
Default: kimi-k2max_tokens
integerMaximum number of tokens to generate
Default: 1000temperature
numberSampling temperature (0-2)
Default: 1top_p
numberNucleus sampling parameter
Default: 1presence_penalty
numberPresence penalty (-2 to 2)
Default: 0frequency_penalty
numberFrequency penalty (-2 to 2)
Default: 0stop
string[]Stop sequences
Default: []stream
booleanWhether to stream the response
Default: falsestream_options
objectStreaming options
Responses
Chat completion response in OpenAI format
id
stringUnique identifier for the completion
object
stringObject type (chat.completion)
created
integerUnix timestamp when the completion was created
model
stringModel used for the completion
choices
object[]usage
object