LLM Services
Enter JWT access token
List of available models
Unauthorized - Invalid or missing access token
Forbidden - Insufficient LLM scope permissions
Internal server error
Service Unavailable - RedPill API or internal service error
Create Chat Completion
Generates a chat completion using the specified LLM model. Supports both streaming and non-streaming responses.
OpenAI Compatibility
This endpoint is fully compatible with OpenAI's chat completions API, allowing you to use existing OpenAI client libraries by simply changing the base URL.
Streaming Support
Set stream: true to receive Server-Sent Events for real-time response streaming.
Model Access Control
Confidential models (phala/*): Require
llm-confidentialorllm-allscopeNon-confidential models: Require
llm-otherorllm-allscope
Usage Tracking
All requests are automatically tracked for billing and monitoring purposes, including:
Token usage (prompt, completion, total)
Model used and confidential status
Request metadata and timing
Enter JWT access token
Request body for creating a chat completion
ID of the model to use
phala/gpt-oss-120bThe maximum number of tokens to generate
1000Sampling temperature between 0 and 2. Higher values make output more random
1Example: 0.7Nucleus sampling parameter. Alternative to temperature
1Example: 0.9Number of chat completion choices to generate
1Example: 1Whether to stream back partial message deltas
falseExample: falseUp to 4 sequences where the API will stop generating further tokens
["\n","."]Presence penalty between -2.0 and 2.0
0Example: 0Frequency penalty between -2.0 and 2.0
0Example: 0Unique identifier representing your end-user
user-123Random seed for deterministic outputs
42A list of tools the model may call
Controls which (if any) function is called by the model
autoChat completion response (non-streaming)
Bad Request - Invalid request parameters or model access denied
Unauthorized - Invalid or missing access token
Forbidden - Insufficient LLM scope permissions
Internal server error
Service Unavailable - RedPill API or internal service error
Last updated

