Model configuration and extended thinking options
The includes the side panel for advanced model settings.
The panel includes these options:
- Max response length: Specify the maximum number of tokens that the model can generate in a response.
- Thinking switch: Enable or disable extended reasoning. The switch is displayed when the selected model supports reasoning, as indicated by the reasoning flag in the
GET/modelsresponse. - Temperature percent: Control the randomness of the model output.
- Top P percent: Control output diversity through nucleus sampling. The field is not available for Claude 4.5 models and later.
After the response is generated, these response metrics are available:
- Response time
- Total tokens