You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: agent-schema.json
+1-1Lines changed: 1 addition & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -634,7 +634,7 @@
634
634
},
635
635
"provider_opts": {
636
636
"type": "object",
637
-
"description": "Provider-specific options. Sampling parameters: top_k (integer, supported by anthropic, google, amazon-bedrock, and custom OpenAI-compatible providers like vLLM/Ollama), repetition_penalty (float, forwarded to custom OpenAI-compatible providers), min_p (float, forwarded to custom providers), seed (integer, forwarded to OpenAI). Infrastructure options: dmr: runtime_flags. anthropic/amazon-bedrock (Claude): interleaved_thinking (boolean, default true). openai: transport ('sse' or 'websocket') to choose between SSE and WebSocket streaming for the Responses API. openai/anthropic/google: rerank_prompt (string) to fully override the system prompt used for RAG reranking (advanced - prefer using results.reranking.criteria for domain-specific guidance). Google: google_search (boolean) enables Google Search grounding, google_maps (boolean) enables Google Maps grounding, code_execution (boolean) enables server-side code execution.",
637
+
"description": "Provider-specific options. Sampling parameters: top_k (integer, supported by anthropic, google, amazon-bedrock, and custom OpenAI-compatible providers like vLLM/Ollama), repetition_penalty (float, forwarded to custom OpenAI-compatible providers), min_p (float, forwarded to custom providers), seed (integer, forwarded to OpenAI). Infrastructure options: dmr: runtime_flags. anthropic/amazon-bedrock (Claude): interleaved_thinking (boolean, default true), thinking_display ('summarized', 'omitted', or 'display') controls whether thinking blocks are returned in responses when thinking is enabled. Claude Opus 4.7 hides thinking by default ('omitted'); set thinking_display: summarized (or thinking_display: display) to receive thinking blocks. openai: transport ('sse' or 'websocket') to choose between SSE and WebSocket streaming for the Responses API. openai/anthropic/google: rerank_prompt (string) to fully override the system prompt used for RAG reranking (advanced - prefer using results.reranking.criteria for domain-specific guidance). Google: google_search (boolean) enables Google Search grounding, google_maps (boolean) enables Google Maps grounding, code_execution (boolean) enables server-side code execution.",
Copy file name to clipboardExpand all lines: docs/configuration/models/index.md
+16Lines changed: 16 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -178,6 +178,22 @@ models:
178
178
interleaved_thinking: false # disable if needed
179
179
```
180
180
181
+
## Thinking Display (Anthropic)
182
+
183
+
For Anthropic Claude models, `thinking_display` controls whether thinking blocks are returned in responses when thinking is enabled. Claude Opus 4.7 hides thinking content by default (`omitted`); set this provider option to receive summarized thinking:
184
+
185
+
```yaml
186
+
models:
187
+
opus-4-7:
188
+
provider: anthropic
189
+
model: claude-opus-4-7
190
+
thinking_budget: adaptive
191
+
provider_opts:
192
+
thinking_display: summarized # "summarized", "display", or "omitted"
193
+
```
194
+
195
+
See the [Anthropic provider page](/providers/anthropic/#thinking-display) for details.
Copy file name to clipboardExpand all lines: docs/providers/anthropic/index.md
+22Lines changed: 22 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -106,6 +106,28 @@ Object form (forward-compatible with future budget types):
106
106
107
107
See the full schema on the [Model Configuration]({{ '/configuration/models/#task-budget' | relative_url }}) page.
108
108
109
+
## Thinking Display
110
+
111
+
Controls whether thinking blocks are returned in responses when thinking is enabled. Claude Opus 4.7 hides thinking content by default (`omitted`); earlier Claude 4 models default to `summarized`. Set `thinking_display` in `provider_opts` to override:
112
+
113
+
```yaml
114
+
models:
115
+
claude-opus-4-7:
116
+
provider: anthropic
117
+
model: claude-opus-4-7
118
+
thinking_budget: adaptive
119
+
provider_opts:
120
+
thinking_display: summarized # "summarized", "display", or "omitted"
121
+
```
122
+
123
+
Valid values:
124
+
125
+
- `summarized`: thinking blocks are returned with summarized thinking text (default for Claude 4 models prior to Opus 4.7).
126
+
- `display`: thinking blocks are returned for display (use this to re-enable thinking output on Opus 4.7).
127
+
- `omitted`: thinking blocks are returned with an empty thinking field; the signature is still returned for multi-turn continuity (default for Opus 4.7). Useful to reduce time-to-first-text-token when streaming.
128
+
129
+
Note: `thinking_display`applies to both `thinking_budget` with token counts and adaptive/effort-based budgets. Full thinking tokens are billed regardless of the `thinking_display` value.
0 commit comments