gemini-flash-latest resolves to gemini-3-flash-preview, which uses thinking_level instead of the legacy thinking_budget; sending both in the same request returns HTTP 400. Use thinking_level LOW to reduce thinking overhead while keeping basic reasoning. This replaces thinking_budget=0, which is no longer compatible.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

| Name | | |
|---|---|---|
| api | ||
| models | ||
| llm.py | ||
| mcp_server.py | ||
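
The commit message above says that thinking_level and thinking_budget cannot be sent together. A minimal sketch of a client-side guard for that rule follows. The helper name `build_thinking_config` and the plain-dict shape are assumptions made for illustration; they are not taken from llm.py or from any SDK, and the real request format may differ.

```python
from typing import Any, Dict, Optional


def build_thinking_config(
    thinking_level: Optional[str] = None,
    thinking_budget: Optional[int] = None,
) -> Dict[str, Any]:
    """Build a thinking-config dict, refusing to mix the two settings.

    Hypothetical helper: it only mirrors the rule stated in the commit
    message (both fields in one request are rejected with HTTP 400).
    """
    if thinking_level is not None and thinking_budget is not None:
        # Fail locally instead of waiting for the API to return HTTP 400.
        raise ValueError(
            "thinking_level and thinking_budget are mutually exclusive"
        )

    config: Dict[str, Any] = {}
    if thinking_level is not None:
        config["thinking_level"] = thinking_level
    if thinking_budget is not None:
        config["thinking_budget"] = thinking_budget
    return config


# New-style setting described in the commit: low thinking overhead.
low_config = build_thinking_config(thinking_level="LOW")
```

Raising locally makes the misconfiguration visible at the call site rather than as a rejected request.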