Gemini 2.5 Flash (gemini-flash-latest) enables thinking by default. Thinking tokens count toward max_output_tokens, leaving ~150 tokens for actual JSON output and causing MAX_TOKENS truncation. Disable thinking centrally in call_gemini via ThinkingConfig(thinking_budget=0). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| alembic | ||
| innercontext | ||
| tests | ||
| .env.example | ||
| .python-version | ||
| alembic.ini | ||
| db.py | ||
| main.py | ||
| pyproject.toml | ||
| README.md | ||
| skincare.yaml | ||
| uv.lock | ||
See the root README for setup and usage instructions.