Constrained decoding is ~10x slower and consumes hidden tokens for constraint processing, causing truncation at ~1000 chars even with 8192 max_output_tokens. The system prompt already instructs the model to output raw minified JSON; our NaN/markdown-fence sanitisation handles edge cases. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| alembic | ||
| innercontext | ||
| tests | ||
| .env.example | ||
| .python-version | ||
| alembic.ini | ||
| db.py | ||
| main.py | ||
| pyproject.toml | ||
| README.md | ||
| skincare.yaml | ||
| uv.lock | ||
See the root README for setup and usage instructions.