innercontext/backend
Piotr Oleszczyk 3fbf6d7041 fix(backend): drop response_mime_type=application/json to avoid constrained decoding
Constrained decoding is ~10x slower and consumes hidden tokens for constraint
processing, causing truncation at ~1000 chars even with 8192 max_output_tokens.
The system prompt already instructs the model to output raw minified JSON; our
NaN/markdown-fence sanitisation handles edge cases.
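
For illustration only, the kind of NaN/markdown-fence sanitisation mentioned above might look like the sketch below. The repository's actual sanitiser is not shown on this page, so the helper name `parse_model_json` and the regular expression are assumptions, not the project's code. The sketch strips an optional surrounding markdown fence and maps the non-standard constants NaN, Infinity and -Infinity to None using the standard library's `parse_constant` hook.

```python
import json
import re

# Hypothetical helper, not taken from the repository.
# Matches a reply wrapped entirely in a markdown fence, e.g. ```json ... ```
_FENCE_RE = re.compile(r"^```[\w-]*[ \t]*\n(.*?)\n?```$", re.DOTALL)


def parse_model_json(raw: str):
    """Parse JSON from a model reply, tolerating fences and NaN-like constants."""
    text = raw.strip()
    match = _FENCE_RE.match(text)
    if match:
        # Keep only the payload between the opening and closing fence.
        text = match.group(1).strip()
    # json.loads calls parse_constant for 'NaN', 'Infinity' and '-Infinity';
    # returning None turns each of them into a JSON-null equivalent.
    return json.loads(text, parse_constant=lambda _name: None)
```

Using `parse_constant` rather than a textual replace means the string "NaN" inside a JSON string value is left untouched.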

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-28 22:03:49 +01:00
Name | Last commit | Date
alembic | feat(backend): add Alembic migrations | 2026-02-28 20:14:57 +01:00
innercontext | fix(backend): drop response_mime_type=application/json to avoid constrained decoding | 2026-02-28 22:03:49 +01:00
tests | feat(mcp): add FastMCP server with 14 tools for LLM agent access | 2026-02-28 17:59:11 +01:00
.env.example | fix: load .env via python-dotenv; SQLite default for local dev | 2026-02-26 20:51:13 +01:00
.python-version | Initial commit: backend API, data models, and test suite | 2026-02-26 15:10:24 +01:00
alembic.ini | feat(backend): add Alembic migrations | 2026-02-28 20:14:57 +01:00
db.py | Initial commit: backend API, data models, and test suite | 2026-02-26 15:10:24 +01:00
main.py | feat(mcp): add FastMCP server with 14 tools for LLM agent access | 2026-02-28 17:59:11 +01:00
pyproject.toml | feat(backend): add Alembic migrations | 2026-02-28 20:14:57 +01:00
README.md | Initial commit: backend API, data models, and test suite | 2026-02-26 15:10:24 +01:00
skincare.yaml | Initial commit: backend API, data models, and test suite | 2026-02-26 15:10:24 +01:00
uv.lock | feat(backend): add Alembic migrations | 2026-02-28 20:14:57 +01:00

See the root README for setup and usage instructions.