Jeff Emmett 80b398643e Drastically reduce prompt size for CPU inference speed
- Cut context to 512 tokens, max output to 128
- Only 2 retrieval chunks of 150 chars each (no headers)
- Keep only last 2 conversation messages
- Minimized system prompt overhead

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-17 01:47:06 -07:00
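
The limits listed in the commit message above can be sketched as a small prompt builder. This is a minimal illustration, not the code in `app`: the function name `build_prompt`, the constant names, the placeholder system prompt, and the message shape (dicts with `role` and `content`) are all assumptions. It also assumes the history does not yet contain the current question. Only the numeric limits come from the commit message.

```python
# Hypothetical sketch of the prompt trimming described in the commit message.
# Only the numeric limits are taken from the commit; every name is an assumption.

MAX_CONTEXT_TOKENS = 512      # context window passed to the model
MAX_OUTPUT_TOKENS = 128       # cap on generated tokens
MAX_CHUNKS = 2                # retrieval chunks kept per query
MAX_CHUNK_CHARS = 150         # characters kept from each chunk
MAX_HISTORY_MESSAGES = 2      # most recent conversation messages kept

# Placeholder text; the real system prompt is not shown in the source.
SYSTEM_PROMPT = "Answer briefly using the context."


def build_prompt(question, chunks, history):
    """Assemble a compact prompt from the question, retrieved chunks, and history."""
    # Keep the first two chunks, truncated to 150 characters, with no headers.
    context = "\n".join(chunk[:MAX_CHUNK_CHARS] for chunk in chunks[:MAX_CHUNKS])

    # Keep only the last two conversation messages.
    recent = history[-MAX_HISTORY_MESSAGES:]
    turns = "\n".join(f"{m['role']}: {m['content']}" for m in recent)

    # Skip empty sections so no blank lines are added to the prompt.
    parts = [SYSTEM_PROMPT, context, turns, f"user: {question}", "assistant:"]
    return "\n".join(part for part in parts if part)
```

`MAX_CONTEXT_TOKENS` and `MAX_OUTPUT_TOKENS` are not used by the builder itself; they would be passed to whatever inference backend the app uses, which the listing does not show.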
app Drastically reduce prompt size for CPU inference speed 2026-02-17 01:47:06 -07:00
backlog Initialize backlog and record deployment setup 2026-02-16 18:51:00 -07:00
.env.example Initial commit: Erowid conversational bot 2026-02-17 01:19:49 +00:00
.gitignore Initial commit: Erowid conversational bot 2026-02-17 01:19:49 +00:00
Dockerfile Initial commit: Erowid conversational bot 2026-02-17 01:19:49 +00:00
docker-compose.yml Update Traefik host to erowid.psilo-cyber.net 2026-02-16 18:38:58 -07:00
requirements.txt Initial commit: Erowid conversational bot 2026-02-17 01:19:49 +00:00