Feat: llamacpp /completion attempt
@@ -40,6 +40,7 @@
- consider switching to llamacpp's /completion endpoint, since the openai endpoint clearly has template/format issues;
- change temp, min-p and other sampling params from the tui;
- DRY;
- keybind to switch between openai and llamacpp endpoints;
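The /completion idea in the list above could start roughly like this; a minimal Python sketch, assuming a running llama.cpp server (field names such as `min_p`, `n_predict` and `stop` follow the server's JSON API; `base_url`, the helper names and the default values are placeholders, not the project's actual code):

```python
import json
import urllib.request


def build_completion_payload(prompt, temperature=0.8, min_p=0.05,
                             n_predict=256, stop=None):
    # Raw-completion request body; unlike the openai-style chat endpoint,
    # no chat template is applied server-side, so the prompt is sent as-is.
    return {
        "prompt": prompt,
        "temperature": temperature,
        "min_p": min_p,
        "n_predict": n_predict,
        "stop": stop or [],
    }


def complete(base_url, payload):
    # POST the payload to llama.cpp's /completion endpoint and return the
    # generated text (the server replies with a JSON object holding "content").
    req = urllib.request.Request(
        base_url.rstrip("/") + "/completion",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["content"]
```

Because the payload builder is a plain function, the "change temp, min-p from tui" item reduces to wiring TUI widgets to its keyword arguments.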
### FIX:
- a bot response (or hang) blocks everything else; +
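One common way to stop a response (or hang) from blocking the UI is to run the request on a worker thread and let the TUI poll a queue each frame; a generic sketch with stdlib only (the helper name and usage are illustrative, not taken from the project):

```python
import queue
import threading


def request_in_background(fn, *args):
    # Run a blocking call (e.g. the completion request) off the UI thread.
    # The TUI polls the returned queue with get_nowait() each frame instead
    # of waiting on the network, so a hung request no longer freezes input.
    out = queue.Queue()

    def worker():
        try:
            out.put(("ok", fn(*args)))
        except Exception as exc:
            # Surface errors to the UI loop rather than dying silently.
            out.put(("err", exc))

    threading.Thread(target=worker, daemon=True).start()
    return out
```

The daemon flag lets the process exit even if a request is still hanging.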
@@ -63,3 +64,4 @@
- number of sentences in a batch should depend on the number of words in it. +
- F1 can load any chat; loading a chat of another agent does not switch agents, and if that chat is continued it will overwrite the agent in the db (either allow only chats from the current agent OR switch the agent on chat load); +
- after a chat is deleted: load an undeleted chat; +
- name split for llamacpp completion: the user msg should end with 'bot_name:';
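The name-split item above amounts to building the raw prompt so it ends with the bot's speaker tag, priming the model to continue as the bot; a small sketch under those assumptions (function and speaker names are placeholders):

```python
def build_prompt(history, bot_name="Bot"):
    # history: list of (speaker, text) pairs in chat order.
    # The prompt must END with "bot_name:" (no trailing newline or text),
    # so the model's completion starts right after the bot's tag; the user's
    # speaker tag makes a natural stop sequence for the /completion request.
    lines = [f"{speaker}: {text}" for speaker, text in history]
    lines.append(f"{bot_name}:")
    return "\n".join(lines)
```

Passing `stop=["User:"]` (or the actual user name) alongside this prompt keeps the model from speaking for the user.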