* Feature: add new k-quants q6_K models
* Feature: update llama-cpp-python==0.1.62
* Fix: labels and 7b not use k-quants
* Fix: labels and 7b not use k-quants
* Fix: labels and 7b old one and q6_K
* Fix: labels and 7b old sizes
* Fix: labels and 7b Koala names
---------
Co-authored-by: pabl-o-ce <cye@poscye.com>
* Pin llama-cpp-python to 0.1.54 for GGMLv3 support
* Update to GGMLv3 models
* Reflect current GGMLv3 models
* More readable model names
* Fix file sizes
---------
Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
* Style: Web UI/UX improvements
* Docs: add credits for the svg
* Style: fix in header nav tittle always center
* Style: fix chat padding md:responsive
* Style: fix system chat padding md:responsive
* Style: text-lg to text-base... [for me look better]. (suggestion only).. if you dont like it I remove it
* Style: [suggestion] header title format one line 'Serge: <modelName>' second line '<date>'
* Style: fix color using daisyui semantic color
* Style: fix color user avatar
* Style: data-theme=dark
* Style: colors by compatible with all themes
* Style: colors by compatible with all themes [second and last review]
* Style: svg and colors on download models [replace icons to svg icons]
* Style: input h-auto
* Feature: add render markdown on user and serge
* Style: code tag color and background for any theme
* Style: copy to clipboard button background and overflow-y hidden in chat render parent element
* Style: code block background adaptive to daisy themes
* Style: remove images and add rubik-pixels font
* Style: add rubik-pixels font from fontsource
---------
Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
* Use slim variant for Docker, CI fixes
* Config must be added after installation
* Use Python3.9, updates to CI
* Change min required version of Python
* integrate langchain
get rid of mongodb
use llama-cpp-python bindings
* fixed most chat endpoints except posting questions
* Working post endpoint !
* everything works except streaming
* current state
* streaming as is
* got rid of langchain wrapper for calling llm, went back to using bindings directly
* working streaming
* sort chats by time
* cleaned up styling and added back loading indicator
* Add persistence support to redis
* fixed tooltips
* fixed default prompts
* added link to api docs (closes How to use the api #155 )
* begin work on dev environment
* more work on dev image
* working dev + prod images with SPA front-end
* reworked dockerfile
* make CI point to the right action
* Improvements to github actions (#79)
* Improvements to github actions
* Change username to repo owner username
* Add fix for login into ghcr (#81)
* Update bug_report.yml
* added dev instructions to readme
* reduced number of steps in dockerfile
---------
Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
* Initial Kubernetes support
* Adding manifest.yaml
* removed custom config & added placeholder variables
* Adding K8S deployment steps on README.md
* adding --host parameter to uvicorn deploy.sh
* changed web/vite.config.ts placeholder variable to service name
* migrated manifest.yaml to the wiki and edited README.md
* initial work on linting & templates
* moved everyone into a nice dockerfile
* move everyone into a single dockerfile
* update sample .env file
* got rid of .env file
* rename db volume to avoid confusion and conflicts with previous version
* added bug report template