24 Commits

Author SHA1 Message Date
Jon Pro
80833600eb
Add support for User Management (#1313)
* Add users - checkpoint

* Working

* Finish workflow

* switch to cookies instead of bearer token

* Fix settings not saved

* Add sqlite

* Cleanup

* Get db url from env var

* PR Prep

* Maybe later

* Code review

* Update poetry and readme

* Things for status checks

* Add chats table to DB

* Bad merge

* Personal code review

* Add default user called system

* Fix poetry

* fix status checks

* Formatting

* Fix deleting a chat

* Fix bug when auth times out but user isn't logged out

* Code review comments

* fix lockfile

---------

Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2024-07-27 14:48:53 -04:00
Olivier DEBAUCHE
1cc58cd4b1
Bump llama-cpp-python to v0.2.69 (#1266)
* Update serge.env

* Update dev.sh

* Update deploy.sh

* Update dev.sh

* Update dev.sh

* Update deploy.sh

* Bump LLaMA CPP Python to 0.2.68

* Update dev.sh

* Update deploy.sh

* Update deploy.sh

* Update dev.sh

* Update dev.sh

* Update deploy.sh

* Update deploy.sh

* Update dev.sh

* Update deploy.sh

* Update dev.sh

* Update serge.env

* Update serge.env

* Update scripts/deploy.sh

Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>

* Update scripts/dev.sh

Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>

* Update dev.sh

* Update deploy.sh

* Only use official wheels for now

* Fix lint issues

---------

Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
2024-05-06 09:37:25 -04:00
Olivier DEBAUCHE
e1f966ace3
Fix ipv4/ipv6 modes (#1153)
* Update serge.env

Add ISERGE_ENABLE_IPV4

* Update deploy.sh

Now IPV4 is activate by deflault
We can activate IPV4+IPV6 or IPV6 only

* Update dev.sh

Now ipv4 is activated by default but we can also activate ipv4+ipv6 or ipv6 only

* Update dev.sh

fix port for ipv4

* Update serge.env

fix SERGE_ENABLE_IPV4 value

* Update deploy.sh

code formating

* Update dev.sh

code formating

* Update dev.sh

bugfix

* Update serge.env

---------

Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2024-02-24 11:41:51 -05:00
Olivier DEBAUCHE
2b0cfb2050
Update llama-cpp-python (#1137)
* Update serge.env

Update Llama cpp python version

* Update deploy.sh

Update path

* Update dev.sh

Update  path

* Update serge.env

Bump version to v0.2.43

* Update serge.env

Bump version of Llama cpp python to v0.2.44
2024-02-18 10:00:04 -05:00
Juan Calderon-Perez
e24c8feb98
Updates to UI components, cleanup of unused dependencies (#1136)
Fixes to UI components, accessibility labels. Removal of unused python dependencies. Migrate from black to ruff for formatting.
---------

Co-authored-by: morpheus2448 <41052975+morpheus2448@users.noreply.github.com>
2024-02-13 23:51:54 -05:00
Juan Calderon-Perez
d91dd4f689 Fix duplicated ports 2024-02-13 22:31:05 -05:00
Olivier DEBAUCHE
583d344338
Update llama-cpp-python to v0.2.38 (#1062)
* Update serge.env

Bump llama python in 0.2.28

* Update serge.env

* Update deploy.sh

change lama-cpp-python provider

* Update dev.sh

* Update serge.env

Bump version from 0.2.26 to 0.2.38

* Update dev.sh

---------

Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2024-02-04 20:00:58 -05:00
Olivier DEBAUCHE
f9d8ed2ff1
Add support for IPv6 (#1055)
* Update deploy.sh

add support ipv6

* Update dev.sh

add support for ipv6

* Update deploy.sh

add support for ipv6

* Update deploy.sh

add support for ipv6

* Update dev.sh

support  for ipv6

* Update dev.sh

support for ipv6 reworked
Thanks Gaby :)

* Update serge.env

add support for ipv6

* Update deploy.sh

support for ipv6 reworked
Thanks Gaby :)

* Update deploy.sh

bugfix

* Update serge.env

* Update serge.env

rename variable in SERGE_ENABLE_IPV6

* Update deploy.sh

rename variable in SERGE_ENABLE_IPV6

* Update dev.sh

rename variable in SERGE_ENABLE_IPV6

* Update deploy.sh

remove redudant code

* Update dev.sh

add missing code

* Update deploy.sh

tiny change

* Update dev.sh

bugfix

* Update deploy.sh

bugfix

* Update dev.sh

bugfix

* Update deploy.sh

change unicorn by hypercorn

* Update serge.env

delete unecessary param

* Update dev.sh

replace unicorn by hypercorn

* Update pyproject.toml

replace unicorn by hypercorn

* Update poetry.lock

replace unicorn by hypercorn

* Update poetry.lock

poetry updated

* Update pyproject.toml

update

* Update poetry.lock

hypercorn update

* Update deploy.sh

shmft applied

* Update dev.sh

shmft applied

* Update deploy.sh

shmft applied

* Update dev.sh

shmft applied

* Update dev.sh

bugfix

* Update serge.env

missing value

* Update deploy.sh

code corrected

* Update dev.sh

code corrected

* Update serge.env

 code corrected

* Update deploy.sh

rollback

* Update dev.sh

rollback

* Update serge.env

* Update deploy.sh

add SERGE_IPV6_SUPPORT

* Update dev.sh

Add SERGE_IPV6_SUPPORT

* Update dev.sh

* Update deploy.sh

---------

Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2024-01-18 08:31:22 -05:00
Juan Calderon-Perez
1684466304
Fix installation of aarch64 wheels (#938)
* Add support for installing aarch64 wheels

* Fix syntax
2023-12-03 22:13:44 -05:00
Juan Calderon-Perez
78cff9c47f
Updates to CI process for Python dependencies (#912) 2023-11-27 22:08:30 -05:00
Juan Calderon-Perez
9e9bc528d9
Fix for Apple Metal installation (#905)
* Update deploy.sh

* Update dev.sh
2023-11-26 22:03:50 -05:00
Juan Calderon-Perez
2dfcde881a
Add support for using wheels when installing llama-cpp-python (#904)
* Initial changes to support wheels

* Format shell files

* Remove curl, move location of .ENV file

* Fix path to shfmt

* Add OPT for ShellCheck

* Fix for SC1091

* Disable SC1091

* Fix delete prompt call when prompt in progress

* Add null check

* Revert changes to Dockerfile

* Fix syntax issue

* Remove duplicated command
2023-11-26 18:34:28 -05:00
Juan Calderon-Perez
947efe5d4b
Bump llama-cpp-python to v0.2.19 (#894) 2023-11-21 19:25:46 -05:00
Juan Calderon-Perez
c7f0b487aa
Add support for GGUF models (#866) 2023-11-17 09:07:02 -05:00
Mariusz Kogen
7f6321ae82
Fix llama-cpp-python build for Apple Silicon (#763)
* Fix llama-cpp-python build for Apple Silicon

* Make ShellCheck happy

* Make gaby happy

---------

Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2023-09-20 08:35:50 -04:00
Juan Calderon-Perez
0500cb2266
Remove support for DragonflyDB (#684) 2023-09-03 23:37:23 -04:00
Juan Calderon-Perez
53793ca580
Update llama-cpp-python to v0.1.78 (#653)
* Update dev.sh

* Update deploy.sh
2023-08-24 23:31:01 -04:00
Juan Calderon-Perez
12ec7b7f42
Support for DragonflyDB (#598) 2023-08-06 22:54:42 -04:00
Juan Calderon-Perez
20c3dac583
Update llama-cpp-python to v0.1.77 2023-07-29 23:56:49 -04:00
Juan Calderon-Perez
6445c21af0
Update llama-cpp-python to v0.1.70 (#518) 2023-07-09 18:57:34 -04:00
Juan Calderon-Perez
696c2d288c
Fixes to startup scripts and Dockerfiles (#517) 2023-07-09 18:28:33 -04:00
Juan Calderon-Perez
4970865a49
Add support for validating shell scripts (#416) 2023-06-11 20:39:11 -04:00
Nathan Sarrazin
e512011470
Use python bindings, integrate with LangChain and get rid of MongoDB (#148)
* integrate langchain
get rid of mongodb
use llama-cpp-python bindings

* fixed most chat endpoints except posting questions

* Working post endpoint !

* everything works except streaming

* current state

* streaming as is

* got rid of langchain wrapper for calling llm, went back to using bindings directly

* working streaming

* sort chats by time

* cleaned up styling and added back loading indicator

* Add persistence support to redis

* fixed tooltips

* fixed default prompts

* added link to api docs (closes How to use the api #155 )
2023-04-23 23:42:20 +02:00
Nathan Sarrazin
b5c423fe59
API Refactor & Model Manager (#101)
* API refactoring

* delete partially downloaded files on startup

* remove unused deps
2023-03-28 23:56:41 +02:00