17 Commits

Author SHA1 Message Date
Juan Calderon-Perez
9dc8f42793
Merge branch 'main' into gpu-support 2024-02-24 17:01:01 -05:00
Olivier DEBAUCHE
e1f966ace3
Fix ipv4/ipv6 modes (#1153)
* Update serge.env

Add ISERGE_ENABLE_IPV4

* Update deploy.sh

Now IPV4 is activate by deflault
We can activate IPV4+IPV6 or IPV6 only

* Update dev.sh

Now ipv4 is activated by default but we can also activate ipv4+ipv6 or ipv6 only

* Update dev.sh

fix port for ipv4

* Update serge.env

fix SERGE_ENABLE_IPV4 value

* Update deploy.sh

code formating

* Update dev.sh

code formating

* Update dev.sh

bugfix

* Update serge.env

---------

Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2024-02-24 11:41:51 -05:00
Olivier DEBAUCHE
b5b35fc11e
Models update (#1154)
* Update models.json

Add support for Gemma 2B and 7B

* Update models.json

Add support for LLama pro

* Update models.json

Add support for TinyLlama

* Update models.json

Update Medicine LLM

* Update README.md

* Update serge.env

Bump version of LLama cpp to support Gemma Model
2024-02-24 00:55:16 -05:00
Olivier DEBAUCHE
235d65ca12
Update llama-cpp-python (#1138)
* Update serge.env

* Update deploy.sh

Update path

* Update dev.sh

update path

* Update serge.env

* Update serge.env

Bump version of Llama cpp python to v0.2.44
2024-02-18 10:00:49 -05:00
Olivier DEBAUCHE
2b0cfb2050
Update llama-cpp-python (#1137)
* Update serge.env

Update Llama cpp python version

* Update deploy.sh

Update path

* Update dev.sh

Update  path

* Update serge.env

Bump version to v0.2.43

* Update serge.env

Bump version of Llama cpp python to v0.2.44
2024-02-18 10:00:04 -05:00
Juan Calderon-Perez
6ecf1797b8
Revert back to llama-cpp-python v0.2.39 2024-02-13 22:17:07 -05:00
Juan Calderon-Perez
61ae2eaf80
Update llama-cpp-python to v0.2.41 (#1133) 2024-02-13 22:02:01 -05:00
Juan Calderon-Perez
b65e7abd10
Merge branch 'main' into gpu-support 2024-02-12 22:55:49 -05:00
Olivier DEBAUCHE
8e35f238c3
Add GPU support (#1056)
* Update dev.sh

* Update deploy.sh

* Update serge.env

---------

Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2024-02-12 22:54:26 -05:00
Olivier DEBAUCHE
86a2c7f18d
Update Llama cpp python from 0.2.38 to 0.2.39 (#1119)
* Update serge.env

Bump llama python in 0.2.28

* Update serge.env

* Update deploy.sh

change lama-cpp-python provider

* Update dev.sh

* Update serge.env

Bump version from 0.2.26 to 0.2.38

* Update dev.sh

* Update serge.env

Bump version from 0.2.38 to 0.2.39

---------

Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2024-02-07 09:01:46 -05:00
Olivier DEBAUCHE
583d344338
Update llama-cpp-python to v0.2.38 (#1062)
* Update serge.env

Bump llama python in 0.2.28

* Update serge.env

* Update deploy.sh

change lama-cpp-python provider

* Update dev.sh

* Update serge.env

Bump version from 0.2.26 to 0.2.38

* Update dev.sh

---------

Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2024-02-04 20:00:58 -05:00
Olivier DEBAUCHE
f9d8ed2ff1
Add support for IPv6 (#1055)
* Update deploy.sh

add support ipv6

* Update dev.sh

add support for ipv6

* Update deploy.sh

add support for ipv6

* Update deploy.sh

add support for ipv6

* Update dev.sh

support  for ipv6

* Update dev.sh

support for ipv6 reworked
Thanks Gaby :)

* Update serge.env

add support for ipv6

* Update deploy.sh

support for ipv6 reworked
Thanks Gaby :)

* Update deploy.sh

bugfix

* Update serge.env

* Update serge.env

rename variable in SERGE_ENABLE_IPV6

* Update deploy.sh

rename variable in SERGE_ENABLE_IPV6

* Update dev.sh

rename variable in SERGE_ENABLE_IPV6

* Update deploy.sh

remove redudant code

* Update dev.sh

add missing code

* Update deploy.sh

tiny change

* Update dev.sh

bugfix

* Update deploy.sh

bugfix

* Update dev.sh

bugfix

* Update deploy.sh

change unicorn by hypercorn

* Update serge.env

delete unecessary param

* Update dev.sh

replace unicorn by hypercorn

* Update pyproject.toml

replace unicorn by hypercorn

* Update poetry.lock

replace unicorn by hypercorn

* Update poetry.lock

poetry updated

* Update pyproject.toml

update

* Update poetry.lock

hypercorn update

* Update deploy.sh

shmft applied

* Update dev.sh

shmft applied

* Update deploy.sh

shmft applied

* Update dev.sh

shmft applied

* Update dev.sh

bugfix

* Update serge.env

missing value

* Update deploy.sh

code corrected

* Update dev.sh

code corrected

* Update serge.env

 code corrected

* Update deploy.sh

rollback

* Update dev.sh

rollback

* Update serge.env

* Update deploy.sh

add SERGE_IPV6_SUPPORT

* Update dev.sh

Add SERGE_IPV6_SUPPORT

* Update dev.sh

* Update deploy.sh

---------

Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2024-01-18 08:31:22 -05:00
Juan Calderon-Perez
b0f19fbc57
Update llama-cpp-python to v0.2.26 (#1020) 2023-12-30 19:58:14 -05:00
Juan Calderon-Perez
586f556577
Update llama-cpp-python to v0.2.25 (#1015) 2023-12-26 22:41:08 -05:00
Juan Calderon-Perez
82db8ac930
Bump llama-cpp-python to v0.2.23 (#982) 2023-12-15 07:56:56 -05:00
Juan Calderon-Perez
b49e7ceb57
Bump llama-cpp-python to v0.2.20 2023-11-28 08:16:35 -05:00
Juan Calderon-Perez
2dfcde881a
Add support for using wheels when installing llama-cpp-python (#904)
* Initial changes to support wheels

* Format shell files

* Remove curl, move location of .ENV file

* Fix path to shfmt

* Add OPT for ShellCheck

* Fix for SC1091

* Disable SC1091

* Fix delete prompt call when prompt in progress

* Add null check

* Revert changes to Dockerfile

* Fix syntax issue

* Remove duplicated command
2023-11-26 18:34:28 -05:00