69 Commits

Author SHA1 Message Date
Juan Calderon-Perez
b4190e5616
Add support for latest debian release (#755) 2023-09-19 22:55:19 -04:00
Juan Calderon-Perez
0500cb2266
Remove support for DragonflyDB (#684) 2023-09-03 23:37:23 -04:00
Juan Calderon-Perez
6c88efe5ed
Add support for WizardCoder-Python-13B-V1.0-GGML (#681) 2023-09-02 21:34:36 -04:00
PΔBLØ ᄃΞ
1444accc12
add: WizardCoder-Python-34B-V1.0 model (#659)
* add: WizardCoder-Python-34B-V1.0 model

* Fix json format

---------

Co-authored-by: pabl-o-ce <cye@poscye.com>
2023-08-27 21:31:39 -04:00
Mike Zeng
6f4a396880
Adds support for LLaMA2 in the models.json (#612)
Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2023-08-10 21:03:44 -04:00
Juan Calderon-Perez
12ec7b7f42
Support for DragonflyDB (#598) 2023-08-06 22:54:42 -04:00
PΔBLØ ᄃΞ
8f997f6395
Add orca and remove models don't use alpaca template (#541)
* Add orca and remove models don't use alpaca template

* Add orca and remove models don't use alpaca template

---------

Co-authored-by: pabl-o-ce <cye@poscye.com>
Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2023-07-15 17:32:59 -04:00
Juan Calderon-Perez
da8c3e27d3
Refactor production Dockerfile, Add development Dockerfile (#485)
* Refactor Dockerfile, add Dockerfile.dev for development

* We need dev dependencies for running vite build
2023-07-01 22:47:29 -04:00
PΔBLØ ᄃΞ
c6c02271e0
Support for BigTrans, Minotaur, Robin, and Vicuna v1.3 models (#451)
* Feature: add Vicuna-v1.3-7B and Vicuna-v1.3-13B

* Feature: add BigTrans-13B

* Feature: add robin,minotour,chronos-hermes,vicuna,trans

* Feature: add robin,minotour,chronos-hermes,vicuna,trans

* Docs: update model max ram required

* Docs: update model max ram required

* Feature: add more quants q2_k,q3_K_L,q4_1,q4_K_M,q8_0 on schema.json

* Fix: missing emojis and typo vicuna models q3_K_L

* Fix: minotaur

* Fix: minotaur

* Fix: schame.json enum k-quants

* Fix: schame.json enum k-quants

---------

Co-authored-by: pabl-o-ce <cye@poscye.com>
2023-06-19 17:16:41 -04:00
PΔBLØ ᄃΞ
6009f10fcb
Feature: Add Airoboros-1.2 models (#435)
Co-authored-by: pabl-o-ce <cye@poscye.com>
2023-06-16 16:58:11 -04:00
PΔBLØ ᄃΞ
d1586d129a
Feature: add Chronos-Hermes-13B (#431)
Co-authored-by: pabl-o-ce <cye@poscye.com>
2023-06-15 19:23:40 -04:00
PΔBLØ ᄃΞ
6bf84ddf8e
Add new ranked GGMLv3 models (#417)
* Feature: add some best ranked models

* Fix: models size

* Update README.md

---------

Co-authored-by: pabl-o-ce <cye@poscye.com>
Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2023-06-12 01:31:17 -04:00
Juan Calderon-Perez
16699fac48
Update README.md 2023-06-11 16:40:33 -04:00
Juan Calderon-Perez
b1fb7009e7
Updates to README.md (#414) 2023-06-11 16:36:28 -04:00
Juan Calderon-Perez
dfb98c6885
Adjust headings for model names 2023-06-11 00:43:03 -04:00
Juan Calderon-Perez
70c51a457c
Structure models list in README.md (#413) 2023-06-11 00:40:34 -04:00
PΔBLØ ᄃΞ
634fdacc08
Feature: add new k-quants q6_K models (#412)
* Feature: add new k-quants q6_K models

* Feature: update llama-cpp-python==0.1.62

* Fix: labels and 7b not use k-quants

* Fix: labels and 7b not use k-quants

* Fix: labels and 7b old one and q6_K

* Fix: labels and 7b old sizes

* Fix: labels and 7b Koala names

---------

Co-authored-by: pabl-o-ce <cye@poscye.com>
2023-06-10 23:28:05 -04:00
Nathan Sarrazin
0b3ad75016 it's main not latest I guess ? 2023-06-10 10:25:52 +02:00
Nathan Sarrazin
ec385e807a update repo to reflect change in org 2023-06-10 09:39:53 +02:00
Juan Calderon-Perez
b65457227d
Updates to README.md (#402) 2023-06-09 22:37:42 -04:00
PΔBLØ ᄃΞ
91b20bbb1c
Support for latest GGMLv3 models (#398)
* Feature: add new models (vicuna, wizard-vicuna, wizardlm, samantha, lazarus, nous-hermes, koala)

* Docs: add new models (vicuna, wizard-vicuna, wizardlm, samantha, lazarus, nous-hermes, koala) in README

---------

Co-authored-by: pabl-o-ce <cye@poscye.com>
Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2023-06-08 18:32:46 -04:00
Nathan Nye
51fae79aa2
GGMLv3 support (#334)
* Pin llama-cpp-python to 0.1.54 for GGMLv3 support

* Update to GGMLv3 models

* Reflect current GGMLv3 models

* More readable model names

* Fix file sizes

---------

Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2023-05-26 00:00:08 -04:00
PΔBLØ ᄃΞ
57bc893967
Web UI/UX improvements (#212)
* Style: Web UI/UX improvements

* Docs: add credits for the svg

* Style: fix in header nav tittle always center

* Style: fix chat padding md:responsive

* Style: fix system chat padding md:responsive

* Style: text-lg to text-base... [for me look better]. (suggestion only).. if you dont like it I remove it

* Style: [suggestion] header title format one line 'Serge: <modelName>' second line '<date>'

* Style: fix color using daisyui semantic color

* Style: fix color user avatar

* Style: data-theme=dark

* Style: colors by compatible with all themes

* Style: colors by compatible with all themes [second and last review]

* Style: svg and colors on download models [replace icons to svg icons]

* Style: input h-auto

* Feature: add render markdown on user and serge

* Style: code tag color and background for any theme

* Style: copy to clipboard button background and overflow-y hidden in chat render parent element

* Style: code block background adaptive to daisy themes

* Style: remove images and add rubik-pixels font

* Style: add rubik-pixels font from fontsource

---------

Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2023-05-25 01:31:57 -04:00
Nathan Sarrazin
3a594ddab0
Update README.md (#227) 2023-04-26 08:14:38 +02:00
Juan Calderon-Perez
330c5a8818
Use Debian Slim for base images, Fixes to CI actions (#221)
* Use slim variant for Docker, CI fixes

* Config must be added after installation

* Use Python3.9, updates to CI

* Change min required version of Python
2023-04-25 09:21:19 +02:00
Nathan Sarrazin
e512011470
Use python bindings, integrate with LangChain and get rid of MongoDB (#148)
* integrate langchain
get rid of mongodb
use llama-cpp-python bindings

* fixed most chat endpoints except posting questions

* Working post endpoint !

* everything works except streaming

* current state

* streaming as is

* got rid of langchain wrapper for calling llm, went back to using bindings directly

* working streaming

* sort chats by time

* cleaned up styling and added back loading indicator

* Add persistence support to redis

* fixed tooltips

* fixed default prompts

* added link to api docs (closes How to use the api #155 )
2023-04-23 23:42:20 +02:00
Juan Calderon-Perez
c48dc72c84
Fix spelling mistakes (#160) 2023-04-16 20:49:14 -04:00
Juan Calderon-Perez
3d713bfbb0
Link to llama.cpp git repo. (#126) 2023-04-03 06:10:50 +02:00
Juan Calderon-Perez
424f670e0f
Update llama.cpp to master-e986f94 (#124)
* Update Dockerfile

* Add support for GPT4All

* Skip running Github Actions on .md changed

---------

Co-authored-by: Nathan Sarrazin <sarrazin.nathan@gmail.com>
2023-04-02 20:38:28 +02:00
PΔBLØ ᄃΞ
3ed6f8e20f
docs: correct the name 7B-native on Readme (#120) 2023-03-31 19:50:17 +02:00
Nathan Sarrazin
f54b1ce44d
Update README.md 2023-03-31 08:51:54 +02:00
Nathan Sarrazin
07ce53730c
Update README.md 2023-03-30 00:05:11 +02:00
Nathan Sarrazin
3930a9371a
Update README.md 2023-03-29 07:50:11 +02:00
Nathan Sarrazin
f786c498ac
Update README.md 2023-03-29 06:40:56 +02:00
Nathan Sarrazin
b5c423fe59
API Refactor & Model Manager (#101)
* API refactoring

* delete partially downloaded files on startup

* remove unused deps
2023-03-28 23:56:41 +02:00
Paraskevas Leivadaros
bad45112c2
Update README.md (#94)
Added a note on storage for each model
2023-03-28 17:54:54 +02:00
Nathan Sarrazin
293a23e487
Proper dev & deployment images. Converts front-end to SPA. (#84)
* begin work on dev environment

* more work on dev image

* working dev + prod images with SPA front-end

* reworked dockerfile

* make CI point to the right action

* Improvements to github actions (#79)

* Improvements to github actions

* Change username to repo owner username

* Add fix for login into ghcr (#81)

* Update bug_report.yml

* added dev instructions to readme

* reduced number of steps in dockerfile

---------

Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2023-03-27 16:44:20 +02:00
Fenark
cccc8d2746
Initial support for Kubernetes (#71)
* Initial Kubernetes support

* Adding manifest.yaml

* removed custom config & added placeholder variables

* Adding K8S deployment steps on README.md

* adding --host parameter to uvicorn deploy.sh

* changed web/vite.config.ts placeholder variable to service name

* migrated manifest.yaml to the wiki and edited README.md
2023-03-26 12:59:39 +02:00
Nathan Sarrazin
7e0149b6f9
Update README.md 2023-03-25 16:25:57 +01:00
Nathan Sarrazin
28007b249a
Update README.md 2023-03-25 16:25:28 +01:00
Louis
7af808bf5c
Fix README.md (Getting started commands) (#59)
* Fix README.md (Getting started commands)

* Revert docker compose v2 syntax (no dash)
2023-03-25 13:19:58 +01:00
Nathan Sarrazin
52327af045
Update README.md 2023-03-25 11:55:28 +01:00
Nathan Sarrazin
d697059122
Move everything into a single dockerfile (#57)
* initial work on linting & templates

* moved everyone into a nice dockerfile

* move everyone into a single dockerfile

* update sample .env file

* got rid of .env file

* rename db volume to avoid confusion and conflicts with previous version

* added bug report template
2023-03-25 11:55:01 +01:00
Nathan Sarrazin
546c4114ed
Update README.md 2023-03-25 07:29:37 +01:00
Nathan Sarrazin
47b63b420e
Update README.md 2023-03-24 18:16:03 +01:00
Nathan Sarrazin
7d3a600345
Update README.md 2023-03-23 14:12:35 +01:00
Nathan Sarrazin
9cc88cb621
Fix git link on README 2023-03-23 05:24:44 +01:00
Nathan Sarrazin
c336aa3d0b clearer readme 2023-03-22 18:22:19 +01:00
Nathan Sarrazin
a1cc66a1ca Make it easier to get started 2023-03-22 18:15:51 +01:00
Nathan Sarrazin
b4c329286c
Update README.md 2023-03-22 12:54:07 +01:00