serge

Author	SHA1	Message	Date
Juan Calderon-Perez	b4190e5616	Add support for latest debian release (#755 )	2023-09-19 22:55:19 -04:00
Juan Calderon-Perez	0500cb2266	Remove support for DragonflyDB (#684 )	2023-09-03 23:37:23 -04:00
Juan Calderon-Perez	6c88efe5ed	Add support for WizardCoder-Python-13B-V1.0-GGML (#681 )	2023-09-02 21:34:36 -04:00
PΔBLØ ᄃΞ	1444accc12	add: WizardCoder-Python-34B-V1.0 model (#659 ) * add: WizardCoder-Python-34B-V1.0 model * Fix json format --------- Co-authored-by: pabl-o-ce <cye@poscye.com>	2023-08-27 21:31:39 -04:00
Mike Zeng	6f4a396880	Adds support for LLaMA2 in the models.json (#612 ) Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>	2023-08-10 21:03:44 -04:00
Juan Calderon-Perez	12ec7b7f42	Support for DragonflyDB (#598 )	2023-08-06 22:54:42 -04:00
PΔBLØ ᄃΞ	8f997f6395	Add orca and remove models don't use alpaca template (#541 ) * Add orca and remove models don't use alpaca template * Add orca and remove models don't use alpaca template --------- Co-authored-by: pabl-o-ce <cye@poscye.com> Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>	2023-07-15 17:32:59 -04:00
Juan Calderon-Perez	da8c3e27d3	Refactor production Dockerfile, Add development Dockerfile (#485 ) * Refactor Dockerfile, add Dockerfile.dev for development * We need dev dependencies for running vite build	2023-07-01 22:47:29 -04:00
PΔBLØ ᄃΞ	c6c02271e0	Support for BigTrans, Minotaur, Robin, and Vicuna v1.3 models (#451 ) * Feature: add Vicuna-v1.3-7B and Vicuna-v1.3-13B * Feature: add BigTrans-13B * Feature: add robin,minotour,chronos-hermes,vicuna,trans * Feature: add robin,minotour,chronos-hermes,vicuna,trans * Docs: update model max ram required * Docs: update model max ram required * Feature: add more quants q2_k,q3_K_L,q4_1,q4_K_M,q8_0 on schema.json * Fix: missing emojis and typo vicuna models q3_K_L * Fix: minotaur * Fix: minotaur * Fix: schame.json enum k-quants * Fix: schame.json enum k-quants --------- Co-authored-by: pabl-o-ce <cye@poscye.com>	2023-06-19 17:16:41 -04:00
PΔBLØ ᄃΞ	6009f10fcb	Feature: Add Airoboros-1.2 models (#435 ) Co-authored-by: pabl-o-ce <cye@poscye.com>	2023-06-16 16:58:11 -04:00
PΔBLØ ᄃΞ	d1586d129a	Feature: add Chronos-Hermes-13B (#431 ) Co-authored-by: pabl-o-ce <cye@poscye.com>	2023-06-15 19:23:40 -04:00
PΔBLØ ᄃΞ	6bf84ddf8e	Add new ranked GGMLv3 models (#417 ) * Feature: add some best ranked models * Fix: models size * Update README.md --------- Co-authored-by: pabl-o-ce <cye@poscye.com> Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>	2023-06-12 01:31:17 -04:00
Juan Calderon-Perez	16699fac48	Update README.md	2023-06-11 16:40:33 -04:00
Juan Calderon-Perez	b1fb7009e7	Updates to README.md (#414 )	2023-06-11 16:36:28 -04:00
Juan Calderon-Perez	dfb98c6885	Adjust headings for model names	2023-06-11 00:43:03 -04:00
Juan Calderon-Perez	70c51a457c	Structure models list in README.md (#413 )	2023-06-11 00:40:34 -04:00
PΔBLØ ᄃΞ	634fdacc08	Feature: add new k-quants q6_K models (#412 ) * Feature: add new k-quants q6_K models * Feature: update llama-cpp-python==0.1.62 * Fix: labels and 7b not use k-quants * Fix: labels and 7b not use k-quants * Fix: labels and 7b old one and q6_K * Fix: labels and 7b old sizes * Fix: labels and 7b Koala names --------- Co-authored-by: pabl-o-ce <cye@poscye.com>	2023-06-10 23:28:05 -04:00
Nathan Sarrazin	0b3ad75016	it's main not latest I guess ?	2023-06-10 10:25:52 +02:00
Nathan Sarrazin	ec385e807a	update repo to reflect change in org	2023-06-10 09:39:53 +02:00
Juan Calderon-Perez	b65457227d	Updates to README.md (#402 )	2023-06-09 22:37:42 -04:00
PΔBLØ ᄃΞ	91b20bbb1c	Support for latest GGMLv3 models (#398 ) * Feature: add new models (vicuna, wizard-vicuna, wizardlm, samantha, lazarus, nous-hermes, koala) * Docs: add new models (vicuna, wizard-vicuna, wizardlm, samantha, lazarus, nous-hermes, koala) in README --------- Co-authored-by: pabl-o-ce <cye@poscye.com> Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>	2023-06-08 18:32:46 -04:00
Nathan Nye	51fae79aa2	GGMLv3 support (#334 ) * Pin llama-cpp-python to 0.1.54 for GGMLv3 support * Update to GGMLv3 models * Reflect current GGMLv3 models * More readable model names * Fix file sizes --------- Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>	2023-05-26 00:00:08 -04:00
PΔBLØ ᄃΞ	57bc893967	Web UI/UX improvements (#212 ) * Style: Web UI/UX improvements * Docs: add credits for the svg * Style: fix in header nav tittle always center * Style: fix chat padding md:responsive * Style: fix system chat padding md:responsive * Style: text-lg to text-base... [for me look better]. (suggestion only).. if you dont like it I remove it * Style: [suggestion] header title format one line 'Serge: <modelName>' second line '<date>' * Style: fix color using daisyui semantic color * Style: fix color user avatar * Style: data-theme=dark * Style: colors by compatible with all themes * Style: colors by compatible with all themes [second and last review] * Style: svg and colors on download models [replace icons to svg icons] * Style: input h-auto * Feature: add render markdown on user and serge * Style: code tag color and background for any theme * Style: copy to clipboard button background and overflow-y hidden in chat render parent element * Style: code block background adaptive to daisy themes * Style: remove images and add rubik-pixels font * Style: add rubik-pixels font from fontsource --------- Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>	2023-05-25 01:31:57 -04:00
Nathan Sarrazin	3a594ddab0	Update README.md (#227 )	2023-04-26 08:14:38 +02:00
Juan Calderon-Perez	330c5a8818	Use Debian Slim for base images, Fixes to CI actions (#221 ) * Use slim variant for Docker, CI fixes * Config must be added after installation * Use Python3.9, updates to CI * Change min required version of Python	2023-04-25 09:21:19 +02:00
Nathan Sarrazin	e512011470	Use python bindings, integrate with LangChain and get rid of MongoDB (#148 ) * integrate langchain get rid of mongodb use llama-cpp-python bindings * fixed most chat endpoints except posting questions * Working post endpoint ! * everything works except streaming * current state * streaming as is * got rid of langchain wrapper for calling llm, went back to using bindings directly * working streaming * sort chats by time * cleaned up styling and added back loading indicator * Add persistence support to redis * fixed tooltips * fixed default prompts * added link to api docs (closes How to use the api #155 )	2023-04-23 23:42:20 +02:00
Juan Calderon-Perez	c48dc72c84	Fix spelling mistakes (#160 )	2023-04-16 20:49:14 -04:00
Juan Calderon-Perez	3d713bfbb0	Link to llama.cpp git repo. (#126 )	2023-04-03 06:10:50 +02:00
Juan Calderon-Perez	424f670e0f	Update llama.cpp to master-e986f94 (#124 ) * Update Dockerfile * Add support for GPT4All * Skip running Github Actions on .md changed --------- Co-authored-by: Nathan Sarrazin <sarrazin.nathan@gmail.com>	2023-04-02 20:38:28 +02:00
PΔBLØ ᄃΞ	3ed6f8e20f	docs: correct the name 7B-native on Readme (#120 )	2023-03-31 19:50:17 +02:00
Nathan Sarrazin	f54b1ce44d	Update README.md	2023-03-31 08:51:54 +02:00
Nathan Sarrazin	07ce53730c	Update README.md	2023-03-30 00:05:11 +02:00
Nathan Sarrazin	3930a9371a	Update README.md	2023-03-29 07:50:11 +02:00
Nathan Sarrazin	f786c498ac	Update README.md	2023-03-29 06:40:56 +02:00
Nathan Sarrazin	b5c423fe59	API Refactor & Model Manager (#101 ) * API refactoring * delete partially downloaded files on startup * remove unused deps	2023-03-28 23:56:41 +02:00
Paraskevas Leivadaros	bad45112c2	Update README.md (#94 ) Added a note on storage for each model	2023-03-28 17:54:54 +02:00
Nathan Sarrazin	293a23e487	Proper dev & deployment images. Converts front-end to SPA. (#84 ) * begin work on dev environment * more work on dev image * working dev + prod images with SPA front-end * reworked dockerfile * make CI point to the right action * Improvements to github actions (#79) * Improvements to github actions * Change username to repo owner username * Add fix for login into ghcr (#81) * Update bug_report.yml * added dev instructions to readme * reduced number of steps in dockerfile --------- Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>	2023-03-27 16:44:20 +02:00
Fenark	cccc8d2746	Initial support for Kubernetes (#71 ) * Initial Kubernetes support * Adding manifest.yaml * removed custom config & added placeholder variables * Adding K8S deployment steps on README.md * adding --host parameter to uvicorn deploy.sh * changed web/vite.config.ts placeholder variable to service name * migrated manifest.yaml to the wiki and edited README.md	2023-03-26 12:59:39 +02:00
Nathan Sarrazin	7e0149b6f9	Update README.md	2023-03-25 16:25:57 +01:00
Nathan Sarrazin	28007b249a	Update README.md	2023-03-25 16:25:28 +01:00
Louis	7af808bf5c	Fix README.md (Getting started commands) (#59 ) * Fix README.md (Getting started commands) * Revert docker compose v2 syntax (no dash)	2023-03-25 13:19:58 +01:00
Nathan Sarrazin	52327af045	Update README.md	2023-03-25 11:55:28 +01:00
Nathan Sarrazin	d697059122	Move everything into a single dockerfile (#57 ) * initial work on linting & templates * moved everyone into a nice dockerfile * move everyone into a single dockerfile * update sample .env file * got rid of .env file * rename db volume to avoid confusion and conflicts with previous version * added bug report template	2023-03-25 11:55:01 +01:00
Nathan Sarrazin	546c4114ed	Update README.md	2023-03-25 07:29:37 +01:00
Nathan Sarrazin	47b63b420e	Update README.md	2023-03-24 18:16:03 +01:00
Nathan Sarrazin	7d3a600345	Update README.md	2023-03-23 14:12:35 +01:00
Nathan Sarrazin	9cc88cb621	Fix git link on README	2023-03-23 05:24:44 +01:00
Nathan Sarrazin	c336aa3d0b	clearer readme	2023-03-22 18:22:19 +01:00
Nathan Sarrazin	a1cc66a1ca	Make it easier to get started	2023-03-22 18:15:51 +01:00
Nathan Sarrazin	b4c329286c	Update README.md	2023-03-22 12:54:07 +01:00

1 2

69 Commits