8 Commits

Author SHA1 Message Date
Juan Calderon-Perez
8c211053d4
Update llama-cpp-python to v0.1.61 (#403) 2023-06-09 23:54:56 -04:00
Juan Calderon-Perez
77131da4f1
Update llama-cpp-python to v0.1.59 (#401) 2023-06-09 22:08:44 -04:00
Juan Calderon-Perez
13daab6880
Update llama-cpp-bindings (#377) 2023-06-03 11:04:29 -04:00
Juan Calderon-Perez
c83a30797a
Update python bindings to 0.1.55 (#355) 2023-05-29 18:54:49 -04:00
Nathan Nye
51fae79aa2
GGMLv3 support (#334)
* Pin llama-cpp-python to 0.1.54 for GGMLv3 support

* Update to GGMLv3 models

* Reflect current GGMLv3 models

* More readable model names

* Fix file sizes

---------

Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>
2023-05-26 00:00:08 -04:00
Juan Calderon-Perez
65ab97bdd4
Pin llama-cpp-python to 0.1.49 (#294) 2023-05-12 10:01:41 -04:00
Nathan Sarrazin
e512011470
Use python bindings, integrate with LangChain and get rid of MongoDB (#148)
* integrate langchain
get rid of mongodb
use llama-cpp-python bindings

* fixed most chat endpoints except posting questions

* Working post endpoint !

* everything works except streaming

* current state

* streaming as is

* got rid of langchain wrapper for calling llm, went back to using bindings directly

* working streaming

* sort chats by time

* cleaned up styling and added back loading indicator

* Add persistence support to redis

* fixed tooltips

* fixed default prompts

* added link to api docs (closes How to use the api #155 )
2023-04-23 23:42:20 +02:00
Nathan Sarrazin
b5c423fe59
API Refactor & Model Manager (#101)
* API refactoring

* delete partially downloaded files on startup

* remove unused deps
2023-03-28 23:56:41 +02:00