Query a local model from your browser.
Elevate your browsing with our Chrome extension, designed to interact seamlessly with local models hosted on your own server. It lets you query local models quickly and precisely, all from within your browser.
The extension is compatible with llama.cpp and .gguf models, giving you a versatile setup for local inference. To get started, grab the latest version, which includes a sample llama.cpp Flask server for convenience. You can find it in our GitHub repository:
GitHub Repository - Local Llama Chrome Extension:
https://github.com/mrdiamonddirt/local-llama-chrome-extension
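For a sense of what such a server looks like, here is a minimal sketch built on the llama-cpp-python bindings. It is not the packaged server from the repository; the model path, port, and `/generate` endpoint are illustrative assumptions.

```
# Minimal sketch of a llama.cpp Flask server (assumes the flask and
# llama-cpp-python packages and a local .gguf file; the model path,
# port, and /generate route are placeholders, not the packaged server).
from flask import Flask, jsonify, request
from llama_cpp import Llama

app = Flask(__name__)

# Load the .gguf model once at startup (path is an assumption).
llm = Llama(model_path="./models/your-model.gguf")

@app.route("/generate", methods=["POST"])
def generate():
    # Expect JSON like {"prompt": "...", "max_tokens": 128}.
    data = request.get_json(force=True)
    result = llm(data["prompt"], max_tokens=data.get("max_tokens", 128))
    # llama-cpp-python returns an OpenAI-style completion dict.
    return jsonify({"text": result["choices"][0]["text"]})

if __name__ == "__main__":
    app.run(host="127.0.0.1", port=5000)
```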
To set up the server, install its pip package:
```
pip install local-llama
```
Then start the server by running:
```
local-llama
```
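Once the server is running, the extension talks to it over HTTP. As a rough smoke test (the actual route and payload are defined by the packaged server, so the URL path and JSON shape below mirror the sketch above and are assumptions), you can query it from Python:

```
# Hypothetical smoke test against the local server; the /generate path
# and JSON fields follow the sketch above, not necessarily the package.
import requests

resp = requests.post(
    "http://127.0.0.1:5000/generate",
    json={"prompt": "Explain GGUF in one sentence.", "max_tokens": 64},
    timeout=120,
)
print(resp.json()["text"])
```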
With just a few steps, you can put the extension to work: run the provided Python server, install the extension, and start querying your local models from the browser. Experience browser-based model interaction today.