mirror of
https://github.com/mudler/LocalAI
synced 2026-04-21 21:37:21 +00:00
|
|
||
|---|---|---|
| .. | ||
| data | ||
| models | ||
| .gitignore | ||
| docker-compose.yml | ||
| query.py | ||
| README.md | ||
| store.py | ||
| update.py | ||
Data query example
This example makes use of Llama-Index to enable question answering on a set of documents.
It loosely follows the quickstart.
Summary of the steps:
- prepare the dataset (and store it into
data) - prepare a vector index database to run queries on
- run queries
Requirements
You will need a training data set. Copy that over data.
Setup
Start the API:
# Clone LocalAI
git clone https://github.com/go-skynet/LocalAI
cd LocalAI/examples/query_data
wget https://huggingface.co/skeskinen/ggml/resolve/main/all-MiniLM-L6-v2/ggml-model-q4_0.bin -O models/bert
wget https://gpt4all.io/models/ggml-gpt4all-j.bin -O models/ggml-gpt4all-j
# start with docker-compose
docker-compose up -d --build
Create a storage
In this step we will create a local vector database from our document set, so later we can ask questions on it with the LLM.
export OPENAI_API_BASE=http://localhost:8080/v1
export OPENAI_API_KEY=sk-
python store.py
After it finishes, a directory "storage" will be created with the vector index database.
Query
We can now query the dataset.
export OPENAI_API_BASE=http://localhost:8080/v1
export OPENAI_API_KEY=sk-
python query.py
Update
To update our vector database, run update.py
export OPENAI_API_BASE=http://localhost:8080/v1
export OPENAI_API_KEY=sk-
python update.py