
RAG, Langchain and Chromadb example #31

Merged: 5 commits into containers:main on Jan 31, 2024

Conversation

@MichaelClifford (Collaborator) commented on Jan 30, 2024:

This PR adds a new recipe, rag-langchain, to the repo. In this example we build only the AI application code and rely on the existing playground image for our model service and the external ChromaDB image for our vector DB service.
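For orientation, here is a minimal sketch of the wiring this description implies: a LangChain RAG chain backed by a remote ChromaDB server and an OpenAI-compatible playground model service. This is not the recipe's actual rag_app.py; the hosts, ports, collection name, and embedding model below are illustrative assumptions.

```python
import chromadb
from langchain_community.embeddings import HuggingFaceEmbeddings
from langchain_community.vectorstores import Chroma
from langchain_core.prompts import ChatPromptTemplate
from langchain_core.runnables import RunnablePassthrough
from langchain_openai import ChatOpenAI

# External ChromaDB service (assumed to listen on localhost:8000).
client = chromadb.HttpClient(host="localhost", port=8000)
embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
db = Chroma(client=client, collection_name="docs", embedding_function=embeddings)

# Playground model service exposing an OpenAI-compatible API (assumed port).
llm = ChatOpenAI(base_url="http://localhost:8001/v1", api_key="not-needed")

prompt = ChatPromptTemplate.from_template(
    "Answer using only this context:\n{context}\n\nQuestion: {question}"
)

def format_docs(docs):
    # Flatten retrieved Documents into one context string for the prompt.
    return "\n\n".join(d.page_content for d in docs)

chain = (
    {"context": db.as_retriever() | format_docs, "question": RunnablePassthrough()}
    | prompt
    | llm
)
print(chain.invoke("What does this recipe demonstrate?").content)
```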

@MichaelClifford changed the title from "[WIP] RAG, Langchain and Chromadb example" to "RAG, Langchain and Chromadb example" on Jan 30, 2024.
@lstocchi (Collaborator) left a comment:

I'm OK with using the playground. My only concern is that we also need to have a playground-cuda image if we're going to update the other samples.

Referenced snippet from rag-langchain/ai-studio.yaml:

```yaml
arch:
- arm64
- amd64
- name: chromadb-server
```
Collaborator:

We need a way to differentiate it from the sample app

Collaborator:

Then, I don't know how fast it starts, but if it's slow and the sample app fails to connect, we need a way to listen on its port so that when it's ready we can restart the sample app container. I need to test it.
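If startup latency does turn out to be the problem, one option (a sketch only, not the extension's current behavior) is to poll the vector DB's TCP port before (re)starting the sample app container:

```python
import socket
import time

def wait_for_port(host: str, port: int, timeout_s: float = 60.0) -> bool:
    """Poll a TCP port until it accepts connections or the deadline passes."""
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        try:
            with socket.create_connection((host, port), timeout=2):
                return True
        except OSError:
            time.sleep(1)
    return False

# Example: gate the sample-app restart on ChromaDB's (assumed) port 8000.
if wait_for_port("localhost", 8000):
    print("chromadb is accepting connections; safe to (re)start the sample app")
```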

Collaborator:

We talked with @jeffmaury earlier. Right now we find the port to listen on by using the Containerfile (we look at the EXPOSE instruction). If we don't have one because we use a prebuilt image, maybe it's better to specify the port inside the ai-studio.yaml; if you inspect the image you could see multiple ports (some belonging to the parent image).
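As a sketch of that suggestion, the entry for an image-only container could carry an explicit port declaration. The ports key below is hypothetical, not an existing ai-studio.yaml field:

```yaml
# Hypothetical sketch: `ports` is a proposed field, not confirmed schema.
- name: chromadb-server
  arch:
    - arm64
    - amd64
  ports:
    - 8000   # taken from the image's documented `-p 8000:8000` usage
```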

@MichaelClifford (Collaborator, Author):

Isn't it differentiated by the model-service=True parameter?

Collaborator:

I agree: any information you need to build the application is best included in the ai-studio.yaml, so you only have to depend on that single file.

@MichaelClifford (Collaborator, Author):

Wouldn't there be instances where you'd want the user to define the ports to use just before runtime?

Collaborator:

So at the moment, what we do is: if model_service=true we mount the volume and set the model_path, else we set the model_endpoint env variable. When running the chromadb service the model_endpoint is not really necessary, so it would be nice to know whether the item we are handling is the sample app or the vector DB.

Regarding the port: reading the README, I see that we should run podman run -it -p 8000:8000 chroma, but where do I take the port value I have to open if we do not add it in the ai-studio.yaml?
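In pseudocode, the branching described above looks roughly like this (an illustrative sketch, not the extension's actual implementation; all names are assumptions):

```python
# Sketch of the current branching on the model-service flag (names assumed).
def configure_container(item: dict, model_path: str, model_endpoint: str) -> dict:
    if item.get("model-service"):
        # Model service: mount the model volume and point at the local path.
        return {"volumes": [model_path], "env": {"MODEL_PATH": model_path}}
    # Everything else (sample app today, but also a vector DB): endpoint env.
    return {"env": {"MODEL_ENDPOINT": model_endpoint}}
```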

@MichaelClifford (Collaborator, Author):

Should we add a flag vectordb=true?
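That would extend the earlier sketch with one more boolean, mirroring the existing model-service flag (again a hypothetical field, not confirmed schema):

```yaml
- name: chromadb-server
  vectordb: true   # hypothetical flag, analogous to model-service
```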

@jeffmaury (Collaborator):

We should keep in mind to stay as generic as we can. I don't see why we need to differentiate the client app and the vector DB. If we need to pass the port to another container, then we should do what is done in Kubernetes, i.e. have an env var named CONTAINER_NAME_PORT.
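Under that convention, the sample app would read an injected variable instead of a hardcoded value. The variable names below follow the proposed CONTAINER_NAME_PORT pattern and are assumptions, not settled names:

```python
import os

# Proposed Kubernetes-style convention: the pod injects CHROMADB_SERVER_PORT
# (name derived from the container name); fall back to the documented 8000.
chroma_host = os.environ.get("CHROMADB_SERVER_HOST", "localhost")
chroma_port = int(os.environ.get("CHROMADB_SERVER_PORT", "8000"))
print(f"connecting to ChromaDB at {chroma_host}:{chroma_port}")
```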

Collaborator:

@jeffmaury I don't get your point. We need to know whether we are working with a sample app or a vector DB. When dealing with the sample app container we need to open its port into the pod so that it can be reached; nothing for the vector DB. For the sample app we also need to fill the env variable so that it can connect to the model service; again, apparently nothing for the vector DB. So there is a difference when adding a sample app versus a vector DB to the pod.

@sallyom (Collaborator) left a comment:

lgtm

@sallyom merged commit c248cc2 into containers:main on Jan 31, 2024. 1 check passed.