Chat with Docs using Hybrid RAG - Use iVSM (CPU inferences) #738

chetan-hirapara · 2024-11-14T13:02:48Z

New changes:

1. Use multiple data sources like PDF, Text and Audio files from insurance domain for chat with pdf
2. Change title as suggested in mail. - Teradata Enterprise Vector Store : vectorizing PDFs
3. For chunking of pdf text, can you do in-db STO with python. - It seems complex and time consuming
4. Use HF models for create embeddings via BYOM approach (parallel CPU inferencing)
5. Use 3rd party LLM (OpenAI/Bedrock/Gemini) for final answer
6. You will have the use HF model also for question --> embeddings
7. Also make some visualization (embedding to 2D) to show the selected chunk based on questions . I think scatter plot could be good which shows all chunks, question, and selected chunk
8. Store PDFs in object store or Vantage Table (pointing to object store)
9. No needs to add chat UI, Create pre-defined questions in a dropdown, and it can answer based on question selected.

PR# #752

chetan-hirapara self-assigned this Nov 14, 2024

chetan-hirapara changed the title ~~Caht with Docs - Use IVSM~~ Chat with Docs - Use IVSM Nov 14, 2024

chetan-hirapara added enhancement Adding graphs and improving output High Priority Work on this next labels Nov 14, 2024

chetan-hirapara changed the title ~~Chat with Docs - Use IVSM~~ Chat with Docs using Hybrid RAG - Use iVSM (CPU inferences) Nov 22, 2024

chetan-hirapara mentioned this issue Nov 28, 2024

Chat with Audio, text and pdf files #752

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chat with Docs using Hybrid RAG - Use iVSM (CPU inferences) #738

Chat with Docs using Hybrid RAG - Use iVSM (CPU inferences) #738

chetan-hirapara commented Nov 14, 2024 •

edited

Loading

Chat with Docs using Hybrid RAG - Use iVSM (CPU inferences) #738

Chat with Docs using Hybrid RAG - Use iVSM (CPU inferences) #738

Comments

chetan-hirapara commented Nov 14, 2024 • edited Loading

chetan-hirapara commented Nov 14, 2024 •

edited

Loading