RAG-based System Architecture

From GM-RKB
Jump to navigation Jump to search

An RAG-based System Architecture is an LLM-based system arch for a RAG-based system.



References

2023

2023

2023

  • https://medium.com/@abdullahw72/langchain-chatbot-for-multiple-pdfs-harnessing-gpt-and-free-huggingface-llm-alternatives-9a106c239975
    • QUOTE: The Langchain Chatbot for Multiple PDFs follows a modular architecture that incorporates various components to enable efficient information retrieval from PDF documents. Let’s delve into the key architectural elements:

    • SUMMARY:
      • User Interface: Allows users to input questions and view responses.
      • Natural Language Understanding (NLU): Understands and interprets user queries. Uses NLP techniques like tokenization and named entity recognition.
      • Vector Store: Stores vector representations of text chunks extracted from PDFs. Generated using embeddings.
      • Embeddings: Encode semantic information from text chunks into vectors. Can use OpenAI or Hugging Face models.
      • Large Language Models (LLMs): Provide advanced language capabilities. Fine-tuned models like those from OpenAI or Hugging Face's Instruct series.
      • Conversational Retrieval: Matches user query vectors to document vector representations to find relevant information.
      • Chat History: Stores conversation context to enable coherent, relevant responses.