Technical principles for implementing RAG based on LangChain

Written by
Clara Bennett
Updated on: June 21, 2025
Recommendation

An in-depth look at how RAG works and how it helps optimize large AI models.

Core content:
1. What RAG is and how it mitigates the shortcomings of large AI models
2. An overview and comparison of common RAG frameworks
3. A step-by-step walkthrough of implementing RAG with LangChain and evaluating the results


Earlier, we introduced terms related to large models, such as AGI, RAG, and LLM. We also covered some shortcomings of large AI models at the current stage, such as the demands on training-data quantity and quality, the cost of compute and electricity, and the biggest problem of all: hallucination.

At present, the industry generally relies on RAG (Retrieval-Augmented Generation) to counter hallucination. In simple terms, it pairs a large model with a knowledge base: relevant fragments are first retrieved from the knowledge base, and the final answer is then generated from those fragments.

This approach has two advantages: it alleviates the hallucination problem and improves result quality in specific domains, and it makes information retrieval and answer generation more efficient, which in turn improves the user experience.

The various chatbots we use every day are built on this principle. Common frameworks in this space include:

  • LangChain: An open-source framework that provides rich components and tools for building RAG systems.
  • LlamaIndex: An open-source data framework focused on indexing external data and retrieving it for LLM applications.
  • RAGFlow: A newer RAG framework that emphasizes simplicity and efficiency, shipping with preset components and workflows.
  • Haystack: A widely used open-source framework that provides vector stores, retrievers, and pipeline orchestration, all key building blocks of a RAG system.
  • GraphRAG: A graph-based approach to RAG that builds a knowledge graph from the source documents to improve retrieval on complex, multi-hop questions.


Taking LangChain as an example, mitigating hallucination with RAG comes down to the following seven key steps (two code sketches follow the list):

  • Upload documents: Users upload the relevant knowledge documents. LangChain's document loaders support many formats (txt, pdf, docx, and more) and parse them into a common document representation for downstream processing.
  • Text segmentation: To make long texts easier to analyze and process, they are split into many small chunks, somewhat like TCP segmenting data before transmission and checking for lost packets.
  • Text vectorization: The chunks are converted into vectors with an embedding model and stored in a vector database.
  • Question vectorization: The user's question is converted into a vector with the same embedding model.
  • Semantic retrieval and matching: The question vector is matched against the chunk vectors in the vector store, and the top k most similar chunks (with k defined by the retrieval rule) are returned.
  • Submit the prompt to the LLM: The matched chunks and the user's question are filled into a predefined prompt template and submitted to the LLM.
  • Generate the final answer: The LLM generates the answer and returns it to the user.
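
To make steps 1 through 3 concrete, here is a minimal ingestion sketch in Python. It assumes the langchain/langchain_community package layout and a local FAISS index; the file path and embedding model name are placeholders, and import paths may vary across LangChain versions.

```python
# Ingestion: load -> split -> embed -> store (steps 1-3).
from langchain_community.document_loaders import PyPDFLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain_community.embeddings import HuggingFaceEmbeddings
from langchain_community.vectorstores import FAISS

# Step 1: load the uploaded document ("knowledge.pdf" is a placeholder path).
docs = PyPDFLoader("knowledge.pdf").load()

# Step 2: split long text into overlapping chunks so each chunk fits the
# embedding model's input window and retrieval stays precise.
splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50)
chunks = splitter.split_documents(docs)

# Step 3: embed every chunk and persist the vectors in a FAISS index.
embeddings = HuggingFaceEmbeddings(
    model_name="sentence-transformers/all-MiniLM-L6-v2"  # placeholder model
)
vector_store = FAISS.from_documents(chunks, embeddings)
vector_store.save_local("faiss_index")  # reusable local index
```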
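
And a matching query-time sketch for steps 4 through 7. It assumes an OpenAI-compatible chat model; the model name, the value of k, and the prompt wording are illustrative choices, not anything LangChain prescribes.

```python
# Query time: embed question -> retrieve top-k -> prompt -> answer (steps 4-7).
from langchain.prompts import ChatPromptTemplate
from langchain_openai import ChatOpenAI

# Steps 4-5: the retriever embeds the question with the same embedding model
# and returns the k most similar chunks from the vector store.
retriever = vector_store.as_retriever(search_kwargs={"k": 4})
question = "What does the knowledge base say about X?"
matched = retriever.invoke(question)  # get_relevant_documents() on older versions
context = "\n\n".join(doc.page_content for doc in matched)

# Step 6: a prompt template that grounds the model in the retrieved text.
prompt = ChatPromptTemplate.from_template(
    "Answer ONLY from the context below. If the answer is not in the context, "
    "say you don't know.\n\nContext:\n{context}\n\nQuestion: {question}"
)

# Step 7: the LLM generates the final answer and returns it to the user.
llm = ChatOpenAI(model="gpt-4o-mini", temperature=0)  # placeholder model name
answer = llm.invoke(prompt.format_messages(context=context, question=question))
print(answer.content)
```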

To summarize with a picture, the principle of implementing RAG capabilities on top of LangChain is as follows:

At this point, we have a ChatBot that uses RAG to alleviate large-model hallucination. The next step is to evaluate how well it works.

Please note that the above alone does not guarantee a ChatBot with highly accurate answers; getting a model that matches our expectations takes continuous, extensive iteration and tuning. This is also a microcosm of the large-model training process.


To evaluate the model, we first need to define evaluation criteria. These should be derived from the goals we set for the model and from the actual business scenarios, and then applied to the answers the model produces. Following general testing practice, we build an evaluation set (the counterpart of test cases in software engineering) and run the evaluation against it.

The evaluation set needs to verify the following requirements (a minimal evaluation-loop sketch follows the list):

  • The system understands user questions.
  • It matches the correct knowledge-base content.
  • Its answers, based on the user question and the matched content, are comprehensive and accurate.
  • Whether the final answer contains information beyond the knowledge base (in short, whether the system draws on outside or Internet knowledge when it should not).
  • Whether the final answer is stable (compare outputs across multiple evaluation rounds and check that they do not diverge too much).
  • It supports context-aware follow-up questions, and the answers to follow-ups must also meet the requirements above.
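
As a minimal illustration of running such an evaluation set (plain Python, not LangChain's own evaluation tooling), the sketch below replays each case through a hypothetical `rag_answer` helper built from the query-time code above and applies a crude keyword check; a real evaluation would use stronger accuracy and stability metrics across multiple rounds.

```python
# Evaluation-loop sketch: replay each case and apply a simple pass/fail check.
eval_set = [  # hypothetical cases; build yours from real business scenarios
    {"question": "What is RAG?", "must_contain": ["retrieval", "generation"]},
    {"question": "Which formats can be uploaded?", "must_contain": ["pdf"]},
]

def rag_answer(question: str) -> str:
    """Hypothetical helper wrapping query-time steps 4-7 from the sketch above."""
    docs = retriever.invoke(question)
    context = "\n\n".join(d.page_content for d in docs)
    msgs = prompt.format_messages(context=context, question=question)
    return llm.invoke(msgs).content

passed = 0
for case in eval_set:
    answer = rag_answer(case["question"]).lower()
    # Crude accuracy check: every required key phrase must appear in the answer.
    if all(kw in answer for kw in case["must_contain"]):
        passed += 1
print(f"{passed}/{len(eval_set)} cases passed")
```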

Many factors affect the quality of a large model's output. This continuous train-and-evaluate loop is precisely the process of tuning the model; only a model that has passed multiple rounds of evaluation and met the bar should be applied in real work scenarios.