...

/

Advanced RAG: Enhancing Model Efficiency and Accuracy

Advanced RAG: Enhancing Model Efficiency and Accuracy

Learn about advanced RAG with its pre-retrieval, retrieval, and post-retrieval processes.

While naive RAG provides a foundation for using LLMs with document retrieval, it has limitations. For instance, the retrieval accuracy might not be perfect, potentially leading to irrelevant information being used as context. Additionally, the LLM might struggle to integrate retrieved information seamlessly, resulting in disjointed responses. In this lesson, we'll look at advanced RAG and explore its workflow.

Improving retrieval process

Advanced RAG builds on naive RAG by introducing improvements in retrieval quality. It adds pre-retrieval and post-retrieval strategies before and after the retrieval process. Let’s look at the figure below:

Press + to interact
The high-level overview of advanced RAG workflow
1 / 13
The high-level overview of advanced RAG workflow

Pre-retrieval process

Advanced RAG leverages a two-pronged approach during the pre-retrieval process to enhance retrieval accuracy before the actual retrieval happens at inference time. Here’s a breakdown of the key techniques employed:

  • Optimizing indexing

  • Optimizing query

Optimizing indexing

The following are some important optimizing indexing techniques:

  • Enhancing data granularity (chunking strategies): Before delving into retrieval intricacies, grasp the significance of text processing strategies like chunking, which play a pivotal role in breaking down large ...