RAG Tool Configuration

The RAG configuration screen is split into several tabs, or sections.

Retrieve

Knowledge sources

This is the critical setting that you are prompted to configure when you are creating a new RAG Tool. This may be one or multiple Knowledge Sources for the LLM to ground its response when answering users’ questions.

Similarity score threshold

Similarity score threshold indicates the minimum similarity score that is required for a chunk to be retrieved. A higher score (like 0.9) means that the match between chunk meaning and user input meaning must be very high. A lower score means that less relevant chunks may be retrieved. Default value is 0.75, which is a moderate similarity score.

Number of chunks to select

This parameter controls how many chunks are selected and passed to the LLM for generation if there are multiple chunks with score that fits the threshold. For example, with default Similarity score threshold value of 0.75, if there are 10 chunks with score above 0.75 and Number of chunks to select is set to 5, only 5 top chunks will be sent to the LLM for answer generation. If there are 3 chunks with score above 0.75, only these three will be passed to the LLM.

Number of added chunks

Number of added chunks specifies how many neighboring chunks should be appended to each retrieved chunk, which can be helpful when valuable context spans across neighbouring chunks. Start with the default value of 0 and adjust if needed.

Re-ranking

Re-ranking improves the accuracy of RAG Tool response by retrieving a broader range of chunks and re-assessing their relevance score. The top few chunks from the re-ranked list are then used to generate a better, more accurate answer. To do this, a re-ranking model is used.

If Re-ranking is enabled, it is recommended to set a lower similarity score threshold, because at the first step we need to grab a wider range of chunks for further re-ranking. Also, if this technique is enabled, you have an additional parameter under the Chunk limits: Max number of chunks to retrieve for re-ranking . This controls how many chunks will actually get re-ranked. Default value is 5. Other Chunk limits also remain applicable if Re-ranking is enabled.

Generate

This section contains just one parameter - the generation Prompt Template, which shapes the structure, format and tone of RAG output.

If you need to edit this Prompt Template, we recommend that you first clone the default one and make changes in it, or create a new variant of default Prompt Template. Make sure you do not remove the {context} placeholder from the Prompt Template.

Learn more about the generation Prompt Template.

Language

Multi-lingual RAG

This setting should be turned on for cases when users ask questions in languages other than that of the Knowledge Source.

The Multi-lingual RAG flow works by detecting user question language, translating the query into the language of Knowledge Source and, after response has been generated, translating it back into user’s language. This approach makes additional calls to LLM, but it significantly improves the quality of multi-lingual RAG.

Post-process

Post-processing is an additional workflow that processes RAG-generated output and generates metrics for analytical reports. Currently supported metrics include language, question topic, and answered/not answered.

The Post-processing Prompt Template provides instructions on how to extract these specific metrics from a RAG response, so normally there is no need to edit or replace it. Make sure you manually add some Topics to classify your user questions - these are configured on the RAG tool level.

UI settings

UI Settings provide additional customization options for end-user experience, such as heading and subheading and sample questions that give users a hint on what kind of questions they can ask. Here you can also enable user feedback to collect likes and dislikes. Changes that you make are immediately available in the Preview area.

Test sets

Under the Test sets tab, you can select the most relevant or frequently used Test set with test records for the RAG Tool. This speeds up testing of the RAG Tool: just pick a record and it immediately gets copied into the Preview area.

RAG Tool Configuration ​

Retrieve ​

Knowledge sources ​

Similarity score threshold ​

Number of chunks to select ​

Number of added chunks ​

Re-ranking ​

Generate ​

Language ​

Multi-lingual RAG ​

Post-process ​

UI settings ​

Test sets ​