Create and manage Relevance Generative Answering (RGA) models

This is for:

In this article

What does an RGA model do?
Prerequisites
Create an RGA model
Add a custom prompt instruction
Edit an RGA model
Delete an RGA model
Review active model information
Reference
What’s next?

Coveo Relevance Generative Answering (RGA) is a paid product extension. Contact Coveo Sales or your Account Manager to add RGA to your organization license.

Relevance Generative Answering (RGA) leverages generative AI technology to generate answers to complex natural language user queries in a Coveo-powered search interface.

To implement RGA, you must first create an RGA model.

What does an RGA model do?

When an RGA model builds, it creates embeddings for the indexed items specified in the model settings and stores the embeddings in model memory.

Note

The model is preconfigured to rebuild and update the embeddings weekly based on when the model is created. Contact your Coveo Account Manager if a different build interval is required.

By default, RGA supports content retrieval and answer generation only in English. However, Coveo offers beta support for content retrieval and answer generation in languages other than English. Learn more about multilingual content retrieval and answer generation.

Relevance Generative Answering passage embedding | Coveo

An RGA model uses a pre-trained sentence transformer language model to create the embeddings. The language model does this by capturing relationships between words, phrases, and sentences in the dataset.

An RGA model creates embeddings only for the content in an item's body, which is the content mapped to the body field in the Coveo index. For more information, see How RGA uses your content.

As shown in the following diagram, the model uses chunks to create the embeddings. Instead of creating a vector for each individual word, a vector is created for a segment of text (chunk) to increase relevance.

As shown in the following diagram, when a user enters a query in a Coveo-powered search interface, the RGA model embeds the query in the embedding vector space in its memory to find the most relevant segments of text (chunks). In the context of the RGA answer-generation flow, this is referred to as second-stage content retrieval. Only the chunks corresponding to the most relevant items identified during first-stage content retrieval (items retrieved by the Coveo search engine) are considered. The most relevant chunks are then used to generate an answer.

The embeddings that are created by the RGA model aren’t impacted by Coveo Analytics events. However, an RGA model leverages the most relevant items retrieved by your Coveo-powered search for a given user query. Therefore, by enabling an Automatic Relevance Tuning (ART) model, which learns from Coveo Analytics events, then the most relevant items, and by extension the generated answer, will be influenced by Coveo Analytics events.

Note

You can set a custom value for the maximum number of items that the RGA model considers when retrieving the most relevant segments of text (chunks). You can also modify the relevancy threshold that’s used by the RGA model to determine if a chunk is relevant enough to be considered by the RGA model for answer generation.

These are advanced model query pipeline association configurations that should be used by experienced Coveo administrators only. For more information, see RGA model association advanced configuration.

Prerequisites

You have the required privileges to create an RGA model.

The content you want to use for the model meets the item requirements and is optimized for RGA.

Note

An RGA implementation should include both an RGA model and a Semantic Encoder (SE) model. The same SE model can be used with multiple RGA models. Both the RGA and SE models must be configured to use the same content.

See RGA overview for information on how RGA and SE work together in the context of a search session to generate answers.

Keep the model embedding limits in mind when choosing the content for your model.

Create an RGA model

On the Models (platform-ca | platform-eu | platform-au) page of the Coveo Administration Console, click Add model, and then click the Relevance Generative Answering card.
Click Next.

In the Learn from section, select the content that the model will use. You can select the sources and apply additional filters using the Standard configuration, or use Advanced mode to define a custom filter expression.

You’ll lose the current mode settings when you switch between Standard and Advanced mode.

Note

See RGA overview for information on how RGA and SE work together in the context of a search session to generate answers.

The Data volume preview section shows the impact of your settings on the data that’s available to the model.

In the Standard tab:
1. In the Sources dropdown menu, select the sources that contain the items from which you want the model to learn.
  
  Note
  
  If your Coveo organization includes multiple indexes, the model can learn only from sources that are linked to the default index.
2. (Optional) In the Apply filters on dataset section, you can specify a condition to segment the content on which the model should base its training:
  
  Example
  
  You want the model to base its training only on items for which the collection field have the FAQ value.
  
  Therefore, you add a collection is equal to FAQ condition.
  1. Click Add filter(s).
  2. In the Field name input, enter the name of the field that you want to use to segment the dataset.
  3. In the Select an operator dropdown menu, select the desired operator.
  4. In the Value input, enter the value of the field on which you want to segment the dataset.
  5. Click Apply.
In the Advanced tab:
1. Enter a custom filter expression using Coveo query syntax.
2. Click Apply.

Click Next.
In the Name your model input, enter a meaningful display name for the model.
(Optional) Use the Project selector to associate your model with one or more projects.
Click Start building.
Associate the model with a pipeline to use the model in a search interface.

An effective RGA implementation relies on a process of continuous improvement that includes evaluating the generated answers and modifying the implementation based on the evaluation results.

The Coveo Knowledge Hub provides the tools you need to evaluate and improve your RGA implementation.

Add a custom prompt instruction

A prompt is an input text that’s given to a generative LLM to guide it in generating a specific output. During the answer generation process, the RGA model automatically creates a prompt and sends it to the generative LLM. The prompt includes the user query, an instruction, and the most relevant chunks of text. The RGA model's base instruction directs the LLM to generate the answer using only the chunks provided in the prompt. This grounds the LLM response to the most relevant information, which is an essential part of retrieval-augmented generation (RAG).

However, you can add an additional instruction to better guide the generative LLM based on your use case. For example, you can add a custom instruction to adjust the tone or style of the answer to better align with your company’s voice and branding. You can include a role-specific instruction to tailor the answer to specific roles or audiences, such as a sales agent or customer support representative. You can also provide restrictions so that generated answers don’t mention specific products or competitors, give financial advice, or show other sensitive information.

Your custom prompt instruction is added to the default base prompt instruction that the RGA model generates automatically. In other words, your custom instruction doesn’t replace the base instruction but enhances it.

Adding a custom prompt instruction affects all answers generated across query pipelines that use the model. While this can improve answers for certain queries, it may also cause inaccuracies, inconsistencies, or unintended results.

You’re responsible for configuring, testing, and maintaining your prompt instruction to ensure compatibility with the Coveo Platform and future updates. This includes providing role-specific guidance or exemplars for creating a prompt instruction, and monitoring outputs at scale to detect issues. The Knowledge Hub can help you evaluate and refine results.

Custom instructions aren’t covered by Coveo’s automated tests, which are limited to the default base prompt. Because the base prompt is updated frequently, you should retest your prompt instruction regularly.

Be aware of the following risks when adding a custom prompt instruction:

Semantic overload may produce inaccurate or harmful responses.
No Coveo tools are provided to validate answer quality at scale.
A custom prompt instruction may introduce security risks, such as prompt injection.

To add a custom prompt instruction for your RGA model

Note

You can add a custom prompt instruction only by editing an existing RGA model. You can’t add an instruction when creating an RGA model.

On the Models (platform-ca | platform-eu | platform-au) page, click the model you want to edit, and then click Edit in the Action bar.
On the subpage that opens, select the Configuration tab.
Under Prompt enhancement, enable Prompt instruction.

Enter your custom prompt instruction.

Notes

The following default template instruction appears in the text box when the feature is enabled. The template is a guide that you can use to create your own prompt instruction.

You are a subject matter expert representing [Enterprise Name] and must operate strictly within [Enterprise Domain]'s guidelines and applicable regulations. Your responses must reflect [Enterprise Name]'s values, tone, and content standards.
Do not offer personalized advice, opinions, or speculative commentary.
Use only factual, approved, or customer-provided content.
If a topic falls outside the approved scope, politely decline to answer or redirect to official resources.
Avoid referencing competitors, unverified tools, or external platforms unless explicitly allowed.
When uncertain, refrain from answering to avoid compliance or brand risks.
Maintain a [tone guidance — e.g., neutral, respectful, brand-aligned, professional, inclusive] tone in all responses.
Never generate or assist with content that violates safety, legal, privacy, or ethical standards.
Always prioritize [compliance/safety/accuracy/customer trust] based on [Customer Name]'s core mission.

RGA doesn’t support variables or function calls in the prompt instruction.
The custom prompt instruction is limited to 2,000 characters.

Click Save.

Note

Saving changes to the prompt instruction automatically triggers a model rebuild. Your changes take effect only after the rebuild completes and the new model version is live in your Coveo organization. Depending on the amount of data processed by your model, it may take up to 1 hour for a model to go live following a rebuild.

Disabling the prompt instruction feature and then saving the model deletes the current prompt instruction. To preserve your custom prompt instruction, make sure to copy it before disabling the feature. If the feature is re-enabled, the default template prompt instruction appears.

Edit an RGA model

On the Models (platform-ca | platform-eu | platform-au) page, click the model you want to edit, and then click Edit in the Action bar.
On the subpage that opens, select the Configuration tab.
Under Name, edit the model’s display name.
(Optional) Use the Project selector to associate your model with one or more projects.

In the Learn from section, select the content that the model will use. You can select the sources and apply additional filters using the Standard configuration or use Advanced mode to define a custom filter expression.

You’ll lose the current mode settings when you switch between Standard and Advanced mode.

Note

See RGA overview for information on how RGA and SE work together in the context of a search session to generate answers.

The Data volume preview section shows the impact of your settings on the data that’s available to the model.

In the Standard tab:
1. In the Sources dropdown menu, select the sources that contain the items from which you want the model to learn.
  
  Note
  
  If your Coveo organization includes multiple indexes, the model can learn only from sources that are linked to the default index.
2. (Optional) In the Apply filters on dataset section, you can specify a condition to segment the content on which the model should base its training:
  
  Example
  
  You want the model to base its training only on items for which the collection field have the FAQ value.
  
  Therefore, you add a collection is equal to FAQ condition.
  1. Click Add filter(s).
  2. In the Field name input, enter the name of the field that you want to use to segment the dataset.
  3. In the Select an operator dropdown menu, select the desired operator.
  4. In the Value input, enter the value of the field on which you want to segment the dataset.
  5. Click Apply.
In the Advanced tab:
1. Enter a custom filter expression using Coveo query syntax.
2. Click Apply.

In the Prompt enhancement section, you can add a custom prompt instruction for the RGA model.
Click Save.

Delete an RGA model

Note

Models are automatically dissociated from all their associated query pipelines once they’re deleted.

On the Models (platform-ca | platform-eu | platform-au) page, click the ML model that you want to delete, and then click More > Delete in the Action bar.
In the panel that appears, click Delete.

Review active model information

On the Models (platform-ca | platform-eu | platform-au) page, click the desired model (must be Active), and then click Open in the Action bar (see Reviewing model information).

Reference

Model embedding limits

The RGA model converts your content’s body text into numerical representations (vectors) in a process called embedding. It does this by breaking the text up into smaller segments called chunks, and each chunk is mapped as a distinct vector. For more information, see Embeddings.

Due to the amount of processing required for embeddings, the model is subject to the following embedding limits:

Note

For a given model, the same chunking strategy is used for all sources and item types.

Up to 15 million items or 50 million chunks

Notes

The maximum number of items depends on the item allocation of your product plan.
Coveo strongly recommends that you add a Semantic Encoder (SE) model as part of your RGA implementation. If you have more than one RGA model in your Coveo organization, each RGA model must use only the items that are used by the SE model.

1000 chunks per item

This means that for a given item, there can be a maximum of 1000 chunks. So if an item is long with a lot of text, such as more than 200,000 words or 250 pages, the model will embed the item’s text until the 1000-chunk limit is reached. The remaining text won’t be embedded and therefore won’t be used by the model.

250 words per chunk

Note

There can be an overlap of up to 10% between chunks. In other words, the last 10% of the previous chunk can be the first 10% of the next chunk.

"Status" column

On the Models (platform-ca | platform-eu | platform-au) page of the Administration Console, the Status column indicates the current state of your Coveo ML models.

The following table lists the possible model statuses and their definitions:

Status	Definition	Status icon
Active	The model is active and available.
Build in progress	The model is currently building.
Inactive	The model isn’t ready to be queried, such as when a model was recently created or the organization is offline. Click See more details for additional information (see Review model information).
Limited	Build issues exist that may affect model performance. Click See more details for additional information (see Review model information).
Soon to be archived	The model will soon be archived because it hasn’t been queried for an extended period of time. Click Delete to remove the model. Learn more about archived models.
Error	An error prevented the model from being built successfully. If it’s a temporary system error, check back soon. Otherwise, click See more details for additional information (see Review model information).
Archived	The model was archived because it hasn’t been queried for an extended period of time. Click Delete to remove the model. Learn more about archived models.

Status

Definition

Status icon

Active

The model is active and available.

Active

Build in progress

The model is currently building.

Building

Inactive

The model isn’t ready to be queried, such as when a model was recently created or the organization is offline.
Click See more details for additional information (see Review model information).

Inactive

Limited

Build issues exist that may affect model performance.
Click See more details for additional information (see Review model information).

Limited

Soon to be archived

The model will soon be archived because it hasn’t been queried for an extended period of time.
Click Delete to remove the model.
Learn more about archived models.

Archive pending

Error

An error prevented the model from being built successfully.
If it’s a temporary system error, check back soon. Otherwise, click See more details for additional information (see Review model information).

Error

Archived

The model was archived because it hasn’t been queried for an extended period of time.
Click Delete to remove the model.
Learn more about archived models.

Archived

Required privileges

By default, members with the required privileges can view and edit elements of the Models (platform-ca | platform-eu | platform-au) page.

The following table indicates the privileges required for members to manage Coveo Generic models (see Manage privileges and Privilege reference).

Action	Service - Domain	Required access level
View models	Machine Learning - Models Organization - Organization Search - Query pipelines	View
Manage models	Organization - Organization Search - Query pipelines	View
Machine Learning - Models	Edit
Machine Learning - Allow content preview	Enable
Content - Sources	View All
Content - Fields	View

Action

Service - Domain

Required access level

View models

Machine Learning - Models
Organization - Organization
Search - Query pipelines

View

Manage models

Organization - Organization
Search - Query pipelines

View

Machine Learning - Models

Edit

Machine Learning - Allow content preview

Enable

Content - Sources

View All

Content - Fields

View

What’s next?

Associate the RGA model with a query pipeline.