Data Health Overview

The Overview (platform-ca | platform-eu | platform-au) tab of the Data Health (platform-ca | platform-eu | platform-au) page provides a breakdown of data validations performed within your organization. These validations are based on specific criteria and are categorized by rules (for example, syntax rules, commerce-related rules, etc.).

The Overview tab lets you investigate validation errors, their severity levels, and the total number of impacted events accordingly. You can also check the status of each validation criteria, regardless of whether they have passed or failed. This provides you with a broader perspective on your organization data integrity, highlighting the areas that need to be investigated or fixed.

Data health overview | Coveo

1 Get started by adjusting the date range that you would like to review.

2 Click the Category dropdown menu, and then select the option applicable to you. This feature is useful if you use multiple Coveo solutions.

3 Click the Tracking ID dropdown menu, and then select a site ID to filter the passed and failed validations.

4 Enable the Show empty groups option to view the groups that don’t have the required events to run their validation check.

5 View the Validation failure by severity section to inspect the breakdown of failed validations and their respective severity levels.

Note

The percentages shown with each severity level refer only to validations that failed.

6 View your data health score to view a glimpse of your overall data integrity.

7 Click a line under Group that you would like to investigate further, and the Show details link appears.

8 Click Show details to view recurrences of both failing and passing criteria of a specific group, and to view an event payload example along with the cause, impact, and solution for failing events.

Note

In instances when the validation rule is applied to a visit, rather than an event, the failing criteria displays the Show visit in Event Browser button instead of a payload example.

9 Select the Event browser tab for when you want to inspect a specific event.

Validation failure by severity

The Validation failure by severity section provides a snapshot of the total percentage of failed validation by severity levels. Severity levels refer to the measure of importance awarded to validation rules. Categorizing validation criteria by severity helps organize data health priorities by concentrating on the most critical issues that can impact your data.

There’s a set level of severity, ranging from critical to low, for each event type-validation criteria combination:

Severity level Description

Critical

The event is completely worthless if this validation fails.

High

An important validation that impacts core functionalities of your Coveo solutions if it fails repeatedly.

Medium

This validation impacts the overall quality of your Coveo solutions without breaking core functionalities if it fails repeatedly.

Low

This validation mostly impacts your reporting without breaking any core functionalities.

The Validation failure by severity section gives you an overall view of failed events by severity level. You can also view the severity level by individual failed validation criteria in the Show details panel.

Data health score

The Data health score section provides you with your organization data status at a glance so that you can allot the necessary time and resources to assess and fix any validation errors that may have occurred. The goal is to continuously improve your data so you reach the highest score possible. This ensures consistent accuracy of reported metrics.

The data health score encompasses the entirety of your organization data and is calculated by a weighted average. The following formula is used when computing the score for a single rule. For multiple rules (for example, global score or the score for an entire group), the same formula is used for each rule, and then the average for all the rules is computed.

1 - (severity_constant * number of failed instances / number of validated instances)

  • severity_constant refers to the weighting factor applied to specific validation criteria. The higher the validation criteria severity, the greater the constant, which in turn impacts the data health score.

  • number of failed instances refers to the number of events that didn’t meet the validation criteria.

  • number of validated instances refers to the number of instances subjected to a validation check.

Notes
  • The data health score can change from one period to the next since validation rules are dynamic; new rules may be added and existing one may be updated for different reasons, such as evolving compliance requirements, recurring data integrity issues, and more.

  • While we recommend that you strive for continuous improvement in your data health score, it’s also important to perform a data completeness check. This means ensuring that the key event totals reported by Coveo align with the totals from your internal trusted data source. The reason for this is that there may be instances where not all events are sent, and the score only reflects the quality of the events sent to Coveo, therefore a completeness check helps ensure that nothing is missing from the picture.

The calculated score therefore provides a layered assessment of data integrity, as it takes the number of failed events into account, but just as importantly, considers the severity of those events to provide a better context.

How to troubleshoot your organization’s data health

When inspecting your data health, you may come across validation criteria that show a lower data health score. Your score indicates the health of your organization data and if there are validation errors that need to be addressed. The Overview tab can be leveraged to focus on any problem areas in your data so that you can ensure that your search solution is working as it should be.

Although the score may not seem significantly low, it’s still important to assess what may be decreasing it.

Step 1: Adjust the date range

To obtain a better picture of your group scores as well as your overall score, modify the Date range after selecting the date picker. Possible options are focusing on a specific day, or even the last hour. This lets you see how the score may have changed at different periods of time.

Data health date range | Coveo
Note

The Overview tab can display data for up to a one-month period, going back from the current date.

Step 2: Identify the cause

It’s important to identify what may be causing the score to drop. You can do so by going over the different groups and clicking the ones that have a lower score and then selecting Show details.

Data health event type | Coveo
Tip

Even if the score doesn’t seem significantly low, it’s nevertheless a good idea to get more details to troubleshoot any issues to improve your overall score and maintain data integrity.

Step 3: View details

In the Show details panel, the Failing criteria tab will list the validation criteria that didn’t pass, along with the number of recurrences, the severity level, as well as the data health score.

Click the arrow at the end of a row to expand it. With the cause, impact, and solution visible, you now have a better understanding of what needs to be resolved. The panel also provides an example of the event payload that failed the validation criteria.

Data health show details | Coveo
Note

If the validation fails for a visit, the Show visit in Event Browser button appears in the Show details panel instead. For example, consider a case when there’s a validation issue with user login. Under Group, you select Visit Consistency, and then click Show details. You expand the row under Failing criteria, and there you can see the Show visit in Event Browser button instead of the payload.

Data health show details | Coveo

Step 4: Go to the Event Browser

Often times, the solution to the validation criteria error will be to use the Event browser to find the error and follow the instructions.

If you take the error shown in step 3 as an example, the Show details panel indicates that it’s a search event type.

Data health show details search | Coveo

With this information on hand, you can do the following:

  1. Go to the Event browser (platform-ca | platform-eu | platform-au).

  2. Select the Invalid option from the Status section.

  3. Select the search option from Event type section.

    • You can now filter by the client ID and expand a line to view the reason for the error.

    • After expanding a row, you can view the complete event content by clicking Show event content.

      Data health event content | Coveo
    • You can open the Visit Browser page containing that specific event by clicking Show in visit browser. The Visit Browser opens in a separate tab.

      Data health visit browser | Coveo
      Tip

      In the context of data health, the Visit Browser is useful for implementers, as it provides a detailed view of individual events within user visits. This enables the recreation of specific issues by identifying recurrences and anomalies, offering a deeper insight into potential problems.