Add or Edit a Source Using One of the Available Connectors
- Amazon S3 Source
- Box (Personal) Source
- Box Business Source
- Confluence Cloud Source
- Confluence Self-Hosted Source
- Database Source
- Dropbox (Personal) Source
- Dropbox Business Source
- Exchange Enterprise Source
- Exchange Online (Personal) Source
- File System Source
- Generic REST API Source
- Gmail for Work Source
- Gmail (Personal) Source
- Google Drive (Personal) Source
- Google Drive for Work Source
- Jira Software Cloud Source
- Jira Software Self-Hosted Source
- Jive Cloud Source
- Jive Server Source
- Lithium Source
- Microsoft Dynamics 365 Source
- OneDrive for Business Source
- OTCS Source
- Push Source
- RSS Source
- Salesforce Source
- ServiceNow Source
- SharePoint Online Source
- SharePoint Online Legacy Source
- SharePoint Server Source
- Sitecore Source
- Sitemap Source
- Twitter Source
- Web Source
- YouTube Source
- Zendesk Source
Add or Edit a SharePoint Server Source
Members of the Administrators and Content Managers built-in groups can include SharePoint on-premises content and make it searchable. This source can be shared, private, or secured (see Content Security). A SharePoint Server source is refreshed every six hours and rescanned every week to retrieve SharePoint item changes (addition, modification, or deletion). A source rescan or rebuild is necessary to capture deleted user profiles.
The SharePoint Server source allows you, as a member of the administrator or content manager group, to make SharePoint on-premises content searchable. To retrieve SharePoint Online content, you must create a SharePoint Online source (see Add/Edit SharePoint Online Source).
Depending on the content retrieval method you choose, on-premises Active Directory permission systems may be unofficially supported by Coveo Cloud or not supported at all. If you use the Crawling Module to index SharePoint Server content, a custom project by Coveo experts may allow you to index your SharePoint Server source permissions. So, if you want to create a secured source, contact the Coveo Support team for further information. With SharePoint Server sources of the On-Premises type, on-premises Active Directory permission systems are not supported.
Source Features Summary
|SharePoint version||2019, 2016, 2013, 2010, Foundation 2013, and Foundation 2010|
|Searchable content types||Sites, sub-sites, user profiles1, personal websites1, lists, list items, list item attachments, document libraries, document sets, documents, web parts2, and microblog posts and replies.|
Rescan or rebuild is required to retrieve deleted user profiles.
|Content security options||Secured||Depending on the content retrieval method you choose, on-premises Active Directory permission systems may be unofficially supported by Coveo Cloud or not supported at all. If you use the Crawling Module to index SharePoint Server content, a custom project by Coveo experts may allow you to index your SharePoint Server source permissions. Contact the Coveo Support team for further information. With SharePoint Server sources of the On-Premises type, on-premises Active Directory permission systems are not supported.|
Note 1: Not available in Microsoft SharePoint Foundation.
Note 2: Not all web parts are available in Microsoft SharePoint Foundation 2010 (see Web Parts in SharePoint Foundation).
Active Directory Federation Services
When your SharePoint environment uses ADFS as a trusted identity provider, the ADFS service endpoint URL paths must be enabled (see Enable the ADFS Service Endpoint URL Path).
SharePoint Account Permissions
When you want to include SharePoint content, you must create a specific SharePoint account to be used by the source only. Otherwise, you need to also change the source Password value each time the account password changes to prevent authentication errors (see Source Password).
Access your SharePoint tenant with an administrator account.
On your SharePoint tenant:
Select or create a user account for the source to use when retrieving your SharePoint content. Refer to the following table to identify the required type of user for your web application enabled authentication.
SharePoint environment SharePoint web application enabled authentication User type User format Classic Windows Windows account
Claims Windows Windows account ADFS ADFS SSO Okta Okta SSO
Grant appropriate SharePoint permissions to the SharePoint account to ensure it has access to the content that you want to make searchable.
The following table presents the minimal required permissions that the source account must have to perform specific actions.
Action to perform Minimal required permission Content and security indexing, source refresh, and site collection discovery
Full Read policy for each web application to make searchable (see Add the Full Read Policy to All SharePoint Tenant Web Applications).
Personal site, user profile, and social tags indexing
When including personal sites or user profiles, the account used as source credentials must not have a personal site on the SharePoint server being included to prevent failures when attempting to retrieve the list of personal sites.
Read permission for the site collection of the source URL (see Add the SharePoint Website Read Permission).
Retrieve People Data for Search Crawlers permission to the User Profile Service Application (see Add the "Retrieve People Data for Search Crawlers" Permission to the User Profile Service Application).
Add or Edit a SharePoint Server Source
Ensure your SharePoint instance meets the source requirements (see Requirements).
If not already in the Add/Edit a SharePoint Server Source panel, go to the panel:
To add a source, in the main menu, under Content, select Sources > Add source button > SharePoint. Depending on whether you need to use the Coveo On-Premises Crawling Module or not, select the option with the Crawling Module or the On-Premises tag (see Content Retrieval Methods).
To edit a source, in the main menu, under Content, select Sources, and then double-click the desired source row.
In the Configuration tab, enter appropriate values for the available parameters:
A descriptive name for your source under 255 characters (not already in use for another source in this organization).
One or more URLs corresponding to the desired site collection, lists, websites, and subsites to make searchable. Each URL must include the protocol and tenant name.
For a specific web application:
For a specific site collection:
For a specific website:
For a specific list:
A specific folder in a list is not supported.
In the drop-down menu, select the option for the content type matching the URLs you specified. By default, Web application is selected.
Available options are the following:
Value Content to make searchable Web application All site collections of the specified web application Site collection All web sites of the specified site collection Web and sub webs Only the specified web site and its sub webs (also known as subsites) List Only the specified list or document library
Paired Crawling Module
If your source is a Crawling Module source and if you have more than one Crawling Module linked to this organization, select the one with which you want to pair your source (see Deploying Multiple Crawling Modules). If you change the Crawling Module instance with which your source is paired, you must rebuild your source.
Character optical recognition (OCR)
Check this box if you want Coveo Cloud to extract text from image files or PDF files containing images (see Enable Optical Character Recognition). OCR-extracted text is processed as item data, meaning that it is fully searchable and will appear in the item Quick View (see Search Result Quick View).
When adding a source, if you have more than one logical (non-Elasticsearch) index in your organization, select the index in which the retrieved content will be stored (see Leverage Many Coveo Indexes). If your organization only has one index, this drop-down menu is not visible and you have no decision to make.
In the Authentication section, you must provide authentication information so that Coveo can access the content you want to make searchable (see Security). In the drop-down menu, select the identity provider you use to manage identities in your SharePoint site. Options are:
Windows (NTLM or Kerberos)
Simple ADFS (see Simple ADFS Identity Provider)
Chained ADFS (see Chained ADFS Identity Providers)
Depending on the option you choose in the drop-down menu, you must fill some of the following boxes:
The username of a dedicated SharePoint administrator account that has access to the content to include, or, if using Okta, the username of an Okta administrator account (see Setting up SharePoint Crawling Account Permissions).
The corresponding password.
ADFS server URL
The URL of an ADFS server trusted by SharePoint.
SharePoint trust identifier
The SharePoint server relying party trust identifier.
To find your relying party trust identifier:
Access the AD FS 2.0 Management Console (Windows Start menu > All Programs > Administrative Tools > AD FS 2.0 Management).
In AD FS 2.0 Management Console, under Trust Relationships, select Relying Party Trusts.
In the Relying Party Trusts list, find the row for SharePoint. The ADFS relying party trust identifier is the value in the Identifier column.
ADFS trust identifier
The relying party trust identifier of the ADFS server acting as an intermediate.
Identity provider server URL
The URL of the identity provider used in SharePoint to authenticate users.
You can edit the identity provider server URL in the ADFS settings (see Enable the ADFS Service Endpoint URL Path).
The SharePoint trusted identity provider realm provided in your Okta application configuration (see Using Okta as a Trusted Identity Provider).
Okta sign in URL
The URL to which users should be redirected in order to authenticate with Okta (see Using Okta as a Trusted Identity Provider).
In the Content to Include section, consider changing the default settings in this section to make additional content searchable.
User profiles: Select to include SharePoint users.
Personal sites: When the Scope is Web application, select to include SharePoint personal sites (see Scope).
In the Crawling Settings section, the Reindex all child items on UpdateShallow option allows you to reindex the children of an item that has been updated. This ensures that, if the metadata of the child items contains parent item information, this information stays up to date. However, checking this box significantly impacts the source refresh time. Therefore, if you do not check it, Coveo recommends scheduling source rescans so that the child items are eventually updated as well.
You change your SharePoint site name. In the metadata of the child items, the site name appears under
spsitename. If the box is not checked, the children are not reindexed and keep an outdated
spsitenameuntil the next source rescan or rebuild. However, if the box is checked, the children are updated along with the parent SharePoint site item.
- In the Access tab, determine whether each group and API key can view or edit the source configuration (see Understanding Resource Access):
- In the Access Level column, select View or Edit for each available group.
- On the left-hand side of the tab, if available, click Groups or API Keys to switch lists.
If you remove the Edit access level from all the groups of which you are a member, you will not be able to edit the source again after saving. Only administrators and members of other groups that have Edit access on this resource will be able to do so. To keep your ability to edit this resource, you must grant the Edit access level to at least one of your groups.
Optionally, consider editing or adding mappings (see Adding and Managing Source Mappings).
You can only manage mapping rules once you build the source (see Refresh, Rescan, or Rebuild Sources).
Complete your source addition or edition:
Click Add Source/Save when you want to save your source configuration changes without starting a build/rebuild, such as when you know you want to do other changes soon.
On the Sources page, you must click Start initial build or Start required rebuild in the source Status column to add the source content or make your changes effective, respectively.
Click Add and Build Source/Save and Rebuild Source when you are done editing the source and want to make changes effective.
Once the source is built or rebuilt, you can review its content in the Content Browser (see Inspect Items With the Content Browser).
Review your source update schedule and optionally change it so that it better fits your needs (see Edit a Source Schedule). By default, your content is refreshed every six hours and rescanned every week.