Indexing Website Content
Step 1: Getting Started With Coveo Cloud
For first time users, learn how to log in to the Coveo Platform.
Get an OAuth2 token. You will use it later in this tutorial to authenticate some of the REST API calls you’re going to make.
Using the access token you got in step 2, create a trial organization.
Using the identity registered in step 1, log in to the Coveo Platform.
You can review your organization license limits in the Coveo Platform (see Review Organization Settings and Limits).
Step 2: Creating and Configuring a Source to Index a Website
Coveo provides many out-of-the-box connectors designed to access and index website content. Connectors may be system-specific or generic.
A corporate website may be generated and managed through a Content Management System. Coveo provides CMS-specific connectors. However, you can also select a more generic connector when creating a source to index content from a website, or from its underlying CMS database (e.g., a Web, Sitemap, or Database connector). For the comprehensive list of connection options available, see Connector Types.
The following table summarizes Coveo connection options for website content. You can click a given connector name in the table for more details regarding its features, including content security type support and instructions on how to create a source.
|System used||Available connectors|
|Sitecore||Coveo has a system-specific integration for Sitecore websites (see Coveo for Sitecore).|
|Other systems (Adobe Experience Manager, WordPress, Acquia, Episerver, etc.)||Use the Sitemap Source if your website includes a Sitemap file or a Sitemaps index file.|
|Use the Website Source to crawl your website as do search engines such as Google.|
|Use the Database Source if you can access the underlying database of your CMS and you know its schema.|
|Use the Generic REST API Source to get content from a remote repository exposing its data through a REST API.|
|Use the Push Source for situations where you have to resort to having a developer create a custom crawler and push the collected content to your Coveo organization (e.g., for an on-premises content management system developed in-house).|
Advanced Indexing Options - Adding an Indexing Pipeline Extension
The Coveo Cloud indexing pipeline is the process each item goes through when indexed. At this stage, you might want to explore how you can customize this process by adding an extension. (see Indexing Pipeline Extension Overview)
Step 3: Review and Inspect Your Indexed Content
The Content Browser is a basic, non-configurable, demo search interface in Coveo Platform to help you navigate and inspect your organization sources content.
For instructions on accessing the Content Browser and making use of its many features, see Inspect Items With the Content Browser.
Step 4: Adding Fields and Mapping Metadata
Coveo organization sources come with a set of standard system fields. However, adding your own fields allows the end user to obtain additional information in search results and to better target desired content (see Field Uses).
To add a field and its associated mapping for your source
After the index rebuild, return to the Content Browser page to review the changes to your indexed items.
Step 5: (Optional) Creating a Customizable Demo Search Page
You should now proceed to Integrating a Search Interface into Your Website.