Add or Edit an RSS Source

Members of the Administrators and Content Managers built-in groups can add the content of an RSS feed to a Coveo Cloud organization. RSS feeds allow you to stay informed by receiving the latest content from websites that interest you.

Coveo indexes RSS feed content to make it searchable by all users of the Coveo Cloud organization, or only by the source creator. By default, an RSS source starts a refresh every hour to index RSS feed item changes (additions or modifications). A refresh doesn’t detect deleted items. To account for them, you must rebuild or rescan the source (see Edit a Source Schedule).

The availability of old and new RSS feed items depends on the RSS feed configuration contained in an XML file.

  • If the XML feed file contains OpenSearch information, your RSS source uses it to include all available items in the feed as far in the past as possible (see OpenSearch).

  • If the XML feed file doesn’t contain OpenSearch information, the RSS source can’t retrieve older items that are no longer listed in the feed. It indexes only the items currently in the feed and those published afterward.

  • The incremental indexing process of RSS sources keeps items in your source even after they drop out of the RSS feed as new items are published. However, these dropped items are deleted from the source when you rebuild or rescan it (see Adding and Managing Your Coveo Cloud Organization Sources).
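
To see which of the cases above applies to a given feed, you can inspect its XML yourself. The following is a minimal sketch, assuming that "OpenSearch information" means elements in the OpenSearch 1.1 namespace. The document doesn't specify how the RSS source detects this information, so `has_opensearch_info` and the sample feed are illustrative only.

```python
import xml.etree.ElementTree as ET

# OpenSearch 1.1 namespace (assumption: this is what the feed would declare).
OPENSEARCH_NS = "http://a9.com/-/spec/opensearch/1.1/"


def has_opensearch_info(feed_xml):
    """Return True if any element of the feed is in the OpenSearch namespace."""
    root = ET.fromstring(feed_xml)
    prefix = "{" + OPENSEARCH_NS + "}"
    return any(
        isinstance(el.tag, str) and el.tag.startswith(prefix)
        for el in root.iter()
    )


# Hypothetical sample feeds, for illustration only.
FEED_WITH_OPENSEARCH = """<rss version="2.0"
     xmlns:opensearch="http://a9.com/-/spec/opensearch/1.1/">
  <channel>
    <title>Example feed</title>
    <opensearch:totalResults>250</opensearch:totalResults>
    <opensearch:startIndex>1</opensearch:startIndex>
    <opensearch:itemsPerPage>100</opensearch:itemsPerPage>
    <item><title>Article A</title></item>
  </channel>
</rss>"""

FEED_WITHOUT_OPENSEARCH = """<rss version="2.0">
  <channel>
    <title>Example feed</title>
    <item><title>Article A</title></item>
  </channel>
</rss>"""

print(has_opensearch_info(FEED_WITH_OPENSEARCH))
print(has_opensearch_info(FEED_WITHOUT_OPENSEARCH))
```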

Example: A technology website’s RSS feed is configured to provide only the last 100 articles. Earlier articles aren’t accessible. When you create, rebuild, or rescan your RSS source, you get the last 100 articles available on the site at that time. The next day, five articles are published, so your source ends up containing 105 articles. Six months later, it may contain a thousand articles, as long as you haven’t rebuilt or rescanned the source. Once you do, it contains only the last 100 articles again.
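
The arithmetic in this example can be reproduced with a small simulation. This is only a sketch of the behavior described above, not of how the source is implemented: the `FeedSimulation` class, its method names, and the publishing pace of five articles per refresh are all assumptions made for illustration.

```python
from collections import deque

FEED_WINDOW = 100  # the feed exposes only its latest 100 articles


class FeedSimulation:
    """Toy model of a sliding-window feed and a source that indexes it."""

    def __init__(self):
        self.feed = deque(maxlen=FEED_WINDOW)  # older articles fall out
        self.source = set()
        self.next_id = 0

    def publish(self, count):
        for _ in range(count):
            self.feed.append(self.next_id)
            self.next_id += 1

    def refresh(self):
        # Incremental update: add what the feed currently lists, delete nothing.
        self.source |= set(self.feed)

    def rescan(self):
        # Rescan or rebuild: the source ends up matching the feed exactly.
        self.source = set(self.feed)


sim = FeedSimulation()
sim.publish(300)              # the site already has 300 published articles
sim.rescan()                  # initial build sees only the feed window
after_build = len(sim.source)

sim.publish(5)                # the next day, five new articles
sim.refresh()
after_next_day = len(sim.source)

for _ in range(179):          # 179 more batches of 5 = 895 articles
    sim.publish(5)
    sim.refresh()
after_six_months = len(sim.source)

sim.rescan()                  # back to the feed window
after_rescan = len(sim.source)

print(after_build, after_next_day, after_six_months, after_rescan)
# prints: 100 105 1000 100
```

Note that the simulated refresh runs after every small batch. If more than 100 articles were published between two refreshes, the oldest of them would leave the feed before being indexed.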

Source Features Summary

| Features | Supported | Additional information |
| --- | --- | --- |
| RSS feeds version | RSS 1.0 and 2.0, and Atom 1.0 | |
| Searchable content types | RSS feeds (or channels) and RSS items | |
| Content update | Refresh | The source must know the last update time of each RSS feed item. Otherwise, the source sets the default min value and you need to perform a source rescan or rebuild to update changes on items. Depending on the RSS feed format, the following property must be defined for each item: Atom 1.0: `<updated>`; RSS 2.0: `<a10:updated>` |
| | Rescan | |
| | Rebuild | |
| Content security options | Determined by source permissions | |
| | Source creator | |
| | Everyone | |
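
Before relying on refreshes, you may want to check that each item in your feed carries the update-time property listed above. The following is a minimal sketch under stated assumptions: the `a10` prefix is taken to map to the Atom namespace (`http://www.w3.org/2005/Atom`), only Atom 1.0 and RSS 2.0 feeds are handled, and `items_missing_updated` is a hypothetical helper, not part of any Coveo tool.

```python
import xml.etree.ElementTree as ET

# Assumption: <a10:updated> in RSS 2.0 refers to the Atom namespace.
ATOM_NS = "http://www.w3.org/2005/Atom"
ATOM = "{" + ATOM_NS + "}"


def items_missing_updated(feed_xml):
    """Return the titles of feed items that have no update-time element."""
    root = ET.fromstring(feed_xml)
    if root.tag == ATOM + "feed":
        # Atom 1.0: entries and their titles live in the Atom namespace.
        items = root.findall(ATOM + "entry")
        title_tag = ATOM + "title"
    else:
        # RSS 2.0: items are un-namespaced elements under <channel>.
        items = list(root.iter("item"))
        title_tag = "title"
    missing = []
    for item in items:
        if item.find(ATOM + "updated") is None:
            missing.append(item.findtext(title_tag, default="(untitled)"))
    return missing


# Hypothetical sample feeds, for illustration only.
RSS_SAMPLE = """<rss version="2.0" xmlns:a10="http://www.w3.org/2005/Atom">
  <channel>
    <title>Example feed</title>
    <item>
      <title>First</title>
      <a10:updated>2024-01-01T00:00:00Z</a10:updated>
    </item>
    <item>
      <title>Second</title>
    </item>
  </channel>
</rss>"""

ATOM_SAMPLE = """<feed xmlns="http://www.w3.org/2005/Atom">
  <title>Example feed</title>
  <entry>
    <title>One</title>
    <updated>2024-01-01T00:00:00Z</updated>
  </entry>
</feed>"""

print(items_missing_updated(RSS_SAMPLE))   # prints: ['Second']
print(items_missing_updated(ATOM_SAMPLE))  # prints: []
```

Any title this check reports corresponds to an item whose changes would only be picked up by a rescan or rebuild, according to the table above.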

Add or Edit an RSS Source

  1. If not already in the Add/Edit an RSS Source panel, access the panel:

    • To add a source, in the main menu, under Content, select Sources > Add Source button > RSS.

      OR

    • To edit a source, in the main menu, under Content, select Sources > source row > Edit in the Action bar.

  2. In the Configuration tab, enter appropriate values for the available parameters:

    • Source name

      A descriptive name for your source under 255 characters (not already in use for another source in this organization).

      Example: Corporate-RSS-Feeds

    • Feed URL

      The web address of the RSS feed that you want to index, in the `file:///`, `http://`, or `https://` form.

      Ensure that your RSS feed is in one of the following supported formats:

      • RSS 1.0

      • RSS 2.0

      • Atom 1.0

      Example: http://rss.cnn.com/rss/cnn_tech.rss

      As you start typing the first RSS feed address, another field appears below the one you’re typing into, so you can add more than one RSS feed address to your RSS source.

    • Character optical recognition (OCR)

      Check this box if you want Coveo Cloud to extract text from image files or PDF files containing images (see Enable Optical Character Recognition). OCR-extracted text is processed as item data, meaning that it’s fully searchable and will appear in the item Quick View (see Search Result Quick View).

      Since the OCR feature is available at an extra charge, you must first contact Coveo Sales to add this feature to your organization license. You can then enable it for your source.

    • Index

      When adding a source, if you have more than one logical (non-Elasticsearch) index in your organization, select the index in which the retrieved content will be stored (see Leverage Many Coveo Indexes). If your organization only has one index, this drop-down menu isn’t visible and you have no decision to make.

      • To add a source that stores content in an index other than the default one, you need the View access level on the Logical Index domain (see Privilege Management and Logical Indexes Domain).

      • Once the source is added, you can’t switch to a different index.

  3. In the Authentication section, if the RSS feed or the linked items require authentication, fill in the following parameters:

    • Username

      The username on the RSS feed website(s) that has access to the RSS feed(s) content that you want to index.

    • Password

      The corresponding password.

  4. In the Content Security tab, select who will be able to access the source items through a Coveo-powered search interface. For details on this parameter, see Content Security.

  5. In the Access tab, determine whether each group and API key can view or edit the source configuration (see Understanding Resource Access):

    1. In the Access Level column, select View or Edit for each available group.

    2. On the left-hand side of the tab, if available, click Groups or API Keys to switch lists.

    If you remove the Edit access level from all the groups of which you’re a member, you won’t be able to edit the source again after saving. Only administrators and members of other groups that have Edit access on this resource will be able to do so. To keep your ability to edit this resource, you must grant the Edit access level to at least one of your groups.

  6. Optionally, consider editing or adding mappings (see Adding and Managing Source Mappings).

    You can only manage mapping rules once you build the source (see Refresh, Rescan, or Rebuild Sources).

  7. Finish adding or editing your source:

    • Click Add Source/Save when you want to save your source configuration changes without starting a build/rebuild, for example, when you expect to make other changes soon.

      On the Sources page, you must click Start initial build or Start required rebuild in the source Status column to add the source content or make your changes effective, respectively.

      OR

    • Click Add and Build Source/Save and Rebuild Source when you’re done editing the source and want to make changes effective.

      Back on the Sources page, you can review the progress of your RSS source addition or modification (see Adding and Managing Sources).

    Once the source is built or rebuilt, you can review its content in the Content Browser (see Inspect Items With the Content Browser).
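
If you prepare source names and feed addresses in advance, the constraints stated in step 2 can be checked locally before you type them into the panel. The following is a minimal sketch: `validate_source_config` is a hypothetical helper, not a Coveo API, and it checks only the two rules given above (a name under 255 characters, and a feed address in the `file:///`, `http://`, or `https://` form). It doesn't check name uniqueness or the feed format.

```python
from urllib.parse import urlparse

ALLOWED_SCHEMES = {"file", "http", "https"}
MAX_NAME_LENGTH = 255  # the source name must be under 255 characters


def validate_source_config(name, feed_urls):
    """Return a list of human-readable problems; an empty list means none found."""
    problems = []
    if not name:
        problems.append("Source name is empty.")
    elif len(name) >= MAX_NAME_LENGTH:
        problems.append("Source name must be under 255 characters.")
    if not feed_urls:
        problems.append("At least one feed URL is required.")
    for url in feed_urls:
        scheme = urlparse(url).scheme  # urlparse lowercases the scheme
        if scheme not in ALLOWED_SCHEMES:
            problems.append("Unsupported or missing URL scheme: " + url)
    return problems


# Values taken from the examples in step 2.
print(validate_source_config(
    "Corporate-RSS-Feeds",
    ["http://rss.cnn.com/rss/cnn_tech.rss"],
))
# prints: []

# A name that is too long and an address with an unsupported scheme.
print(len(validate_source_config("x" * 255, ["ftp://example.com/feed.xml"])))
# prints: 2
```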

What’s Next?

Review your source update schedule and optionally change it so that it better fits your needs (see Edit a Source Schedule). By default, your content is refreshed every hour.
