Add a Box Business source
Add a Box Business source
Members with the required privileges can add the content of users' Box enterprise accounts to a Coveo organization.
Leading practice
The number of items that a source processes per hour (crawling speed) depends on various factors, such as network bandwidth and source configuration. See About crawling speed for information on what can impact crawling speed, as well as possible solutions. |
The Box Business Legacy connector has been deprecated for security reasons. If you still have a Box Business Legacy source, it will still work, but we recommend that you create a Box Business source to replace it. The difference between the two connectors lies in how you identify your Box account when creating the source: with a Box Business Legacy, you entered a Box Enterprise ID, whereas with a Box Business source, a JSON configuration containing credentials is required. Moreover, Box Business Legacy sources provided a Coveo application API key to enter in your Box application to authorize Coveo. With the new connector, this is no longer required. |
Source key characteristics
The following table presents the main characteristics of a Box Business source.
Features | Supported | Additional information | |
---|---|---|---|
Box version |
Latest cloud version |
Following available Box releases |
|
Indexable content |
Files, folders, enterprises, users, and web links |
||
Takes place every three hours by default. A rescan or rebuild is required to:
|
|||
Content security options |
[1] |
||
Add a Box Business source
Follow the instructions below to add a Box Business source.
-
On the Sources (platform-ca | platform-eu | platform-au) page, click Add source.
-
In the Add a source of content panel, click the Box Business source tile.
-
Configure your source.
Leading practice
It’s best to create or edit your source in your sandbox organization first. Once you’ve confirmed that it indexes the desired content, you can copy your source configuration to your production organization, either with a snapshot or manually. See About non-production organizations for more information and best practices regarding sandbox organizations. |
"Configuration" tab
In the Add a Box Business source panel, the Configuration tab is selected by default. It contains your source general and authentication information, as well as other parameters.
General information
Source name
Enter a name for your source.
Leading practice
A source name can’t be modified once it’s saved, therefore be sure to use a short and descriptive name, using letters, numbers, hyphens ( |
Authorization
Provide the Box application credentials in JSON format, either by pasting the configuration in the box or clicking Choose File and selecting the file to upload.
Note
If you still have a Box Business Legacy source, an enterprise ID is required instead. In the Box Enterprise ID box, enter the unique identifier that’s displayed in your Box Enterprise Admin Console, in the Account & Billing tab. |
Optical character recognition (OCR)
If you want Coveo to extract text from image files or PDF files containing images, enable the appropriate option.
The extracted text is processed as item data, meaning that it’s fully searchable and will appear in the item Quick view. See Enable optical character recognition for details on this feature.
Project
Use the Project selector to associate your source with one or more Coveo projects.
"Users to include" section
Choose whether to index the content of all your managed Box accounts or only specific accounts. If you select Specific, enter the user email addresses corresponding to the Box accounts you want to index and make searchable.
Notes
|
"Content security" tab
Select who will be able to access the source items through a Coveo-powered search interface. For details on this parameter, see Content security.
When using the Everyone content security option, see Safely apply content filtering for information on how to ensure that your source content is safely filtered and only accessible by intended users. |
"Access" tab
In the Access tab, specify whether each group (and API key, if applicable) in your Coveo organization can view or edit the current source.
For example, when creating a new source, you could decide that members of Group A can edit its configuration, while Group B can only view it.
For more information, see Custom access level.
Completion
-
Finish adding or editing your source:
-
When you want to save your source configuration changes without starting a build/rebuild, such as when you know you want to do other changes soon, click Add source/Save.
-
When you’re done editing the source and want to make changes effective, click Add and build source/Save and rebuild source.
NoteOn the Sources (platform-ca | platform-eu | platform-au) page, you must click Launch build or Start required rebuild in the source Status column to add the source content or to make your changes effective, respectively.
Back on the Sources (platform-ca | platform-eu | platform-au) page, you can follow the progress of your source addition or modification.
Once the source is built or rebuilt, you can review its content in the Content Browser.
-
-
Once your source is done building or rebuilding, review the metadata Coveo is retrieving from your content.
-
On the Sources (platform-ca | platform-eu | platform-au) page, click your source, and then click More > View and map metadata in the Action bar.
-
If you want to use a currently not indexed metadata in a facet or result template, map it to a field.
-
Click the metadata and then, at the top right, click Add to Index.
-
In the Apply a mapping on all item types of a source panel, select the field you want to map the metadata to, or add a new field if none of the existing fields are appropriate.
Notes-
For details on configuring a new field, see Add or edit a field.
-
For advanced mapping configurations, like applying a mapping to a specific item type, see Manage mappings.
-
-
Click Apply mapping.
-
-
Depending on the source type you use, you may be able to extract additional metadata from your content. You can then map that metadata to a field, just like you did for the default metadata.
More on custom metadata extraction and indexing
Some source types let you define rules to extract metadata beyond the default metadata Coveo discovers during the initial source build.
For example:
Source type Custom metadata extraction methods Define metadata key-value pairs in the
addOrUpdate
section of thePUT
request payload used to upload push operations to an Amazon S3 file container.REST API
and
GraphQL APIIn the JSON configuration (REST API | GraphQL API) of the source, define metadata names (REST API | GraphQL API) and specify where to locate the metadata values in the JSON API response Coveo receives.
Add
<CustomField>
elements in the XML configuration. Each element defines a metadata name and the database field to use to populate the metadata with.-
Configure web scraping configurations that contain metadata extraction rules using CSS or XPath selectors.
-
Extract metadata from JSON-LD
<script>
tags.
-
Configure web scraping configurations that contain metadata extraction rules using CSS or XPath selectors.
-
Extract JSON-LD
<script>
tag metadata. -
Extract
<meta>
tag content using theIndexHtmlMetadata
JSON parameter.
Some source types automatically map metadata to default or user created fields, making the mapping process unnecessary. Some source types automatically create mappings and fields for you when you configure metadata extraction.
See your source type documentation for more details.
-
-
When you’re done reviewing and mapping metadata, return to the Sources (platform-ca | platform-eu | platform-au) page.
-
To reindex your source with your new mappings, click Launch rebuild in the source Status column.
-
Once the source is rebuilt, you can review its content in the Content Browser.
-
Add source filters
By default, Coveo indexes all items in the Box accounts that you specified in the Users to Include section. However, if you want to index only certain items or ignore unwanted items, you can add filters to your source JSON configuration.
Note
Your Box Business source indexes only certain file types, such as |
-
On the Sources (platform-ca | platform-eu | platform-au) page, click your Box source, and then click More > Edit configuration with JSON in the Action bar.
-
Add your filters in the
addressPatterns
array object, while adhering to the following:-
You must use the
printableuri
path for theexpression
parameter.The Box Business connector is designed to use the
printableuri
, instead of theuri
, to identify where an item is located when applying source filters. To get theprintableuri
for a folder or item in Box Business, see Get the Printable URI. -
For your Box Business source to be able to locate an item specified in a
printableuri
path, theaddressPatterns
array must also include separateexpression
parameter entries (inclusion filters) for each of the hierarchical elements of the item’s URL structure.
ExampleYou want to index all the items in User1’s
Content
folder and subfolders, except for theDraft
subfolder or its subfolders.The
printableuri
for theContent
folder ishttps://www.box.com/Speedbit/User1 (user1@speedbit.com)/All Files/Content
.The
printableuri
for theDraft
folder ishttps://www.box.com/Speedbit/User1 (user1@speedbit.com)/All Files/Content/Draft
In order for your Box Business source to be able to locate the
Content
andDraft
folders, a separate inclusion filter is required for each of the folders' hierarchical elements.Your
addressPatterns
array would be as follows:"addressPatterns": [ { "expression": "https://www.box.com/Speedbit", "patternType": "Wildcard", "allowed": true }, { "expression": "https://www.box.com/Speedbit/User1 (user1@speedbit.com)", "patternType": "Wildcard", "allowed": true }, { "expression": "https://www.box.com/Speedbit/User1 (user1@speedbit.com)/All Files", "patternType": "Wildcard", "allowed": true }, { "expression": "https://www.box.com/Speedbit/User1 (user1@speedbit.com)/All Files/Content*", "patternType": "Wildcard", "allowed": true }, { "expression": "https://www.box.com/Speedbit/User1 (user1@speedbit.com)/All Files/Content/Draft*", "patternType": "Wildcard", "allowed": false } ]
-
-
Once you’ve added all your source filters, click Save and rebuild source.
Get the printable URI
The Box Business connector is designed to use the printableuri
, instead of the uri
, when applying source filters.
This means that you must use the printableuri
path when specifying the expression
parameter in the addressPatterns array object.
To get the printableuri
for a folder or item in Box Business, you can either:
-
Create the printable URI using the account and item location details in Box Business.
-
Inspect the item with the Content Browser if you’ve already built the source and indexed the item.
Create the printableuri
The printable URI format for an item in Box Business is https://www.box.com/<COMPANY_NAME>/<ACCOUNT_USERNAME> (<ACCOUNT_EMAIL>)/All Files/<FOLDER>/<FILENAME>
.
To create the printable URI, replace:
-
<COMPANY_NAME>
with the name of the company account in Box Business. -
<ACCOUNT_USERNAME>
with the name of the user that owns the item or folder. -
<ACCOUNT_EMAIL>
with the email address of the user that owns the item or folder. -
<FOLDER>
with the name of the folder. -
<FILENAME>
with the item filename.
-
The printable URI of the following folder would be
https://www.box.com/Speedbit/User1 (user1@speedbit.com)/All Files/Content
:-
Company name is
Speedbit
-
Account username is
User1
-
Account email address is
user1@speedbit.com
-
Folder where the items are located is named
Content
-
-
The printable URI of the following item would be
https://www.box.com/Speedbit/User2 (user2@speedbit.com)/All Files/Tasks/Task_List.pdf
:-
Company name is
Speedbit
-
Account username is
User2
-
Account email address is
user2@speedbit.com
-
Folder where the item is located is named
Tasks
-
Item filename is
Task_List.pdf
-
Inspect the item with the Content Browser
If you’ve already built the source and indexed items, you can get the printableuri
for a given item by inspecting its properties in the Content Browser.
-
On the Content Browser (platform-ca | platform-eu | platform-au) page, use the search box and facets to locate the desired item.
-
Click the item card, and then click Properties in the Action bar.
-
In the Fields tab, use the Filter box to search for the
printableuri
property value for the item.
Safely apply content filtering
The best way to ensure that your indexed content is seen only by the intended users is to enforce content security by selecting the Same users and groups as in your content system option. Should this option be unavailable, select Specific users and groups instead.
However, if you need to configure your source so that the indexed source content is accessible to Everyone, you should adhere to the following leading practices. These practices ensure that your source content is safely filtered and only accessible by the appropriate users:
-
Configure query filters: Apply filter rules on a query pipeline to filter the source content that appears in search results when a query goes through that pipeline.
-
Use condition-based query pipeline routing: Apply a condition on a query pipeline to make sure that every query originating from a specific search hub is routed to the right query pipeline.
-
Configure the search token: Authenticate user queries via a search token that’s generated server side that enforces a specific search hub.
Following the above leading practices results in a workflow whereby the user query is authenticated server side via a search token that enforces the search hub from which the query originates. Therefore, the query can’t be modified by users or client-side code. The query then passes through a specific query pipeline based on a search hub condition, and the query results are filtered using the filter rules.
Configure query filters
Filter rules allow you to enter hidden query expressions to be added to all queries going through a given query pipeline.
They’re typically used to add a field-based expression to the constant query expression (cq
).
You apply the @objectType=="Solution"
query filter to the pipeline to which the traffic of your public support portal is directed.
As a result, the @objectType=="Solution"
query expression is added to any query sent via this support portal.
Therefore, if a user types Speedbit watch wristband
in the search box, the items returned are those that match these keywords and whose objectType
has the Solution
value.
Items matching these keywords but having a different objectType
value aren’t returned in the user’s search results.
To learn how to configure query pipeline filter rules, see Manage filter rules.
Note
You can also enforce a filter expression directly in the search token. |
Use condition-based query pipeline routing
The most recommended and flexible query pipeline routing mechanism is condition-based routing.
When using this routing mechanism, you ensure that search requests are routed to a specific query pipeline according to the search interface from which they originate, and the authentication is done server side.
To accomplish this:
-
Apply a condition to a query pipeline based on a search hub value, such as Search Hub is Community Search or Search Hub is Agent Panel. This condition ensures that all queries that originate from a specific search hub go through that query pipeline.
-
Authenticate user queries via a search token that’s generated server side and that contains the search hub parameter that you specified in the query pipeline.
Configure the search token
When using query filters to secure content, the safest way to enforce content security is to authenticate user queries using a search token that’s generated server side. For instance, when using this approach, you can enforce a search hub value in the search token. This makes every authenticated request that originates from a component use the specified search hub, and therefore be routed to the proper query pipeline. Because this configuration is stored server side and encrypted in the search token, it can’t be modified by users or client-side code.
Implementing search token authentication requires you to add server side logic to your web site or application. Therefore, the actual implementation details will vary from one project to another.
The following procedure provides general guidelines:
Note
If you’re using the Coveo In-Product Experience (IPX) feature, see Implement advanced search token authentication. |
-
Authenticate the user.
-
Call a service exposed through Coveo to request a search token for the authenticated user.
-
Specify the
userIDs
for the search token, and enforce asearchHub
parameter in the search token.
Note
You can specify other parameters in the search token, such as a query |
For more information and examples, see Search token authentication.
Required privileges
You can assign privileges to allow access to specific tools in the Coveo Administration Console. The following table indicates the privileges required to view or edit elements of the Sources (platform-ca | platform-eu | platform-au) page and associated panels. See Manage privileges and Privilege reference for more information.
Note
The Edit all privilege isn’t required to create sources. When granting privileges for the Sources domain, you can grant a group or API key the View all or Custom access level, instead of Edit all, and then select the Can Create checkbox to allow users to create sources. See Can Create ability dependence for more information. |
Actions | Service | Domain | Required access level |
---|---|---|---|
View sources, view source update schedules, and subscribe to source notifications |
Content |
Fields |
View |
Sources |
|||
Organization |
Organization |
||
Edit sources, edit source update schedules, and edit source mappings |
Organization |
Organization |
View |
Content |
Fields |
Edit |
|
Sources |
|||
View and map metadata |
Content |
Source metadata |
View |
Fields |
|||
Organization |
Organization |
||
Content |
Sources |
Edit |