Supported languages

The Coveo Platform can index content written in 57 languages. Coveo Machine Learning (Coveo ML) features and Coveo JavaScript Search Framework user interfaces also support a wide array of languages.

The following sections present the features supported by each language:

Index features support

Notes
  • Coveo can index content in languages other than those listed below as long as:

    • The language uses spaces to separate words.

    • The item encodes characters in Unicode.

    For languages meeting these requirements, most language features are supported, except language detection and stemming.

  • Since result ranking is partially based on summarization and key-concept extraction techniques, English, French, German, and Spanish queries return the most relevant results consistently.

Language Locale Supported index features[1]

English

en

All[2]

French

fr

All[2]

German

de

All[2]

Spanish

es

All[2]

Danish

da

All

Dutch

nl

All

Finnish

fi

All

Hungarian

hu

All

Italian

it

All

Norwegian

no

All

Portuguese

pt

All

Swedish

sv

All

Turkish

tr

All

Catalan

ca

All

Romanian

ro

All

Valencian

ca

All

Armenian

hy

All

Russian

ru

All

Chinese (traditional and simplified)[3]

zh

All except Stemming

Greek

el

All except Stemming

Hindi

hi

All

Japanese[3]

ja

All except Stemming

Korean[3]

ko

All except Stemming

Thai

th

All except Stemming

Arabic

ar

All except Stemming

Basque

eu

All except Stemming

Lithuanian

lt

All except Stemming

Czech

cs

All except Stemming

Indonesian

id

All except Stemming

Polish

pl

All except Stemming

Albanian

sq

All except Stemming

Afrikaans

af

All except Stemming

Belarusian

be

All except Stemming

Bulgarian

bg

All except Stemming

Burmese

my

All except Stemming

Croatian

hr

All except Stemming

Esperanto

eo

All except Stemming

Estonian

et

All except Stemming

Filipino

fil

All except Stemming

Hebrew

he

All except Stemming

Icelandic

is

All except Stemming

Kazakh

kk

All except Stemming

Latvian

lv

All except Stemming

Macedonian

mk

All except Stemming

Malay

ms

All except Stemming

Moldovan

ro

All except Stemming

Mongolian

mn

All except Stemming

Norwegian Bokmål

nb

All except Stemming

Persian

fa

All except Stemming

Serbian

sr

All except Stemming

Slovak

sk

All except Stemming

Slovenian

sl

All except Stemming

Swahili

sw

All except Stemming

Tagalog

tl

All except Stemming

Ukrainian

uk

All except Stemming

Uzbek

uz

All except Stemming

Vietnamese

vi

All except Stemming and Did you mean

Machine learning features support

The languages supported by Coveo ML vary depending on the model type.

Note

ART, QS, DNE, and CR models can support content in languages other than those listed in the following table as long as the language uses spaces to separate words. For languages that meet this requirement, most language features are supported, except language detection and stemming.

Available ML models:

Language Locale ART, QS, DNE, and CR Smart Snippets and CC

English

en

check

check

French

fr

check

x

German

de

check

x

Spanish

es

check

x

Danish

da

check

x

Dutch

nl

check

x

Finnish

fi

check

x

Hungarian

hu

check

x

Italian

it

check

x

Norwegian

no

check

x

Portuguese

pt

check

x

Swedish

sv

check

x

Turkish

tr

check

x

Catalan

ca

check

x

Romanian

ro

check

x

Valencian

ca

check

x

Armenian

hy

check[4]

x

Russian

ru

check

x

Chinese (traditional and simplified)

zh

check

x

Greek

el

check

x

Hindi

hi

check

x

Japanese

ja

check

x

Korean

ko

check

x

Thai

th

check

x

Arabic

ar

check

x

Basque

eu

check

x

Lithuanian

lt

check

x

Czech

cs

check[4]

x

Indonesian

id

check

x

Polish

pl

check[4]

x

Albanian

sq

check[4]

x

Afrikaans

af

check[4]

x

Belarusian

be

check[4]

x

Bulgarian

bg

check[4]

x

Burmese

my

check[4]

x

Croatian

hr

check[4]

x

Esperanto

eo

check[4]

x

Estonian

et

check[4]

x

Filipino

fil

check[4]

x

Hebrew

he

check[4]

x

Icelandic

is

check[4]

x

Kazakh

kk

check[4]

x

Latvian

lv

check[4]

x

Macedonian

mk

check[4]

x

Malay

ms

check[4]

x

Moldovan

ro

check[4]

x

Mongolian

mn

check[4]

x

Norwegian Bokmål

nb

check[4]

x

Persian

fa

check[4]

x

Serbian

sr

check[4]

x

Slovak

sk

check[4]

x

Slovenian

sl

check[4]

x

Swahili

sw

check[4]

x

Tagalog

tl

check[4]

x

Ukrainian

uk

check[4]

x

Uzbek

uz

check[4]

x

Vietnamese

vi

check[4]

x

JavaScript Search Framework support

Language Locale Search UI support

English

en

check

French

fr

check

German

de

check

Spanish

es

check

Danish

da

check

Dutch

nl

check

Finnish

fi

check

Hungarian

hu

check

Italian

it

check

Norwegian

no

check

Portuguese

pt

check

Swedish

sv

check

Turkish

tr

check

Catalan

ca

x

Romanian

ro

x

Valencian

ca

x

Armenian

hy

x

Russian

ru

check

Chinese (traditional and simplified)

zh

check

Greek

el

check

Hindi

hi

x

Japanese

ja

check

Korean

ko

check

Thai

th

check

Arabic

ar

x

Basque

eu

x

Lithuanian

lt

x

Czech

cs

check

Indonesian

id

check

Polish

pl

check

Albanian

sq

x

Afrikaans

af

x

Belarusian

be

x

Bulgarian

bg

x

Burmese

my

x

Croatian

hr

x

Esperanto

eo

x

Estonian

et

x

Filipino

fil

x

Hebrew

he

x

Icelandic

is

x

Kazakh

kk

x

Latvian

lv

x

Macedonian

mk

x

Malay

ms

x

Moldovan

ro

x

Mongolian

mn

x

Norwegian Bokmål

nb

x

Persian

fa

x

Serbian

sr

x

Slovak

sk

x

Slovenian

sl

x

Swahili

sw

x

Tagalog

tl

x

Ukrainian

uk

x

Uzbek

uz

x

Vietnamese

vi

x


1. Encoding, Excerpt, Language detection, Thesaurus, Stemming, and Did you mean.
2. The index can also generate item summaries.
3. A specialized tokenizer based on dictionaries is used to split CJK characters into words, which can impact search results relevancy.
4. Language detection and stemming are not supported.