Indexing Pipeline Extension Supported Character Sets

The Coveo indexing pipeline extensions support the following character sets:

  • BIG5

  • CP437, CP850, CP874, CP1250, CP1251, CP1252, CP1253, CP1254, CP1255, CP1256, CP1257, CP1258

  • EUC_JP, EUC_KR

  • GBK

  • HZ

  • ISO_2022_JP, ISO_2022_KR, ISO_2022_CN

  • ISO_8859_1, ISO_8859_2, ISO_8859_3, ISO_8859_4, ISO_8859_5, ISO_8859_6, ISO_8859_7, ISO_8859_8, ISO_8859_9, ISO_8859_10, ISO_8859_13, ISO_8859_14, ISO_8859_15, ISO_8859_16

  • KOI8_R, KOI8_U

  • MAC

  • PDF_DOC_ENCODING, PDF_STD_ENCODING, PDF_WINANSI_ENCODING

  • SHIFT_JIS

  • US_ASCII

  • UTF8, UTF16LE, UTF16BE, UTF32LE, UTF32BE