Index - Analyze - Text Analysis - ZincSearch Documentation


Analyze

Analyzes a piece of text and returns the tokens it produces.

Request example

POST /api/_analyze

Request body:

{
  "analyzer" : "standard",
  "text" : "50 first dates"
}
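As a minimal sketch, the request above could be sent from Python with only the standard library. The base URL, user name, and password below are placeholder assumptions for a local test instance and are not taken from this page; adjust them to your deployment. HTTP basic authentication is also assumed here.

```python
import base64
import json
import urllib.request

# Assumptions (not from this page): server address and credentials are
# placeholders for a local test instance.
BASE_URL = "http://localhost:4080"
USER = "admin"
PASSWORD = "change-me"


def build_analyze_request(body: dict) -> urllib.request.Request:
    """Build a POST /api/_analyze request carrying `body` as JSON."""
    credentials = base64.b64encode(f"{USER}:{PASSWORD}".encode()).decode()
    return urllib.request.Request(
        url=f"{BASE_URL}/api/_analyze",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Basic {credentials}",
        },
        method="POST",
    )


def analyze(body: dict) -> dict:
    """Send the request and decode the JSON response (needs a running server)."""
    with urllib.request.urlopen(build_analyze_request(body)) as response:
        return json.load(response)


# Building the request does not touch the network; calling analyze() does.
request = build_analyze_request({"analyzer": "standard", "text": "50 first dates"})
print(request.get_method(), request.full_url)
```

Keeping request construction separate from sending makes it possible to inspect the method, URL, and body without a running server.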

Response example

{
    "tokens": [
        {
            "end_offset": 2,
            "keyword": false,
            "position": 1,
            "start_offset": 0,
            "token": "50",
            "type": "Numeric"
        },
        {
            "end_offset": 8,
            "keyword": false,
            "position": 1,
            "start_offset": 3,
            "token": "first",
            "type": "AlphaNumeric"
        },
        {
            "end_offset": 14,
            "keyword": false,
            "position": 1,
            "start_offset": 9,
            "token": "dates",
            "type": "AlphaNumeric"
        }
    ]
}
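The offsets in the response example above can be checked against the input text. The sketch below uses a trimmed copy of that response and slices the original string with each token's start_offset and end_offset. The helper name token_spans is hypothetical. For this ASCII input, byte and character offsets coincide; this page does not say which of the two the API reports for multi-byte text.

```python
import json

text = "50 first dates"

# Trimmed copy of the response example above, keeping only the fields used here.
response = json.loads("""
{"tokens": [
  {"start_offset": 0, "end_offset": 2,  "token": "50",    "type": "Numeric"},
  {"start_offset": 3, "end_offset": 8,  "token": "first", "type": "AlphaNumeric"},
  {"start_offset": 9, "end_offset": 14, "token": "dates", "type": "AlphaNumeric"}
]}
""")


def token_spans(text: str, response: dict) -> list:
    """Pair each token with the slice of `text` its offsets point at."""
    return [
        (t["token"], text[t["start_offset"]:t["end_offset"]], t["type"])
        for t in response["tokens"]
    ]


for token, source_slice, token_type in token_spans(text, response):
    print(f"{token!r} <- {source_slice!r} ({token_type})")
# '50' <- '50' (Numeric)
# 'first' <- 'first' (AlphaNumeric)
# 'dates' <- 'dates' (AlphaNumeric)
```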

Using a specific analyzer

{
  "analyzer" : "standard",
  "text" : "50 first dates"
}

Using a specific tokenizer

{
  "tokenizer" : "standard",
  "text" : "50 first dates"
}

Using a specific tokenizer and filters

{
  "tokenizer" : "standard",
  "char_filter" : ["html"],
  "token_filter" : ["camel_case"],
  "text" : "50 first dates"
}
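The three request shapes above differ only in which fields are present. A small helper, sketched here under the hypothetical name analyze_body, can build any of them and leave out the fields that are not set. Rejecting a body that sets both analyzer and tokenizer is a choice of this sketch, not a rule stated on this page.

```python
import json
from typing import Optional, Sequence


def analyze_body(
    text: str,
    analyzer: Optional[str] = None,
    tokenizer: Optional[str] = None,
    char_filter: Sequence[str] = (),
    token_filter: Sequence[str] = (),
) -> dict:
    """Return a request body for POST /api/_analyze, omitting unset fields."""
    if analyzer is not None and tokenizer is not None:
        # Choice of this sketch, not a documented API rule.
        raise ValueError("pass either analyzer or tokenizer, not both")
    body: dict = {}
    if analyzer is not None:
        body["analyzer"] = analyzer
    if tokenizer is not None:
        body["tokenizer"] = tokenizer
    if char_filter:
        body["char_filter"] = list(char_filter)
    if token_filter:
        body["token_filter"] = list(token_filter)
    body["text"] = text
    return body


# Reproduces the tokenizer-and-filters body shown above.
custom = analyze_body(
    "50 first dates",
    tokenizer="standard",
    char_filter=["html"],
    token_filter=["camel_case"],
)
print(json.dumps(custom, indent=2))
```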

Example

[Figure 1: Analyze, text analysis]