Token count

Original: https://www.elastic.co/guide/en/elasticsearch/reference/current/token-count.html

Translation: http://www.apache.wiki/pages/viewpage.action?pageId=10030674

Overview

A field of type token_count is really an integer field that accepts string values, analyzes them, and then indexes the number of tokens in the string.

Example

For instance:

PUT my_index
{
  "mappings": {
    "my_type": {
      "properties": {
        "name": { 
          "type": "text",
          "fields": {
            "length": { 
              "type":     "token_count",
              "analyzer": "standard"
            }
          }
        }
      }
    }
  }
}

PUT my_index/my_type/1
{ "name": "John Smith" }

PUT my_index/my_type/2
{ "name": "Rachel Alice Williams" }

GET my_index/_search
{
  "query": {
    "term": {
      "name.length": 3 
    }
  }
}

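The name field is an analyzed string (text) field that uses the default standard analyzer.

The name.length field is a token_count multi-field that indexes the number of tokens in the name field.

The query matches only the document containing Rachel Alice Williams, because that value contains three tokens.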

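Technically, the token_count type sums position increments rather than counting tokens. This means that even if the analyzer filters out stop words, they are still included in the count.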
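The difference between summing position increments and counting surviving tokens can be illustrated with a small toy model. This is a sketch under stated assumptions, not Elasticsearch's actual implementation: the `STOP_WORDS` set, the whitespace tokenizer, and the function names `analyze`, `token_count`, and `surviving_tokens` are all hypothetical, and trailing stop words are ignored for simplicity.

```python
# Toy model: an analyzer that drops stop words but records the gap they
# leave as a larger position increment on the next surviving token.
STOP_WORDS = {"the", "a", "an", "of"}  # hypothetical stop word list


def analyze(text):
    """Return (token, position_increment) pairs for whitespace-split text."""
    tokens = []
    increment = 1
    for word in text.lower().split():
        if word in STOP_WORDS:
            # The removed word still occupies a position.
            increment += 1
            continue
        tokens.append((word, increment))
        increment = 1
    return tokens


def token_count(text):
    """Sum position increments, as the text above describes for token_count."""
    return sum(increment for _, increment in analyze(text))


def surviving_tokens(text):
    """Count only the tokens that remain after stop word removal."""
    return len(analyze(text))


print(analyze("The Lord of the Rings"))           # [('lord', 2), ('rings', 3)]
print(token_count("The Lord of the Rings"))       # 5
print(surviving_tokens("The Lord of the Rings"))  # 2
```

In this model only two tokens survive, yet the summed position increments give 5, which matches the number of words in the original string. With an analyzer that removes nothing, such as the one in the example above, the two numbers coincide.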

Parameters

The token_count field accepts the following parameters: