Stemmer Token Filter
Original : https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-stemmer-tokenfilter.html
Translation : http://www.apache.wiki/pages/viewpage.action?pageId=10028526
Contributors : fucker, ApacheCN, Apache中文网
A filter that provides access to (almost) all of the available stemming token filters through a single unified interface. For example:
    {
      "index" : {
        "analysis" : {
          "analyzer" : {
            "my_analyzer" : {
              "tokenizer" : "standard",
              "filter" : ["standard", "lowercase", "my_stemmer"]
            }
          },
          "filter" : {
            "my_stemmer" : {
              "type" : "stemmer",
              "name" : "light_german"
            }
          }
        }
      }
    }
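The settings above can also be generated programmatically when the stemmer name varies per index. The following is a minimal sketch, not an official client API: the helper name `build_stemmer_settings` and its `pre_filters` parameter are hypothetical, and the filter and analyzer names (`my_stemmer`, `my_analyzer`) are simply copied from the example.

```python
import json


def build_stemmer_settings(stemmer_name, pre_filters=("standard", "lowercase")):
    """Return index settings shaped like the example above.

    `build_stemmer_settings` and `pre_filters` are hypothetical names used
    for illustration. The default `pre_filters` mirrors the example; adjust
    it to whatever token filters your cluster version accepts.
    """
    return {
        "index": {
            "analysis": {
                "analyzer": {
                    "my_analyzer": {
                        "tokenizer": "standard",
                        # The stemmer runs last, after the other token filters.
                        "filter": [*pre_filters, "my_stemmer"],
                    }
                },
                "filter": {
                    "my_stemmer": {"type": "stemmer", "name": stemmer_name}
                },
            }
        }
    }


settings = build_stemmer_settings("light_german")
print(json.dumps(settings, indent=2))
```

The resulting dictionary can be serialized and sent as the settings body when creating an index; how it is sent depends on the client in use.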
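The language / name parameter selects the stemmer and accepts the following values (the original documentation marks the preferred filters in bold):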
| Language | Accepted values |
| --- | --- |
| Arabic | arabic |
| Armenian | armenian |
| Basque | basque |
| Brazilian Portuguese | brazilian |
| Bulgarian | bulgarian |
| Catalan | catalan |
| Czech | czech |
| Danish | danish |
| Dutch | dutch, dutch_kp |
| English | english, light_english, minimal_english, possessive_english, porter2, lovins |
| Finnish | finnish, light_finnish |
| French | french, light_french, minimal_french |
| Galician | galician, minimal_galician (Plural step only) |
| German | german, german2, light_german, minimal_german |
| Greek | greek |
| Hindi | hindi |
| Hungarian | hungarian, light_hungarian |
| Indonesian | indonesian |
| Irish | irish |
| Italian | italian, light_italian |
| Kurdish (Sorani) | sorani |
| Latvian | latvian |
| Lithuanian | lithuanian |
| Norwegian (Bokmål) | norwegian, light_norwegian, minimal_norwegian |
| Norwegian (Nynorsk) | light_nynorsk, minimal_nynorsk |
| Portuguese | portuguese, light_portuguese, minimal_portuguese, portuguese_rslp |
| Romanian | romanian |
| Russian | russian, light_russian |
| Spanish | spanish, light_spanish |
| Swedish | swedish, light_swedish |
| Turkish | turkish |
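
Because the stemmer is chosen by a plain string, a typo in the name is easy to make. The following is a minimal sketch of a client-side check against the values listed in the table: the names `STEMMERS_BY_LANGUAGE`, `language_of`, and `stemmer_filter` are hypothetical, and the data is transcribed from the table above rather than fetched from Elasticsearch, so it may not match every version.

```python
# Hypothetical lookup table, transcribed from the table above.
STEMMERS_BY_LANGUAGE = {
    "Arabic": ["arabic"],
    "Armenian": ["armenian"],
    "Basque": ["basque"],
    "Brazilian Portuguese": ["brazilian"],
    "Bulgarian": ["bulgarian"],
    "Catalan": ["catalan"],
    "Czech": ["czech"],
    "Danish": ["danish"],
    "Dutch": ["dutch", "dutch_kp"],
    "English": ["english", "light_english", "minimal_english",
                "possessive_english", "porter2", "lovins"],
    "Finnish": ["finnish", "light_finnish"],
    "French": ["french", "light_french", "minimal_french"],
    "Galician": ["galician", "minimal_galician"],
    "German": ["german", "german2", "light_german", "minimal_german"],
    "Greek": ["greek"],
    "Hindi": ["hindi"],
    "Hungarian": ["hungarian", "light_hungarian"],
    "Indonesian": ["indonesian"],
    "Irish": ["irish"],
    "Italian": ["italian", "light_italian"],
    "Kurdish (Sorani)": ["sorani"],
    "Latvian": ["latvian"],
    "Lithuanian": ["lithuanian"],
    "Norwegian (Bokmål)": ["norwegian", "light_norwegian", "minimal_norwegian"],
    "Norwegian (Nynorsk)": ["light_nynorsk", "minimal_nynorsk"],
    "Portuguese": ["portuguese", "light_portuguese", "minimal_portuguese",
                   "portuguese_rslp"],
    "Romanian": ["romanian"],
    "Russian": ["russian", "light_russian"],
    "Spanish": ["spanish", "light_spanish"],
    "Swedish": ["swedish", "light_swedish"],
    "Turkish": ["turkish"],
}


def language_of(name):
    """Return the language whose row lists `name`, or None if it is absent."""
    for language, names in STEMMERS_BY_LANGUAGE.items():
        if name in names:
            return language
    return None


def stemmer_filter(name):
    """Build a stemmer filter definition, rejecting names not in the table."""
    if language_of(name) is None:
        raise ValueError(f"unknown stemmer name: {name!r}")
    return {"type": "stemmer", "name": name}


print(language_of("light_german"))
print(stemmer_filter("porter2"))
```

Checking the name before creating the index turns a misspelled value into an immediate local error instead of a failed request.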
