Hunspell Token Filter · my-elasticsearch-cn

# Hunspell Token Filter(Hunspell 詞元過濾器) ## 簡述 Hunspell過濾器是Hunspell的基礎。Hunspell字典將從文件系統上的專用hunspell目錄（?`<path.conf>/hunspell`?）中`<path.conf>/hunspell`?。?預期每個字典都有自己的目錄，以其關聯的語言環境（語言）命名。?這個字典目錄預計會保存一個`*.aff`和一個或多個`*.dic`文件（所有這些文件將自動被讀取）。?例如，假設使用默認的hunspell位置，以下目錄布局將定義`en_US`字典： | `- conf` `|-- hunspell` `|??? |-- en_US` `|??? |??? |-- en_US.dic` `|??? |??? |-- en_US.aff` | ## 參數每個字典都可以進行一個設置： ignore_case 如果為true，字典匹配將不區分大小寫（默認為false ）這個設置可以在`elasticsearch.yml`使用全局`elasticsearch.yml` * `indices.analysis.hunspell.dictionary.ignore_case` 或指定特定字典： * `indices.analysis.hunspell.dictionary.en_US.ignore_case`?。還可以在保存這些設置的`settings.yml`目錄下添加`settings.yml`文件（這將覆蓋在`elasticsearch.yml`定義的任何其他設置）。可以通過配置分析設置來使用hunspell過濾器： | `{` `"analysis"`?`: {` `"analyzer"`?`: {` `"en"`?`: {` `"tokenizer"`?`:?``"standard"``,` `"filter"`?`: [?``"lowercase"``,?``"en_US"`?`]` `}` `},` `"filter"`?`: {` `"en_US"`?`: {` `"type"`?`:?``"hunspell"``,` `"locale"`?`:?``"en_US"``,` `"dedup"`?`:?``true` `}` `}` `}` `}` | hunspell過濾器接受四個選項： locale 此過濾器的區域設置。如果這沒有設置，則使用lang或者language來代替它們，因此必須設置其中一個。 dictionary 字典的名稱您的hunspell字典的路徑應通過indices.analysis.hunspell.dictionary.location 。 dedup 如果需要返回單條結果，則需要將其設置為true 。默認為true 。 longest_only 如果只返回最長的結果，請將其設置為true 。默認為false ：返回所有可能的結果。與（基于算法的）?snowball stemmers（雪球詞干分析器）相反，這是基于詞典查找的，因此詞干的質量由詞典的質量決定。 ## 加載字典默認情況下，當節點啟動時，將為該字典檢查默認的Hunspell目錄（?`config/hunspell/`?），并自動加載任何字典。通過在配置文件`indices.analysis.hunspell.dictionary.lazy`設置為`true`?，可以將字典加載實際使用。 ? ## 參考 Hunspell是一個拼寫檢查器和形態分析器，專為具有豐富形態和復雜的字復合和字符編碼的語言而設計。 1. 維基百科，?[http://en.wikipedia.org/wiki/Hunspell](https://translate.googleusercontent.com/translate_c?depth=1&hl=zh-CN&rurl=translate.google.com&sl=en&sp=nmt4&tl=zh-CN&u=http://en.wikipedia.org/wiki/Hunspell&usg=ALkJrhhTe4SNBi-yzE2Ium7lZbYNdA0rjA) 2. 源代碼，?[http://hunspell.sourceforge.net/](https://translate.googleusercontent.com/translate_c?depth=1&hl=zh-CN&rurl=translate.google.com&sl=en&sp=nmt4&tl=zh-CN&u=http://hunspell.sourceforge.net/&usg=ALkJrhjW2G8qRbu0lZQh2LVWUBJYgNKtWQ) 3. 開放辦公室Hunspell字典，?[http://wiki.openoffice.org/wiki/Dictionaries](https://translate.googleusercontent.com/translate_c?depth=1&hl=zh-CN&rurl=translate.google.com&sl=en&sp=nmt4&tl=zh-CN&u=http://wiki.openoffice.org/wiki/Dictionaries&usg=ALkJrhjol_nl3ZVfCZVXqDENOiy7iD3A8g) 4. Mozilla Hunspell字典，?[https://addons.mozilla.org/en-US/firefox/language-tools/](https://translate.googleusercontent.com/translate_c?depth=1&hl=zh-CN&rurl=translate.google.com&sl=en&sp=nmt4&tl=zh-CN&u=https://addons.mozilla.org/en-US/firefox/language-tools/&usg=ALkJrhjkouoWClkhvx3I7BTP9HzM9bIOtw) 5. Chromium Hunspell字典，?[http://src.chromium.org/viewvc/chrome/trunk/deps/third_party/hunspell_dictionaries/](https://translate.googleusercontent.com/translate_c?depth=1&hl=zh-CN&rurl=translate.google.com&sl=en&sp=nmt4&tl=zh-CN&u=http://src.chromium.org/viewvc/chrome/trunk/deps/third_party/hunspell_dictionaries/&usg=ALkJrhjpxkhrlh-puMLQPulnPxd5rDvdxw)