# Fingerprint Analyzer

Original link: [https://www.elastic.co/guide/en/elasticsearch/reference/5.3/analysis-fingerprint-analyzer.html](https://www.elastic.co/guide/en/elasticsearch/reference/5.3/analysis-fingerprint-analyzer.html)

Translation link: [http://www.apache.wiki/display/Elasticsearch/analysis-fingerprint-analyzer.html](http://www.apache.wiki/display/Elasticsearch)

Contributors: [ApacheCN](/display/~apachecn), [Apache中文網](/display/~apachechina)

The `fingerprint` analyzer implements the [fingerprinting algorithm](https://github.com/OpenRefine/OpenRefine/wiki/Clustering-In-Depth#fingerprint) used by the OpenRefine project to assist in clustering.

Input text is lowercased, normalized to remove extended characters, sorted, deduplicated, and concatenated into a single token. If a stopword list is configured, stop words are also removed.

**Definition**

It consists of:

Tokenizer

* [Standard Tokenizer](https://www.elastic.co/guide/en/elasticsearch/reference/5.3/analysis-standard-tokenizer.html "Standard Tokenizer")

Token filters

* [Lower Case Token Filter](https://www.elastic.co/guide/en/elasticsearch/reference/5.3/analysis-lowercase-tokenfilter.html "Lowercase Token Filter")
* [ASCII Folding Token Filter](https://www.elastic.co/guide/en/elasticsearch/reference/5.3/analysis-asciifolding-tokenfilter.html "ASCII Folding Token Filter")
* [Stop Token Filter](https://www.elastic.co/guide/en/elasticsearch/reference/5.3/analysis-stop-tokenfilter.html "Stop Token Filter") (disabled by default)
* [Fingerprint Token Filter](https://www.elastic.co/guide/en/elasticsearch/reference/5.3/analysis-fingerprint-tokenfilter.html "Fingerprint Token Filter")

## **Example output**

```
POST _analyze
{
  "analyzer": "fingerprint",
  "text": "Yes yes, Gödel said this sentence is consistent and."
}
```

The sentence above produces the following single term:

```
[ and consistent godel is said sentence this yes ]
```

## **Configuration**

The `fingerprint` analyzer accepts the following parameters:

| Parameter | Description |
| --- | --- |
| `separator` | The character used to concatenate the terms. Defaults to a space. |
| `max_output_size` | The maximum token size to emit. Defaults to 255. Tokens larger than this size are discarded. |
| `stopwords` | A predefined stop word list such as `_english_`, or an array containing a list of stop words. Defaults to `_none_`. |
| `stopwords_path` | The path to a file containing stop words. |

For more information about stop word configuration, see the [Stop Token Filter](https://www.elastic.co/guide/en/elasticsearch/reference/5.3/analysis-stop-tokenfilter.html "Stop Token Filter").

## **Example configuration**

In this example, we configure the `fingerprint` analyzer to use the predefined list of English stop words:

```
PUT my_index
{
  "settings": {
    "analysis": {
      "analyzer": {
        "my_fingerprint_analyzer": {
          "type": "fingerprint",
          "stopwords": "_english_"
        }
      }
    }
  }
}

POST my_index/_analyze
{
  "analyzer": "my_fingerprint_analyzer",
  "text": "Yes yes, Gödel said this sentence is consistent and."
}
```

The example above produces the following term:

```
[ consistent godel said sentence yes ]
```
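
To make the processing order easier to follow, here is a rough Python sketch of the same pipeline: tokenize, lowercase, fold to ASCII, drop stop words, sort, deduplicate, and join. It is an approximation under several assumptions and not the Elasticsearch implementation. The function name `fingerprint` and the `ILLUSTRATIVE_STOPWORDS` set are hypothetical, the regular expression only approximates the Standard Tokenizer, ASCII folding is approximated with Unicode decomposition, and `max_output_size` is measured in characters.

```python
import re
import unicodedata

# Hypothetical stop word set for illustration only. It is NOT the full
# `_english_` list; it just covers the words in the sample sentence.
ILLUSTRATIVE_STOPWORDS = {"and", "is", "this"}


def fingerprint(text, separator=" ", max_output_size=255, stopwords=()):
    """Approximate the fingerprint analyzer; returns None if the result is too long."""
    # Rough stand-in for the Standard Tokenizer: split on non-word characters.
    tokens = re.findall(r"\w+", text)

    # Lower Case Token Filter.
    tokens = [token.lower() for token in tokens]

    # Rough stand-in for ASCII folding: decompose characters, then drop
    # the combining marks (so a precomposed "ö" becomes "o").
    folded = []
    for token in tokens:
        decomposed = unicodedata.normalize("NFKD", token)
        folded.append("".join(c for c in decomposed if not unicodedata.combining(c)))

    # Stop Token Filter (a no-op when no stop words are configured).
    stop = set(stopwords)
    kept = [token for token in folded if token not in stop]

    # Fingerprint Token Filter: sort, deduplicate, join into one token.
    result = separator.join(sorted(set(kept)))

    # Assumption: an oversized fingerprint is discarded entirely.
    if len(result) > max_output_size:
        return None
    return result
```

Calling `fingerprint("Yes yes, Gödel said this sentence is consistent and.")` gives `and consistent godel is said sentence this yes`, and passing `stopwords=ILLUSTRATIVE_STOPWORDS` gives `consistent godel said sentence yes`, which matches the two analyzer outputs shown in this article.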