Fetch · JAVA

[TOC]

# Introduction

Fetch conversion means that Hive can answer certain queries without running a MapReduce job. For example:

~~~
select * from employees;
~~~

In this case, Hive can simply read the files under the storage directory of the employees table and print the query result to the console.

# Configuration

In hive-default.xml.template, hive.fetch.task.conversion defaults to more. Older Hive versions defaulted to minimal. Once the property is set to more, full-table selects, column selects, and limit queries all skip MapReduce.

~~~
<property>
  <name>hive.fetch.task.conversion</name>
  <value>more</value>
  <description>
    Expects one of [none, minimal, more].
    Some select queries can be converted to single FETCH task minimizing latency.
    Currently the query should be single sourced not having any subquery and should not have
    any aggregations or distincts (which incurs RS), lateral views and joins.
    0. none : disable hive.fetch.task.conversion
    1. minimal : SELECT STAR, FILTER on partition columns, LIMIT only
    2. more : SELECT, FILTER, LIMIT only (support TABLESAMPLE and virtual columns)
  </description>
</property>
~~~

# Examples

1. Set hive.fetch.task.conversion to none, then run the queries below. Every one of them runs a MapReduce program.

~~~
hive> set hive.fetch.task.conversion=none;
~~~

Then run:

~~~
hive> select * from emp;
hive> select ename from emp;
hive> select ename from emp limit 3;
~~~

2. Set hive.fetch.task.conversion to more, then run the same queries. None of them runs a MapReduce program.

~~~
hive> set hive.fetch.task.conversion=more;
~~~

Then run:

~~~
hive> select * from emp;
hive> select ename from emp;
hive> select ename from emp limit 3;
~~~
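
The difference between the two settings can also be checked without waiting for a job to run, by asking Hive for the query plan. The session below is a minimal sketch, assuming the same emp table used in the examples and MapReduce as the execution engine; it only uses the standard explain statement and does not execute the query itself.

~~~
-- Assumption: emp is the table from the examples above.

-- With fetch conversion enabled, inspect the plan of a simple column query.
hive> set hive.fetch.task.conversion=more;
hive> explain select ename from emp limit 3;

-- With fetch conversion disabled, inspect the plan of the same query.
hive> set hive.fetch.task.conversion=none;
hive> explain select ename from emp limit 3;
~~~

With more, the plan should contain only a Fetch Operator stage. With none, it should list a Map Reduce stage as well; on other execution engines such as Tez or Spark the stage name will differ.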