內置運算符 · 大數據

[TOC] 內容較多，見《Hive官方文檔》 ~~~ show functions; ~~~ # 關系運算 Hive支持的關系運算符常見的關系運算符 ~~~ 等值比較: = 不等值比較: <> 小于比較: < 小于等于比較: <= 大于比較: > 大于等于比較: >= 空值判斷: IS NULL 非空判斷: IS NOT NULL LIKE比較: LIKE JAVA的LIKE操作: RLIKE REGEXP操作: REGEXP 等值比較: = ~~~ **語法：A=B** 操作類型：所有基本類型描述: 如果表達式A與表達式B相等，則為TRUE；否則為FALSE 舉例： ~~~ hive> select 1 from dual where 1=1; ~~~ **不等值比較: `<>`** 語法: `A <> B` 操作類型: 所有基本類型描述: 如果表達式A為NULL，或者表達式B為NULL，返回NULL；如果表達式A與表達式B不相等，則為TRUE；否則為FALSE 舉例： ~~~ hive> select 1 from dual where 1 <> 2; ~~~ **小于比較: <** 語法:` A < B` 操作類型: 所有基本類型描述: 如果表達式A為NULL，或者表達式B為NULL，返回NULL；如果表達式A小于表達式B，則為TRUE；否則為FALSE 舉例： ~~~ hive> select 1 from dual where 1 < 2; ~~~ **小于等于比較: `<=`** 語法: `A <= B` 操作類型: 所有基本類型描述: 如果表達式A為NULL，或者表達式B為NULL，返回NULL；如果表達式A小于或者等于表達式B，則為TRUE；否則為FALSE 舉例： ~~~ hive> select 1 from dual where 1 <= 1; ~~~ **大于等于比較: `>=`** 語法: `A >= B` 操作類型: 所有基本類型描述: 如果表達式A為NULL，或者表達式B為NULL，返回NULL；如果表達式A大于或者等于表達式B，則為TRUE；否則為FALSE 舉例： ~~~ hive> select 1 from dual where 1 >= 1; ~~~ **空值判斷: IS NULL** 語法: A IS NULL 操作類型: 所有類型描述: 如果表達式A的值為NULL，則為TRUE；否則為FALSE 舉例： ~~~ hive> select 1 from dual where null is null; ~~~ **非空判斷: IS NOT NULL** 語法: A IS NOT NULL 操作類型: 所有類型描述: 如果表達式A的值為NULL，則為FALSE；否則為TRUE 舉例： ~~~ hive> select 1 from dual where 1 is not null; ~~~ **LIKE比較: LIKE** 語法: A LIKE B 操作類型: strings 描述: 如果字符串A或者字符串B為NULL，則返回NULL；如果字符串A符合表達式B 的正則語法，則為TRUE；否則為FALSE。B中字符`"_"`表示任意單個字符，而字符`”%”`表示任意數量的字符。舉例： ~~~ hive> select 1 from dual where 'key' like 'foot%'; 1 hive> select 1 from dual where 'key ' like 'foot____'; 1 ~~~ 注意：否定比較時候用 NOT A LIKE B ~~~ hive> select 1 from dual where NOT 'key ' like 'fff%'; ~~~ **JAVA的LIKE操作: RLIKE** 語法: A RLIKE B 操作類型: strings 描述: 如果字符串A或者字符串B為NULL，則返回NULL；如果字符串A符合JAVA正則表達式B的**正則語法**，則為TRUE；否則為FALSE。舉例： ~~~ hive> select 1 from dual where 'footbar' rlike '^f.*r$’; 1 ~~~ 注意：判斷一個字符串是否全為數字： ~~~ hive>select 1 from dual where '123456' rlike '^\\d+$'; 1 hive> select 1 from dual where '123456aa' rlike '^\\d+$'; 1 ~~~ **REGEXP操作: REGEXP** 語法: A REGEXP B 操作類型: strings 描述: 功能與RLIKE相同舉例： ~~~ hive> select 1 from dual where 'key' REGEXP '^f.*r$'; 1 ~~~ # 邏輯運算與數學運算 ## Hive數學運算加法操作: + 減法操作: - 乘法操作: * 除法操作: / 取余操作: % 位與操作: & 位或操作: | 位異或操作: ^ 位取反操作: ~ 加法操作: + **語法: A + B** 操作類型：所有數值類型說明：返回A與B相加的結果。結果的數值類型等于A的類型和B的類型的最小父類型（詳見數據類型的繼承關系）。比如，int + int 一般結果為int類型，而int + double 一般結果為double類型舉例： ~~~ hive> select 1 + 9 from dual; 10 ~~~ **減法操作: -** 語法: A - B 操作類型：所有數值類型說明：返回A與B相減的結果。結果的數值類型等于A的類型和B的類型的最小父類型（詳見數據類型的繼承關系）。比如，int - int 一般結果為int類型，而int - double 一般結果為double類型舉例： ~~~ hive> select 10 - 5 from dual; 5 ~~~ **`乘法操作 : *`** 語法: `A * B` 操作類型：所有數值類型說明：返回A與B相乘的結果。結果的數值類型等于A的類型和B的類型的最小父類型（詳見數據類型的繼承關系）。注意，如果A乘以B的結果超過默認結果類型的數值范圍，則需要通過cast將結果轉換成范圍更大的數值類型舉例： ~~~ hive> select 40 * 5 from dual; 200 ~~~ **除法操作 : /** 語法: A / B 操作類型：所有數值類型說明：返回A除以B的結果。結果的數值類型為double 舉例： ~~~ hive> select 40 / 5 from dual; 8.0 ~~~ 注意： hive 中最高精度的數據類型是 double, 只精確到小數點后 16 位，在做除法運算的時候要特別注意 ~~~ hive>select ceil(28.0/6.999999999999999999999) from dual limit 1; 結果為4 hive>select ceil(28.0/6.99999999999999) from dual limit 1; 結果為5 ~~~ **取余操作 : %** 語法: `A % B` 操作類型：所有數值類型說明：返回A除以B的余數。結果的數值類型等于A的類型和B的類型的最小父類型（詳見數據類型的繼承關系）。舉例： ~~~ hive> select 41 % 5 from dual; 1 hive> select 8.4 % 4 from dual; 0.40000000000000036 ~~~ 注意：精度在 hive 中是個很大的問題，類似這樣的操作最好通過 round 指定精度 ~~~ hive> select round(8.4 % 4 , 2) from dual; 0.4 ~~~ **位與操作 : &** 語法: `A & B` 操作類型：所有數值類型說明：返回A和B按位進行與操作的結果。結果的數值類型等于A的類型和B的類型的最小父類型（詳見數據類型的繼承關系）。舉例： ~~~ hive> select 4 & 8 from dual; 0 hive> select 6 & 4 from dual; 4 ~~~ **位或操作 : |** 語法: A | B 操作類型：所有數值類型說明：返回A和B按位進行或操作的結果。結果的數值類型等于A的類型和B的類型的最小父類型（詳見數據類型的繼承關系）。舉例： ~~~ hive> select 4 | 8 from dual; 12 hive> select 6 | 8 from dual; 14 ~~~ **位異或操作 : ^** 語法: A ^ B 操作類型：所有數值類型說明：返回A和B按位進行異或操作的結果。結果的數值類型等于A的類型和B的類型的最小父類型（詳見數據類型的繼承關系）。舉例： ~~~ hive> select 4 ^ 8 from dual; 12 hive> select 6 ^ 4 from dual; 2 ~~~ **位取反操作 : ~** 語法: ~A 操作類型：所有數值類型說明：返回A按位取反操作的結果。結果的數值類型等于A的類型。舉例： ~~~ hive> select ~6 from dual; -7 hive> select ~4 from dual; -5 ~~~ ## Hive邏輯運算邏輯與操作: AND 邏輯或操作: OR 邏輯非操作: NOT **邏輯與操作 : AND** 語法: A AND B 操作類型：boolean 說明：如果A和B均為TRUE，則為TRUE；否則為FALSE。如果A為NULL或B為NULL，則為NULL 舉例： ~~~ hive> select 1 from dual where 1=1 and 2=2; 1 ~~~ **邏輯或操作 : OR** 語法: A OR B 操作類型：boolean 說明：如果A為TRUE，或者B為TRUE，或者A和B均為TRUE，則為TRUE；否則為FALSE 舉例： ~~~ hive> select 1 from dual where 1=2 or 2=2; 1 ~~~ **邏輯非操作 : NOT** 語法: NOT A 操作類型：boolean 說明：如果A為FALSE，或者A為NULL，則為TRUE；否則為FALSE 舉例： ~~~ hive> select 1 from dual where not 1=2; ~~~ **邏輯非操作 : NOT** 語法: NOT A 操作類型：boolean 說明：如果A為FALSE，或者A為NULL，則為TRUE；否則為FALSE 舉例： ~~~ hive> select 1 from dual where not 1=2 ; ~~~ # 數值運算取整函數: round 指定精度取整函數: round 向下取整函數: floor 向上取整函數: ceil 向上取整函數: ceiling 取隨機數函數: rand 自然指數函數: exp 以10為底對數函數: log10 以2為底對數函數: log2 對數函數: log 冪運算函數: pow 冪運算函數: power 開平方函數: sqrt 二進制函數: bin 十六進制函數: hex 反轉十六進制函數: unhex 進制轉換函數: conv 絕對值函數: abs 正取余函數: pmod 正弦函數: sin 反正弦函數: asin 余弦函數: cos 反余弦函數: acos positive函數: positive negative函數: negative **取整函數 : round** 語法: `round(double a)` 返回值: BIGINT 說明: 返回double類型的整數值部分（遵循四舍五入）舉例： ~~~ hive> select round(3.1415926) from dual; 3 hive> select round(3.5) from dual; 4 hive> create table dual as select round(9542.158) from dual; hive> describe dual; _c0 bigint ~~~ **指定精度取整函數 : round** 語法: `round(double a, int d)` 返回值: DOUBLE 說明: 返回指定精度d的double類型舉例： ~~~ hive> select round(3.1415926,4) from dual; 3.1416 ~~~ **向下取整函數 : floor** 語法: floor(double a) 返回值: BIGINT 說明: 返回等于或者小于該double變量的最大的整數舉例： ~~~ hive> select floor(3.1415926) from dual; 3 hive> select floor(25) from dual; 25 ~~~ **向上取整函數 : ceil** 語法: ceil(double a) 返回值: BIGINT 說明: 返回等于或者大于該double變量的最小的整數舉例： ~~~ hive> select ceil(3.1415926) from dual; 4 hive> select ceil(46) from dual; 46 ~~~ **向上取整函數 : ceiling** 語法: ceiling(double a) 返回值: BIGINT 說明: 與ceil功能相同舉例： ~~~ hive> select ceiling(3.1415926) from dual; 4 hive> select ceiling(46) from dual; 46 ~~~ **取隨機數函數 : rand** 語法: rand(),rand(int seed) 返回值: double 說明: 返回一個0到1范圍內的隨機數。如果指定種子seed，則會等到一個穩定的隨機數序列舉例： ~~~ hive> select rand() from dual; 0.5577432776034763 ~~~ **自然指數函數 : exp** 語法: exp(double a) 返回值: double 說明: 返回自然對數e的a次方舉例： ~~~ hive> select exp(2) from dual; 7.38905609893065 ~~~ **自然對數函數: ln** 語法: ln(double a) 返回值: double 說明: 返回a的自然對數 **以 10 為底對數函數 : log10** 語法: log10(double a) 返回值: double 說明: 返回以10為底的a的對數舉例： ~~~ hive> select log10(100) from dual; 2.0 ~~~ **以 2 為底對數函數 : log2** 語法: log2(double a) 返回值: double 說明: 返回以2為底的a的對數舉例： ~~~ hive> select log2(8) from dual; 3.0 ~~~ **對數函數 : log** 語法: log(double base, double a) 返回值: double 說明: 返回以base為底的a的對數舉例： ~~~ hive> select log(4,256) from dual; 4.0 ~~~ **冪運算函數 : pow** 語法: pow(double a, double p) 返回值: double 說明: 返回a的p次冪舉例： ~~~ hive> select pow(2,4) from dual; 16.0 ~~~ **冪運算函數 : power** 語法: power(double a, double p) 返回值: double 說明: 返回a的p次冪,與pow功能相同舉例： ~~~ hive> select power(2,4) from dual; 16.0 ~~~ **開平方函數 : sqrt** 語法: sqrt(double a) 返回值: double 說明: 返回a的平方根舉例： ~~~ hive> select sqrt(16) from dual; 4.0 ~~~ **二進制函數 : bin** 語法: bin(BIGINT a) 返回值: string 說明: 返回a的二進制代碼表示舉例： ~~~ hive> select bin(7) from dual; 111 ~~~ **十六進制函數 : hex** 語法: hex(BIGINT a) 返回值: string 說明: 如果變量是int類型，那么返回a的十六進制表示；如果變量是string類型，則返回該字符串的十六進制表示舉例： ~~~ hive> select hex(17) from dual; 11 hive> select hex('abc') from dual; 616263 ~~~ **反轉十六進制函數 : unhex** 語法: unhex(string a) 返回值: string 說明: 返回該十六進制字符串所代碼的字符串舉例： ~~~ hive> select unhex('616263') from dual; abc hive> select unhex('11') from dual; - hive> select unhex(616263) from dual; abc ~~~ **進制轉換函數 : conv** 語法: conv(BIGINT num, int from_base, int to_base) 返回值: string 說明: 將數值num從from_base進制轉化到to_base進制舉例： ~~~ hive> select conv(17,10,16) from dual; 11 hive> select conv(17,10,2) from dual; 10001 ~~~ **絕對值函數 : abs** 語法: abs(double a) abs(int a) 返回值: double int 說明: 返回數值a的絕對值舉例： ~~~ hive> select abs(-3.9) from dual; 3.9 hive> select abs(10.9) from dual; 10.9 ~~~ **正取余函數 : pmod** 語法: pmod(int a, int b),pmod(double a, double b) 返回值: int double 說明: 返回正的a除以b的余數舉例： ~~~ hive> select pmod(9,4) from dual; 1 hive> select pmod(-9,4) from dual; 3 ~~~ **正弦函數 : sin** 語法: sin(double a) 返回值: double 說明: 返回a的正弦值舉例： ~~~ hive> select sin(0.8) from dual; 0.7173560908995228 ~~~ **反正弦函數 : asin** 語法: asin(double a) 返回值: double 說明: 返回a的反正弦值舉例： ~~~ hive> select asin(0.7173560908995228) from dual; 0.8 ~~~ **余弦函數 : cos** 語法: cos(double a) 返回值: double 說明: 返回a的余弦值舉例： ~~~ hive> select cos(0.9) from dual; 0.6216099682706644 ~~~ **反余弦函數 : acos** 語法: acos(double a) 返回值: double 說明: 返回a的反余弦值舉例： ~~~ hive> select acos(0.6216099682706644) from dual; 0.9 ~~~ **positive 函數 : positive** 語法: positive(int a), positive(double a) 返回值: int double 說明: 返回a 舉例： ~~~ hive> select positive(-10) from dual; -10 hive> select positive(12) from dual; 12 ~~~ **negative 函數 : negative** 語法: negative(int a), negative(double a) 返回值: int double 說明: 返回-a 舉例： ~~~ hive> select negative(-5) from dual; 5 hive> select negative(8) from dual; -8 ~~~