Data Definition · Hadoop

Show the current database in the Hive CLI prompt:

~~~
set hive.cli.print.current.db=true;
~~~

Create a table

~~~
CREATE TABLE sales(
  name   STRING,
  amount INT,
  region STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ',';
~~~

Insert statements

~~~
INSERT INTO TABLE sales VALUES("ljs",100,"beijing");
INSERT INTO TABLE sales VALUES("zhangs",10,"shanghai");
INSERT INTO TABLE sales VALUES("zhoug",8,"liaoning");
~~~

After the SQL statements run, the data is stored in HDFS under /hive/warehouse.

Create a table with collection types

~~~
CREATE TABLE employees(
  name         STRING,
  salary       FLOAT,
  subordinates ARRAY<STRING>,
  deductions   MAP<STRING,FLOAT>,
  address      STRUCT<street:STRING,city:STRING,state:STRING,zip:INT>)
ROW FORMAT DELIMITED
  FIELDS TERMINATED BY '\001'
  COLLECTION ITEMS TERMINATED BY '\002'
  MAP KEYS TERMINATED BY '\003'
  LINES TERMINATED BY '\n'
STORED AS TEXTFILE;
~~~

Bucketed tables: an introduction

~~~
CREATE TABLE bucketed_users(
  UserID     INT,
  Gender     STRING,
  Age        INT,
  Occupation STRING,
  Zipcode    STRING)
CLUSTERED BY (UserID) INTO 4 BUCKETS
ROW FORMAT DELIMITED FIELDS TERMINATED BY ',';
~~~

To populate a bucketed table, set the hive.enforce.bucketing property to true so that Hive creates the number of buckets declared in the table definition. (This applies to Hive 0.x and 1.x; from Hive 2.0 the property was removed and bucketing is always enforced.)

~~~
hive> set hive.enforce.bucketing = true;
~~~

Insert the data

~~~
INSERT OVERWRITE TABLE bucketed_users
SELECT UserID, Gender, Age, Occupation, Zipcode
FROM users;
~~~

Each bucket corresponds to one file on disk.
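One way to check the bucket layout after the insert is to list the table's files and then sample a single bucket. This is only a sketch: the directory `/hive/warehouse/bucketed_users` is an assumption derived from the warehouse location mentioned earlier (the real path depends on `hive.metastore.warehouse.dir` and on the database in use), so adjust it to your installation.

~~~
-- List the files behind the table. The path is an assumption; adjust it
-- to match your warehouse directory and database.
dfs -ls /hive/warehouse/bucketed_users;

-- Read only the rows that fall into the first of the 4 buckets,
-- bucketing on the same column the table is clustered by.
SELECT * FROM bucketed_users
TABLESAMPLE(BUCKET 1 OUT OF 4 ON UserID);
~~~

With 4 declared buckets, the listing would normally show four files, one per bucket. Because the sample uses the clustering column and the same bucket count as the table, Hive can usually satisfy it by reading a single bucket file instead of scanning the whole table.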