Here is Something !: Hive Bucketed Tables

Tuesday, 2 December 2014

Hive Bucketed Tables

In previous post we had seen how to create partition tables in Hive.

Lets see how to create buckets in Hive table

The main difference between Hive partitioning and Bucketing is ,when we do partitioning, we create a partition for each unique value of the column. But there may be situation where we need to create lot of tiny partitions. But if you use bucketing, you can limit it to a number which you choose and decompose your data into those buckets. In hive a partition is a directory but a bucket is a file.

In hive, bucketing does not work by default. You will have to set following variable to enable bucketing. set hive.enforce.bucketing=true;

1. Creating a staging table to store your data

create external table stagingtbl (EmployeeID Int,FirstName String,Designation String,Salary Int,Department String) row format delimited fields terminated by "," location '/user/aibladmin/Hive';

2. Create bucketed table

create table emp_bucket (EmployeeID Int,FirstName String,Designation String,Salary Int,Department String) clustered by (department) into 3 buckets row format delimited fields terminated by ",";

3. Load data from stagingtbl to bucketed table

from stagingtbl insert into table emp_bucket
select employeeid,firstname,designation,salary,department;

4. Check how many data file have created in Hive metastore.

Lets check the table content in Hive warehouse

We can find 3 files in warehouse directory for department A,B and C.Each bucket contains unique values.

30 comments:

Unknown11 March 2015 at 15:27
Very nice explanation

thanks
ReplyDelete
Replies
Unknown9 August 2015 at 13:12
Hi Sreeveni,
Did you use bucket map join. can you explain usecase for bucket map join.explain with simple example

Thanks
Hareesh
ReplyDelete
Replies
rockprabha0820 March 2016 at 10:43
When i tried this i don't see all the 3buckets created only one is created with all data can you please explain me whether do i need to set anything other than what was mentioned here.
ReplyDelete
Replies
Venu12 May 2016 at 07:06
Hi sreeveni, Nice explanation,
I need a small info when would we use exactly this bucketing concepts? real time scenarios can you explain pls?!
Thanks
Venu
http://www.apachespark.in
ReplyDelete
Replies
Unknown25 August 2016 at 23:19
This comment has been removed by the author.
ReplyDelete
Replies
Unknown25 August 2016 at 23:43
Thank you. Very helpful explanation for hive bucketing. you can also see the full details about hive partition and bucketing as well as the hadoop ecosystems in-depth with clear examples in the below link http://www.geoinsyssoft.com/hive-partition-bucketing/
ReplyDelete
Replies
Unknown12 September 2016 at 00:11
This comment has been removed by the author.
ReplyDelete
Replies
Unknown12 September 2016 at 00:17
This comment has been removed by the author.
ReplyDelete
Replies
Unknown12 September 2016 at 02:31
very nice explanation..thanks for sharing..and visit our site for more on hadoop..
http://bit.ly/2bZrnGP
ReplyDelete
Replies
Unknown1 October 2016 at 01:28
its a very good explanation for hive bucketing..easy to learn..

http://bit.ly/2dcxHPD
ReplyDelete
Replies
amar26 August 2017 at 00:18
it's very nice blog
ReplyDelete
Replies
Unknown3 September 2017 at 22:56
Very good blog helpful to everyone Hadoop training in bangalore
Tableau training in bangalore
Android training in bangalore
Php training in bangalore
ReplyDelete
Replies
nishanth25 September 2017 at 23:39
Excellent blog on Hive Bucketed Tables. Thank you sharing you knowledge with us.
Devops Training in Bangalore
itEanz
ReplyDelete
Replies
Unknown30 October 2017 at 22:07
Very nice blog artificial intelligence training in bangalore
ReplyDelete
Replies
Unknown9 November 2017 at 01:02
u s e f u l l article. Thanks for sharing
ReplyDelete
Replies
Unknown26 November 2017 at 23:10
v e r y i n f o r mative blog
ReplyDelete
Replies
Unknown19 December 2017 at 03:36
Is there a way to re-create the same bucketed table again after droping it.
like in partitioning we use msck repair to get our partitioned data.
ReplyDelete
Replies
UNKNOWN19 May 2018 at 01:28
I have to voice my passion for your kindness giving support to those people that should have guidance on this important matter.

AWS Training in Bangalore

ReplyDelete
Replies
Elegant IT Services25 February 2020 at 23:58
Thanks for sharing the information...
qlikview training in bangalore
ReplyDelete
Replies
Elegant IT Services26 February 2020 at 00:08
Thanks for the amazing information...
power bi training in bangalore
ReplyDelete
Replies
lavanya30 July 2020 at 09:34
Thanks for your great and helpful presentation I like your good service. I always appreciate your post. That is very interesting I love reading and I am always searching for informative information like this. Nice blog,I understood the topic very clearly,And want to study more like thisJava training in Chennai

Java Online training in Chennai

Java Course in Chennai

Best JAVA Training Institutes in Chennai

Java training in Bangalore

Java training in Hyderabad

Java Training in Coimbatore

Java Training

Java Online Training
ReplyDelete
Replies
surya4 August 2020 at 06:20
Thanks for sharing an informative blog keep rocking bring more details.I like the helpful info you provide in your articles. I’ll bookmark your weblog and check again here regularly. I am quite sure I will learn much new stuff right here! Good luck for the next!

angular js training in chennai

angular training in chennai

angular js online training in chennai

angular js training in bangalore

angular js training in hyderabad

angular js training in coimbatore

angular js training

angular js online training

ReplyDelete
Replies
aravind8 August 2020 at 08:46
This article is really helpful for me. I am regular visitor to this blog. Share such kind of article more in future.It’s hard to come by experienced people about this subject, but you seem like you know what you’re talking about! Thanks.
DevOps Training in Chennai

DevOps Online Training in Chennai

DevOps Training in Bangalore

DevOps Training in Hyderabad

DevOps Training in Coimbatore

DevOps Training

DevOps Online Training
ReplyDelete
Replies
dhinesh13 August 2020 at 07:20
Great site and a great topic as well I really get amazed to read this.This is incredible,I feel really happy to have seen your webpage.I gained many unknown information, the way you have clearly explained is really fantastic.keep posting such useful information.
Full Stack Training in Chennai | Certification | Online Training Course
Full Stack Training in Bangalore | Certification | Online Training Course

Full Stack Training in Hyderabad | Certification | Online Training Course
Full Stack Developer Training in Chennai | Mean Stack Developer Training in Chennai
Full Stack Training

Full Stack Online Training

ReplyDelete
Replies
sushmi28 August 2020 at 05:54
Thanks for sharing an informative blog keep rocking bring more details.I like the helpful info you provide in your articles. I’ll bookmark your weblog and check again here regularly. I am quite sure I will learn much new stuff right here! Good luck for the next!

AWS Course in Bangalore

AWS Course in Hyderabad

AWS Course in Coimbatore

AWS Course

AWS Certification Course

AWS Certification Training

AWS Online Training

AWS Training
ReplyDelete
Replies
prabhu29 August 2020 at 00:09
There are many interesting information included and i can easily understand all given information.I post something on my blog to post something, or wait to post something worth saying. Keep update more information....

IELTS Coaching in chennai

German Classes in Chennai

GRE Coaching Classes in Chennai

TOEFL Coaching in Chennai

spoken english classes in chennai | Communication training

ReplyDelete
Replies
vivekvedha29 August 2020 at 00:19
very nice explanation..thanks for sharing..and visit our site.
acte chennai

acte complaints

acte reviews

acte trainer complaints

acte trainer reviews

acte velachery reviews complaints

acte tambaram reviews complaints

acte anna nagar reviews complaints

acte porur reviews complaints

acte omr reviews complaints

ReplyDelete
Replies

Subscribe to: Post Comments (Atom)