Apache Hive Cookbook:

Hanish Bansal

Saurabh Chauhan

Shrey Mehrotra

AU$24.99 per month

3 (4 Ratings)

Paperback Apr 2016 268 pages 1st Edition

eBook

AU$36.99 ~~AU$53.99~~

Renews at AU$24.99p/m

Hanish Bansal

Saurabh Chauhan

Shrey Mehrotra

AU$24.99 per month

3 (4 Ratings)

Paperback Apr 2016 268 pages 1st Edition

eBook

AU$36.99 ~~AU$53.99~~

Renews at AU$24.99p/m

eBook

AU$36.99 ~~AU$53.99~~

Renews at AU$24.99p/m

What do you get with a Packt Subscription?

Free for first 7 days. $24.99 p/m after that. Cancel any time!

Unlimited ad-free access to the largest independent learning library in tech. Access this title and thousands more!

50+ new titles added per month, including many first-to-market concepts and exclusive early access to books as they are being written.

Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.

Thousands of reference materials covering every tech concept you need to stay up to date.

Subscribe now

View plans & pricing

View table of contents

Preview Book

Apache Hive Cookbook

Chapter 2. Services in Hive

In the previous chapter, you learned how we could install Hive with different metastore configurations. We also have gone through Hive clients and Hive services in brief.

In this chapter, we will cover the following recipes in detail:

Introducing HiveServer2
Understanding HiveServer2 properties
Configuring HiveServer2 high availability
Using HiveServer2 clients
Introducing the Hive metastore service
Configuring high availability of metastore service
Introducing Hue

Understanding HiveServer2 properties

By default, HiveServer2 is started with default configurations. The configurations are mainly related to the port and host on which the server is going to start and number of threads that could be configured for client and background operations.

How to do it…

You can change the default properties for HiveServer2 by overriding the value in hive-site.xml in the conf folder of Hive package.

Property	Default Value	Description
`hive.server2.thrift.port`	`10000`	HiveServer2 thrift interface
`hive.server2.thrift.bind.host`	`localhost`	HiveServer2 bind host
`hive.server2.thrift.min.worker.threads`	`5`	Minimum thrift worker threads
`hive.server2.thrift.max.worker.threads`	`500`	Maximum thrift worker threads
`hive.server2.authentication`	`None`	None/LDAP/KERBEROS/PAM/NOSASL
`hive.server2.authentication.kerberos.keytab`	""	A keytab file for kerberos principal
`hive.server2.authentication.kerberos.principal`	""	The Kerberos principal
`hive...`

Key benefits

Grasp a complete reference of different Hive topics.
Get to know the latest recipes in development in Hive including CRUD operations
Understand Hive internals and integration of Hive with different frameworks used in today’s world.

Description

Hive was developed by Facebook and later open sourced in Apache community. Hive provides SQL like interface to run queries on Big Data frameworks. Hive provides SQL like syntax also called as HiveQL that includes all SQL capabilities like analytical functions which are the need of the hour in today’s Big Data world. This book provides you easy installation steps with different types of metastores supported by Hive. This book has simple and easy to learn recipes for configuring Hive clients and services. You would also learn different Hive optimizations including Partitions and Bucketing. The book also covers the source code explanation of latest Hive version. Hive Query Language is being used by other frameworks including spark. Towards the end you will cover integration of Hive with these frameworks.

Who is this book for?

The book is intended for those who want to start in Hive or who have basic understanding of Hive framework. Prior knowledge of basic SQL command is also required