Chapter 6. Data Aggregation and Sampling
This chapter is about how to aggregate and sample data in Hive. It first covers the use of several aggregation functions, analytic functions working with GROUP BY and PARTITION BY, and windowing clauses. It then introduces different ways of sampling data in Hive.
In this chapter, we will cover the following topics:
- Basic aggregation
- Advanced aggregation
- Aggregation condition
- Analytic functions
- Sampling