Writing our first Hadoop MapReduce job
In this recipe, we will write our first MapReduce job using the Hadoop MapReduce API and run it using the mongo-hadoop connector getting the data from MongoDB. Refer to the Executing MapReduce in Mongo using a Java client recipe in Chapter 3, Programming Language Drivers to see how MapReduce is implemented using a Java client, test data creation, and problem statement.
Getting ready
Refer to the previous Executing our first sample MapReduce job using the mongo-hadoop connector recipe to set up the mongo-hadoop connector. The prerequisites of this recipe and the Executing MapReduce in Mongo using a Java client recipe from Chapter 3, Programming Language Drivers are all that we need for this recipe. This is a maven project and thus maven needs to be set up and installed. Refer to the Connecting to the Single node from a Java client recipe in Chapter 1, Installing and Starting the Server where we provided the steps to set up maven in Windows; this project...