...

/

Exploring MapReduce Runs

Exploring MapReduce Runs

This lesson continues with the discussion on running a MapReduce job.

We'll cover the following...

Exploring MapReduce Runs

In this lesson, we vary the number of reducers and see how that affects the output. Here we’ll increase the number of reducers to 3.

Connect to the terminal below and execute the commands. Each command is explained later in the lesson. You can read the explanation first and then execute the commands in the terminal.

Press + to interact
# Click on the terminal below and execute the commands in order
./DataJek/startHadoop.sh
hdfs dfs -copyFromLocal /DataJek/cars.data /
hadoop jar JarDependencies/MapReduceJarDependencies/MapReduce-1.0-SNAPSHOT.jar io.datajek.mapreduce.Driver /cars.data /MultipleReducers 3
hdfs dfs -ls /MultipleReducers
hdfs dfs -cat /MultipleReducers/part-r-00000
hdfs dfs -cat /MultipleReducers/part-r-00001
hdfs dfs -cat /MultipleReducers/part-r-00002
Terminal 1
Terminal
Loading...
  1. Start-up the hadoop cluster by running the command below:

    /DataJek/startHadoop.sh
    

    Upload ...

Create a free account to view this lesson.

By signing up, you agree to Educative's Terms of Service and Privacy Policy