Exploring MapReduce Runs
This lesson continues with the discussion on running a MapReduce job.
We'll cover the following...
Exploring MapReduce Runs
In this lesson, we vary the number of reducers and see how that affects the output. Here we’ll increase the number of reducers to 3.
Connect to the terminal below and execute the commands. Each command is explained later in the lesson. You can read the explanation first and then execute the commands in the terminal.
Press + to interact
# Click on the terminal below and execute the commands in order./DataJek/startHadoop.shhdfs dfs -copyFromLocal /DataJek/cars.data /hadoop jar JarDependencies/MapReduceJarDependencies/MapReduce-1.0-SNAPSHOT.jar io.datajek.mapreduce.Driver /cars.data /MultipleReducers 3hdfs dfs -ls /MultipleReducershdfs dfs -cat /MultipleReducers/part-r-00000hdfs dfs -cat /MultipleReducers/part-r-00001hdfs dfs -cat /MultipleReducers/part-r-00002
-
Start-up the hadoop cluster by running the command below:
/DataJek/startHadoop.sh
Upload ...
Create a free account to view this lesson.
By signing up, you agree to Educative's Terms of Service and Privacy Policy