Map Reduce Framework
Here, you’ll learn about Map Reduce, which is a programming model, and it speeds up processing and computation through distribution.
We'll cover the following...
Map Reduce
This is a programming model introduced by Google. It is part of the Hadoop Ecosystem. It enables us to process large datasets in a distributed environment in a distributed and parallel manner.
Map Reduce consists of two tasks, Map and Reduce as it is also visible from the above diagram. Reducer is run after the Map operation has run. Map operations take in the input, and apply the logic. Then, they produce the output in the form of pairs.
Next, the reducer receives the pairs from multiple Map Jobs, as it is also visible from the above diagram. The responsibility of Reducer is to aggregate those intermediate results produced by Mapper functions, and then produce the ...