DAG of Stages in Apache Spark
Learn how Apache Spark provides efficient fault tolerance.
As explained in the previous lesson, the driver examines the lineage graph of the application code and, through its DAG scheduler, builds a DAG of stages.
A DAG of Stages is shown in the following illustration:
- Each stage contains as many pipelined transformations with narrow dependencies (one-to-one) as possible.
- The boundaries of each stage correspond to operations with wide dependencies, which require data to be shuffled between partitions.
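
To make the stage-splitting rule concrete, here is a minimal sketch in plain Python. It is a toy model, not Spark's actual scheduler: the function `split_into_stages`, the `NARROW`/`WIDE` labels, and the linear lineage are all hypothetical names chosen for illustration. It assumes a simple chain of transformations (a real lineage is a graph) and starts a new stage whenever it meets a transformation with a wide dependency.

```python
from typing import List, Tuple

# Hypothetical labels for the two kinds of dependencies.
NARROW = "narrow"  # each parent partition feeds at most one child partition
WIDE = "wide"      # child partitions need data from many parent partitions (shuffle)


def split_into_stages(lineage: List[Tuple[str, str]]) -> List[List[str]]:
    """Group a linear chain of transformations into stages.

    Narrow transformations are pipelined into the current stage.
    A wide transformation closes the current stage and opens a new one,
    because its input must first be shuffled.
    """
    stages: List[List[str]] = []
    current: List[str] = []
    for name, dependency in lineage:
        if dependency == WIDE and current:
            # Stage boundary: everything collected so far runs before the shuffle.
            stages.append(current)
            current = []
        current.append(name)
    if current:
        stages.append(current)
    return stages


# An illustrative word-count-style lineage (assumed for this example only).
lineage = [
    ("flatMap", NARROW),
    ("map", NARROW),
    ("reduceByKey", WIDE),
    ("filter", NARROW),
    ("sortByKey", WIDE),
]

stages = split_into_stages(lineage)
for index, stage in enumerate(stages):
    print(f"Stage {index}: {' -> '.join(stage)}")
```

In this sketch, `flatMap` and `map` are pipelined into one stage, `reduceByKey` opens a second stage that also absorbs the narrow `filter`, and `sortByKey` opens a third. In a real application, calling `toDebugString` on an RDD shows the lineage that Spark uses to make this decision.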