pyspark.tar.gz

PySparkCodeWidget

PySparkLive

PySparkSPA

This course walks you through setting up docker for PySpark on the Educative platform.

A Complete Guide to PySpark Setup on Educative

## PySpark

In this lesson, we will set up Docker Job for `PySpark` in **Single Page Application**


### Docker job for Single Page Application 

Let’s see what each field in the above job means:

#### Select Docker job type

This is Docker Job Type selection in which we have to select what kind of docker job we are creating. 

``` Dockerfile
Live
```

#### Job name
This is just a job name for reference. You can use any name you want to specify for this job.

``` Dockerfile
PySparkSPA
```

#### Input file name
Name of the input file you want to run in the live widget. 

```
main.py
```

#### Run script
This script runs when we execute the code in the live widget. It is mandatory.

```
echo "Hello World"
```
#### Application port
We have to specify the port on which we want to run it.

```
8080
```

#### Start script
This script runs when we execute the Live Widget for the very first time. It is mandatory.


``` Dockerfile
cd usercode && python3 main.py
```
### Select Docker job
After creating the docker job for the SPA widget now select it as given below.

from pyspark.sql import SparkSession
from dotenv import load_dotenv
def create_spark_session():
    """Create a Spark Session"""
    _ = load_dotenv()
    return (
        SparkSession
        .builder
        .appName("helloworld")
        .master("local[5]")
        .getOrCreate()
    )
spark = create_spark_session()
print('Session Started')

import React from 'react';
import { shallow, configure } from 'enzyme';
import Adapter from 'enzyme-adapter-react-16';

configure({ adapter: new Adapter() });

import HelloWorld from './app';

var TestResult = function() {
    this.succeeded = false;
    this.reason = "";
    this.input = "";
    this.expected_output = "";
    this.actual_output = "";
}

export const executeTests = function() {

  var results = [];

  result = new TestResult();
  result.input = 'HelloWorld Component';
  result.expected_output = "span containing text 'Hello World'"

  let wrapper = shallow(<HelloWorld />);

  // Call your Challenge function here.

  let type = wrapper.type();
  let testSuccessful = true;
  let failureReason;

  if (type !== 'span') {
    testSuccessful = false;
    failureReason = "You need to render exactly one span HTML element";
  } else if (wrapper.props().children != "Hello World") {
     testSuccessful = false;
     failureReason = "You have rendered wrong message in your span element";
  }

  result.actual_output = wrapper.html();

  if (testSuccessful) {
    result.succeeded = true;
    result.reason = "Succeeded"
  } else {
    result.succeeded = false;
    result.reason = failureReason;
  }

  results.push(result);

  return results;
}


React

Dart

GoJS React

Typescript React

Vue.js

> **Note**: We used SPA widget as sometimes we need extra resources and we can enhance resources only in our SPA widget.

# PySpark

In this lesson, we will set up Docker Job for `PySpark` in **Single Page Application**


## Docker job for Single Page Application 

Let’s see what each field in the above job means:

## Select Docker job type

This is Docker Job Type selection in which we have to select what kind of docker job we are creating. 

``` Dockerfile
Live
```

### Job name
This is just a job name for reference. You can use any name you want to specify for this job.

``` Dockerfile
PySparkSPA
```

### Input file name
Name of the input file you want to run in the live widget. 

```
main.py
```

### Run script
This script runs when we execute the code in the live widget. It is mandatory.

```
echo "Hello World"
```
### Application port
We have to specify the port on which we want to run it.

```
8080
```

### Start script
This script runs when we execute the Live Widget for the very first time. It is mandatory.


``` Dockerfile
cd usercode && python3 main.py
```
### Select Docker job
After creating the docker job for the SPA widget now select it as given below.

__default

Running a PySpark Project using SPA

PySpark

Docker job for Single Page Application

Select Docker job type

Job name

Input file name

Run script

Application port

Start script

Select Docker job

Let’s run PySpark code in SPA widget