...
- Connect to the Tufts High Performance Compute Cluster. See Connecting for a detailed guide.
Load the Spark module with the following command:
Code Block module load spark
Note that you can see a list of all available modules (potentially including different versions of Spark) by typing:
Code Block module avail
You can specify a specific version of Spark with the module load command or use the generic module name (spark) to load the latest version.
Start PySpark session by typing:
Code Block pyspark
...