...

  1. Connect to the Tufts High Performance Compute Cluster. See Connecting for a detailed guide.

  2. Load the Spark module with the following command:

    module load spark

    To list all available modules (potentially including different versions of Spark), type:

    module avail

    You can load a particular version of Spark by naming it in the module load command, or use the generic module name (spark) to load the latest version.

  3. Start a PySpark session by typing:

    pyspark
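
Once the PySpark prompt appears, a quick way to confirm that the session works is to run a small job and compare the result with a known value. The sketch below is illustrative only: the function names and the application name "sanity-check" are made up for this example, and it assumes the pyspark package is importable, which should be the case after the Spark module is loaded.

```python
def closed_form_sum(n):
    """Reference value: the sum 0 + 1 + ... + (n - 1), computed without Spark."""
    return n * (n - 1) // 2


def spark_sum(n):
    """Compute the same sum with Spark (hypothetical helper for this example).

    pyspark is imported inside the function so that the definition itself
    does not fail on a machine where the Spark module is not loaded.
    """
    from pyspark.sql import SparkSession

    # Inside the pyspark shell, getOrCreate() returns the session that already exists.
    spark = SparkSession.builder.appName("sanity-check").getOrCreate()
    # Distribute the numbers 0..n-1 across the workers and add them up.
    return spark.sparkContext.parallelize(range(n)).sum()
```

Pasting both definitions into the PySpark shell and evaluating spark_sum(1000) == closed_form_sum(1000) should return True if the session is healthy.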

...