The Tufts High Performance Compute (HPC) cluster delivers 35,845,920 cpu hours and 59,427,840 gpu hours of free compute time per year to the user community.

Teraflops: 60+ (60+ trillion floating point operations per second) cpu: 4000 cores gpu: 6784 cores Interconnect: 40GB low latency ethernet

For additional information, please contact Research Technology Services at tts-research@tufts.edu


Python

Python is a widely used high-level, general-purpose, interpreted programming language. It is often used as the "glue" within the High Performance Computing community. 

For more information about Spark and PySpark, you can visit the following resources:

https://www.python.org/

https://en.wikipedia.org/wiki/Python_(programming_language)

Getting Started with Python

You can access and start using Python with the following steps:

  1. Connect to the Tufts High Performance Compute Cluster. See Access for a detailed guide.

  2. Load the Python module with the following command:

    module load python

    Note that you can see a list of all available modules (potentially including different versions of Python) by typing:

    module avail

    You can specify a specific version of Python with the module load command or use the generic module name (python) to load the latest version.

  3. Start a Python session by typing:

    python
    print("Hello, World!")

 

Python related:

How can I verify if a particular Python package is installed?
Add-on tools such as numpy and scipy are installed. Others would be under the install tree located at:
/opt/shared/python/
in the version specific site-packages directory.   Another approach uses pip.

> module load python/2.7.6
> pip list

 

For a more detailed overview of Python and how it relates to Big Data or High Performance Computing (HPC) please contact tts-research@tufts.edu for information regarding future workshops.

 

 

 

 


For additional information, please contact Research Technology Services at tts-research@tufts.edu