The Tufts High Performance Compute (HPC) cluster delivers 35,845,920 cpu hours and 59,427,840 gpu hours of free compute time per year to the user community.

Teraflops: 60+ (60+ trillion floating point operations per second) cpu: 4000 cores gpu: 6784 cores Interconnect: 40GB low latency ethernet

For additional information, please contact Research Technology Services at tts-research@tufts.edu


Applications Faq

Application specific Information FAQs

 

Current list of software

You may check the current list of software under modules by:

-bash-3.2$ module available

When in doubt, always be version explicit in the named module you load.

The following table lists the new submission format for interactive sessions. In some cases the program name may have a version specific name. This in not an exhaustive list. For specific examples see the slurm page on this wiki.

Note:

In the following examples,  "..." refers to the needed options that are slurm specific.  Check the slurm page for some examples.  Your specifics may be differrent.

 


application

actual application name

slurm command to run

Comments

matlab

matlab

srun ... matlab

graphics interface

maple

xmaple

srun ... xmaple

graphics interface

maple

maple

srun ... maple

text interface

mathematica

mathematica

srun ... mathematica

graphics interface

mathematica

math

srun ... math

text interface

comsol

comsol

srun ... comsol

graphics interface

xstata

xstata-se

srun ... xstata-se

graphics interface to SE version

stata

stata-se

srun ... stata-se

text interface to SE version

xstata

xstata

srun ... xstata

graphics interface to IC version

stata

stata

srun ... stata

text interface to IC version

ansys

launcher121

srun ... launcher121

graphics interface

ansys workbench

runwb2

srun ... runwb2 -mesa

WorkBench interface(Note:you may or may not need the -mesa option depending on your OpenGL support and version of Workbench. Version 13 doesn't need it.

fluent

fluent

srun ... fluent

graphics interface to 2D and 3D programs

fluent icepak

icepak

srun ... icepak

graphics interface to Icepak

R

R

srun ... R

graphics delivery enabled

abaqus

abq6141

srun ... abq6141

command line interface

abaqus cae

abq6141 cae

srun ... abq6141 cae -mesa

CAE interface

abaqus 6.9-EF2

abq69ef2.exe

srun ... abq69ef2.exe

command line interface

weka

weka.jar

srun ...  java -jar  /opt/shared/weka/weka-3-6-10/weka.jar

graphics interface

tecplot

tec360

srun ... tec360 -mesa

graphics interface

    
    

Cluster scaling considerations

The use of commercial packages like Matlab, Ansys, Mathematica, etc., often have usage scenarios for which one solution does not fit all situations. For example:

Compute Needs

Resources

Development

Laptop/Desktop installation of software instead of cluster use

Batch runs, no GUI

cluster batch slurm partition

SMP parallelism

slurm interactive node access

MPI parallelism

application dependent, slurm mpi partition

Parametric Simulation

slurm batch partition

If you are uncertain of a suitable approach, please contact cluster-support@tufts.edu for assistance.

PetSc related.

How do I install PetSc?
PetSc has a complex build environment that allows one to include various applications. We encourage interested users to build the suite of interest in their home directory. Installation instructions for a particular set of applications can be found here.

Perl related:

How can I verify if a particular Perl package is installed?
Suppose you wanted to see if package GD is installed:
-bash-3.2$ rpm -q perl-GD gd

PDF related:

How do I view pdf files on the cluster?
Use the X11 based program evince. It is already in your PATH.

Tecplot related:

On the cluster Tecplot's help button doesn't work, where do I find TecPlot documentation?
Html docs can be found on the vendor's website
PDF versions are an option as well.

Are there any Tecplot tutorials?
See this tutorial link.

Standalone HPC math libraries:

The following libraries are installed. To check to see if there is an environment module associated with these, try:
-bash-3.2$ module avail

BLACS
BLAS
Lapack
Scalapack
SuperLU
gsl
UMFPack

Corresponding installation locations are found at:

SuperLU: /opt/shared/SuperLU_4.0/lib
Scalapack: /opt/shared/scalapack-1.8.0/
Lapack: /opt/shared/lapack-3.2.1/
BLAS: /opt/shared/BLAS/
BLACS: /opt/shared/BLACS/LIB/
UMFPACK: /opt/shared/UMFPACK/5.5.1
Cula: /opt/shared/cula/

Installed library files for gsl, atlas and many others are located in /usr/lib64/.

Linkage to specific library file(s) can be found in their respective install location at /opt/shared/. Note, you need to specify which libary file, since in general there may be more than one. For example:

> ls /opt/shared/BLACS/LIB/*.a
/opt/shared/BLACS/LIB/blacsCinit_MPI-LINUX-0.a /opt/shared/BLACS/LIB/blacsF77init_MPI-LINUX-0.a /opt/shared/BLACS/LIB/blacs_MPI-LINUX-0.a

For example:
> gcc mycode.c -o mycode -L /opt/shared/SuperLU_4.0/lib -libsuperlu.a

Bioinformatic programs:

How do I access Blast programs on the cluster?
-bash-3.2$ module load blast
-bash-3.2$ srun ... blastx ...
And likewise for other commands. Also consider what queue is best at meeting your compute needs.

Where are the blast commands located?
/opt/shared/ncbi-blast/bin

How do I start the serial version of MrBayes in interactive mode?
-bash-3.2$ module load mrbayes
-bash-3.2$ srun ... mb

How do I start the parallel version of MrBayes in interactive mode?
-bash-3.2$ module load mrbayes
-bash-3.2$ module load openmpi
-bash-3.2$ srun ... mb -openmpi

A similar solution is available for mpi using mvapich2.

How do I start Haploview?
-bash-3.2$ module load java
-bash-3.2$ module load haploview
-bash-3.2$ srun ... java -jar /opt/CTSA/Haploview4.1/Haploview.jar

How do I start plink?
plink requires the presence of two file with .ped and .med extensions. These correspond to the expected input format and corresponding data information. As an example copy the test.ped and test.med files from the installation directory to your home directory.
-bash-3.2$ cp /opt/CTSA/plink-1.06-x86_64/test* .
-bash-3.2$ module load plink
-bash-3.2$ srun ... plink --file test --noweb

How do I access online help for plink?
Command line help can be seen by:
-bash-3.2$ plink --help

Also check the plink site for additional up to date information.

HOw do I start velvet?
-bash-3.2$ module load velvet

Velvet has two versions: velvetg and velveth
To run one of these:
-bash-3.2$ srun ... velvetg (velvet arguments)
Without arguments velvet returns some command line options.

Where is velvet installed?
/opt/shared/velvet-1.0.02/

How do I start Merlin?
-bash-3.2$ module load merlin
-bash-3.2$ srun ... merlin

How do I start Pedcheck?
-bash-3.2$ module load pedcheck
For interactive command line use:
-bash-3.2$ srun ... pedcheck

Submitting as a batch job:
-bash-3.2$ srun ... pedcheck -p myfile.ped -d myfile.dat
where -p <pedigree file> and -d <locus file> are your input files.

What is required to run Solar?
This program requires a user specific ID key. Send a request to cluster-support@tufts.edu requesting one. This can take up to 24 hours for the vendor to respond. This is a one time registration event. Once you have received your key:

Key for username abcde01 is: 3gakiuxe
To register, log in with the specified username, run SOLAR, and use the
`register` command, for example:

solar> register 3gakiuxe

This will create a file named .solar_reg in the home directory.

How do I start Solar?
-bash-3.2$ module load solar
For interactive command line use:
-bash-3.2$ srun ... solar

Note, to follow the Readme file suggestions, run the .tcl scripts this way:
solar> makemibd
solar> doanalysis

How do I run Bowtie?
The install directory is /opt/shared/bowtie/bowtie-0.12.7/ where you will find sample indexes and reads in subdirectories of the same name. As an example, to run bowtie against ecoli indexes with a sample reads file:
-bash-3.2$ module load bowtie
-bash-3.2$ cp /opt/shared/bowtie/bowtie-0.12.7/indexes/e_coli.* .
-bash-3.2$ cp /opt/shared/bowtie/bowtie-0.12.7/reads/e_coli_1000.fq .
-bash-3.2$ srun ... bowtie e_coli e_coli_1000.fq > my_ecoli.outputfile

This will run an interactive job and dump the output to a text file, called my_ecoli.outputfile

How do I run Cufflinks?
The installed version has 3 binaries: cuffcompare cuffdiff cufflinks

Load the Cufflinks module to access them.
-bash-3.2$ module load cufflinks

To test it out, download the test data file from the cufflinks website or grab a copy from:
-bash-3.2$ cp /cluster/shared/dmarshal/cufflinks-test_data.txt .
To run an interactive job:
-bash-3.2$ srun ... cufflinks cufflinks-test_data.txt

How to run Tophat?
All tophat binaries are located at:
/opt/shared/tophat/1.0.14/bin/

To access them and the bowtie dependices:
-bash-3.2$ module load tophat
-bash-3.2$ module load bowtie

To test it out, download the test data set from the tophat site or grab the tar file from:
-bash-3.2$ cp /cluster/shared/dmarshal/ISO/tophat_test_data.tar .
-bash-3.2$ tar xvf tophat_test_data.tar
-bash-3.2$ cd test_data/

-bash-3.2$ srun ... tophat -r 20 test_ref reads_1.fq reads_2.fq

How to run IMa2?
-bash-3.2$ module load ima2/8.27.12

Refer to module output for explicit versions:
-bash-3.2$ module available

How to list command line arguments for IMa2?
-bash-3.2$ srun ... IMa2

How to run a typical IMa2 example?
-bash-3.2$ srun ... IMa2 -iyour_input_data -oyour_output_file_name -q2 -m1 -t3 -b10000 -l100 -s123
Note those commandline options are only for illustration, yours may be very different.

Documentation may be found on the IMa website.

How to get started with bioPerl?
Check the corresponding BioPerl wiki

How to start Structure and the interfaces?
Check the public Structure website for the manual. A parameter file is needed in both cases of usage below.

For the graphical interface:
-bash-4.1$ module load structure-java/2.3.4
-bash-4.1$ srun ... java -jar /opt/shared/structure/2.3.4/class/Structure.jar

For the command line text version:

-bash-4.1$ module load structure
-bash-4.1$ srun ... structure

How to start Desmond?

-bash-4.1$ module load Schrodinger

To see command line arguments for desmond, type:

-bash-4.1$ srun ... desmond

Several arguments are need to specify a proper desmond run.
Check the manual on the Desmond website for details.

Where do I find the PolyPhred programs and how do I access them?
The program binaries are in /opt/shared/phred/. You must load the phred module for access.

-bash-3.2$ module load phred

How does one work with diyabc on the cluster?
The cluster version is simply two linux executables, diyabc_sim and diyabc_cat. The first is for simulation generation and the second is for combining output reference table files. Unlike the Windows version, these are command line tools without a gui user interface. The manual pdf available on the diyabc website details usage. The following is how to run the example from the manual.
-bash-3.2$ module load diyabc
Copy the two input files to your local directory:
-bash-3.2$ cp /opt/shared/diyabc/ga_* .
-bash-3.2$ sbatch ....options.... --wrap="diyabc.output diyabc_sim ga_001 1 1000 4"

Output files can be combined for transfer back to you your PC version for post processing and display.

How is QIIME started?
This functionality is built upon python modules and codes. The environment to support it has many dependencies that are satisfied by loading the QIIME module.

-bash-3.2$ module load QIIME

To submit a job from the QIIME tutorial:
-bash-3.2$ sbatch ....options....   --wrap="check_id_map.py -m Fasting_Map.txt -o mapping_output"

This example is straight forward. However enhancements to bsub options and to QIIME python functions will vary as will data sources locations.

 


For additional information, please contact Research Technology Services at tts-research@tufts.edu