...
Professor Lenore Cowen, Matt Menke, Noah Daniels and I used the cluster to hierarchically organize the protein structural domains into clusters based on geometric dissimilarity using the program Matt (http://bcb.cs.tufts.edu/mattweb/). The first step in the experiment was to align all the known protein domains using Matt. To compare all the 10,418 representative domains against each implied running Matt approximately 54 million times. While a single run takes only about 0.1 CPU seconds, but running it 54 million times would take approximately 74 days on a single processor. By making use of the ability to run multiple jobs on separate nodes on the cluster we split the job into smaller batches of 0.5 million alignment operations per batch, thus creating in 109 batches in total109 jobs that we submitted to the cluster. Each batch job took approximately 15 hours which is a significant reduction from 74 days. By running the batches jobs simultaneously on separate nodes of the research cluster we were able to reduce the time taken to run perform our job analysis from 2.5 months to less than a day. The speed up not only helped us in running This speed up proved to be an additional benefit when we realized we needed to run an additional experiment to compare our results against a competing program but also publish the outcomes soonerusing an alternative to Matt, as we were able to run that second experiment without significantly delaying our time to publication. This research has resulted in a paper, "Touring Protein Space with Matt", that has been accepted to the International Symposium on Bioinformatics Research and Applications (ISBRA 2010) and will be presented in May.
Recognizing the value of running large tasks on the research cluster and the future CPU intensive programming requirements of the group, Prof. Cowen has contributed additional nodes to the UIT research cluster. While members of the BCB research group (http://bcb.cs.tufts.edu/) get priority to run programs on those nodes anyone having account on the cluster can run programs on them.
...