JCSE, vol. 6, no. 4, pp.287-293, 2012
DOI: http://dx.doi.org/10.5626/JCSE.2012.6.4.287
A Comparative Performance Study for Compute Node Sharing
Jeho Park, Shui F. Lam
Computing and Information Services, Harvey Mudd College, Claremont, CA, USA/ Computer Engineering and Computer Science, California State University Long Beach, Long Beach, CA, USA
Abstract: We introduce a methodology for the study of the application-level performance of time-sharing parallel jobs on a set of
compute nodes in high performance clusters and report our findings. We assume that parallel jobs arriving at a cluster
need to share a set of nodes with the jobs of other users, in that they must compete for processor time in a time-sharing
manner and other limited resources such as memory and I/O in a space-sharing manner. Under the assumption, we developed
a methodology to simulate job arrivals to a set of compute nodes, and gather and process performance data to calculate
the percentage slowdown of parallel jobs. Our goal through this study is to identify a better combination of jobs that
minimize performance degradations due to resource sharing and contention. Through our experiments, we found a couple
of interesting behaviors for overlapped parallel jobs, which may be used to suggest alternative job allocation schemes
aiming to reduce slowdowns that will inevitably result due to resource sharing on a high performance computing cluster.
We suggest three job allocation strategies based on our empirical results and propose further studies of the results using a
supercomputing facility at the San Diego Supercomputing Center.
Keyword:
Resource sharing; Resource allocation; Time-sharing cluster; Job scheduling; High performance computing; Percentage slowdown
Full Paper: 128 Downloads, 2679 View
|