Abstract
Over the past decade, there has been an increased interest in designing and implementing high-performance cluster systems by utilizing commodity components that are connected through standard network technology. Networked clusters of computers are commonly used to either process multiple sequential jobs concurrently, or to execute complex scientific and commercial parallel applications that are based on a message passing paradigm. In some areas, these clusters represent a cost-effective alternative to the more expensive supercomputers that are in use today. However, workstation clusters normally lack the high-powered interconnect fabric, as well as the more optimized protocol system architecture that is an integral part of commercial supercomputer systems. This study quantifies the speedup of parallel applications in a cluster environment in an analytical model that takes parallel processing, internode communication, and I/O latency into consideration. The model illustrates the (relative) impact that CPU and I/O parallelism has on the speedup behavior based on the 3 different inter-node communication scenarios: broadcast, nearest-neighbor, and request-response. This work revealed that it is important to consider the performance impact that the application has with regard to features such as remote memory access, completion notification, and address translation issues. & 2005 IEEE.
| Original language | American English |
|---|---|
| Pages (from-to) | 311-318 |
| Number of pages | 8 |
| Journal | Conference Proceedings - IEEE SOUTHEASTCON |
| State | Published - 2005 |
| Event | IEEE Southeastcon 2005: Excellence in Engineering, Science and Technology - Ft. Lauderdale, United Kingdom Duration: Apr 8 2005 → Apr 10 2005 |
ASJC Scopus Subject Areas
- Electrical and Electronic Engineering