Array Operation Synthesis to Optimize HPF Programs on Distributed Memory Machines

Gwan Hwan Hwang*, Jenq Kuen Lee, Roy Dz Ching Ju

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

5 Citations (Scopus)


An increasing number of programming languages, such as Fortran 90, HPF, and APL, provide a rich set of intrinsic array functions and array expressions. These constructs, which constitute an important part of data parallel languages, provide excellent opportunities for compiler optimizations. The synthesis of consecutive array operations or array expressions into a composite access function of the source arrays at compile time has been shown (A. T. Budd, ACM Trans. Programm. Lang. Syst.6 (July 1984), 297-313; G. H. Hwang et al., in "Proc. of ACM SIGPLAN Conference on Principles and Practice of Parallel Programming, 1995," pp. 112-122) to be an effective scheme for optimizing programs on flat shared memory parallel architectures. It remains, however, to be studied how the synthesis scheme can be incorporated into optimizing HPF-like programs on distributed memory machines by taking into account communication costs. In this paper, we propose solutions to address this issue. We first show how to perform array operation synthesis on HPF programs, and we demonstrate its performance benefits on distributed memory machines with real applications. In addition, to prevent a situation we call "synthesis anomaly," we present an optimal solution to guide the array synthesis process on distributed memory machines. Due to the optimal problem being NP-hard, we further develop a practical strategy that compilers can use on distributed memory machines with HPF programs. Our synthesis engine is implemented as a Web-based tool, called Syntool, and experimental results show significant performance improvement over the base codes for HPF code fragments from real appli- cations on parallel machines. Our experiments were performed on three distributed memory machines: an 8-node DEC Alpha Farm, a 16-node IBM SP-2, and a 16-node nCUBE/2.

Original languageEnglish
Pages (from-to)467-500
Number of pages34
JournalJournal of Parallel and Distributed Computing
Issue number4
Publication statusPublished - 2001 Apr
Externally publishedYes


  • HPF compiler; array operation synthesis; distributed memory machines; compiler optimization; parallelizing compiler

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Hardware and Architecture
  • Computer Networks and Communications
  • Artificial Intelligence


Dive into the research topics of 'Array Operation Synthesis to Optimize HPF Programs on Distributed Memory Machines'. Together they form a unique fingerprint.

Cite this