TY - GEN
T1 - Array operation synthesis to optimize HPF programs
AU - Hwang, Gwan Hwan
AU - Lee, Jenq Kuen
AU - Ju, D. C.R.
N1 - Publisher Copyright:
© 1996 IEEE.
PY - 1996
Y1 - 1996
N2 - The synthesis of consecutive array operations or array expressions into a composite access function of the source arrays at compile time can reduce the redundant data movement, temporary storage usage and loop synchronization overhead on flat shared-memory parallel machines with uniform memory accesses. However, it remains open how the synthesis scheme can be incorporated into optimizing HPF (High-Performance Fortran) programs on distributed-memory machines by taking into account communication costs. In this paper, we propose solutions to address this open problem. We first apply the array operation synthesis (developed for Fortran 90 programs) to HPF programs and demonstrate its performance benefits on distributed-memory machines. In addition, to prevent a situation we call the »synthesis performance anomaly», we derive a cost model and present an optimal solution based on this model to guide the array synthesis process on distributed-memory machines. We also show that the optimal problem is NP-hard. Therefore, we develop a practical heuristic algorithm for compilers to devise a synthesis strategy on distributed-memory machines with HPF programs. Experimental results show a significant performance improvement over the base codes for HPF code fragments from real applications on a DEC Alpha processor farm by incorporating our proposed optimizations.
AB - The synthesis of consecutive array operations or array expressions into a composite access function of the source arrays at compile time can reduce the redundant data movement, temporary storage usage and loop synchronization overhead on flat shared-memory parallel machines with uniform memory accesses. However, it remains open how the synthesis scheme can be incorporated into optimizing HPF (High-Performance Fortran) programs on distributed-memory machines by taking into account communication costs. In this paper, we propose solutions to address this open problem. We first apply the array operation synthesis (developed for Fortran 90 programs) to HPF programs and demonstrate its performance benefits on distributed-memory machines. In addition, to prevent a situation we call the »synthesis performance anomaly», we derive a cost model and present an optimal solution based on this model to guide the array synthesis process on distributed-memory machines. We also show that the optimal problem is NP-hard. Therefore, we develop a practical heuristic algorithm for compilers to devise a synthesis strategy on distributed-memory machines with HPF programs. Experimental results show a significant performance improvement over the base codes for HPF code fragments from real applications on a DEC Alpha processor farm by incorporating our proposed optimizations.
UR - http://www.scopus.com/inward/record.url?scp=0005293335&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0005293335&partnerID=8YFLogxK
U2 - 10.1109/ICPP.1996.538553
DO - 10.1109/ICPP.1996.538553
M3 - Conference contribution
AN - SCOPUS:0005293335
T3 - Proceedings of the International Conference on Parallel Processing
SP - 1
EP - 8
BT - Software
A2 - Pingali, K.
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 25th International Conference on Parallel Processing, ICPP 1996
Y2 - 12 August 1996 through 16 August 1996
ER -