期刊文献+

国产万亿次机群系统NPB性能测试分析 被引量:13

Performance Analysis of NPB Benchmark on Domestic Tera-Scale Cluster Systems
在线阅读 下载PDF
导出
摘要 对3个国产万亿次机群系统进行了NPB性能测试分析,重点研究大规模并行处理时(处理器数目达到上千个)的性能特点和趋势.分析了不同的处理器、互连网络等系统配置对NPB性能的影响,发现NPB的8个程序在3个万亿次机器上的性能特点和表现并不一致,表明国产高性能机群在设计上正在逐渐走出同质化的趋势,向多样化发展.进一步分析表明,目前NPB程序的可扩展性可以达到几百个处理器,但尚不能达到上千个处理器,NPB程序能发挥出的系统峰值的百分比仍然徘徊在10%左右,机群系统的并行可扩展性和应用程序对机器运算潜能的利用还需要进一步提高.对于处理器数目达到上千个的万亿次机群系统来说,对集合通信和细粒度通信能力的支持亟需提高. In this paper, NPB benchmarking is performed on three domestic tera-scale cluster systems with emphasis on the performance characteristics and trends when carrying out tera-scale parallel computing on systems with thousands of processors. The effects of different system configurations (processor, interconnection network, etc.) on the final NPB performance are analyzed and it is found that the programs in NPB suites got their best performance on different clusters. Through further analysis, it is indicated that the scalability of NPB programs can reach hundreds of processors, but can't reach thousands of processors. Most of the NPB programs can only exploit around 10% of the system peak performance, so the scalability of cluster systems and real application performance on tera-scale cluster systems need further improvement. For manufacturing of tera-scale cluster systems with thousands of processors, the performance of collective communication and fine-grained message passing needs further improvement.
出处 《计算机研究与发展》 EI CSCD 北大核心 2005年第6期1079-1084,共6页 Journal of Computer Research and Development
基金 国家自然科学基金项目(60303020) 国家"九七三"重点基础研究发展规划基金项目(G1999032805) 国家"八六三"高技术研究发展计划重大专项基金项目(2004AA104020) 中国科学院软件研究所培育基金项目(CXK25628)
关键词 万亿次机群 性能评测 NPB tera-scale cluster system performance evaluation NAS parallel Benchmarks(NPB)
  • 相关文献

参考文献16

  • 1Lei Hu, Ian Gorton. Performance evaluation for parallel systems:A survey. University of NSW, Sydney, Australia, Tech Rep:UNSW-CSE- TR-9707, 1997
  • 2Marcelo Lobosco, Vitor Santos Costa, Claudio L. de Amorim.Performance evaluation of fast ethernet, giganet and myrinet on a Cluster. In: Proc. Int'l Conf. Computer Science. Berlin:Springer-Verlag, 2002
  • 3Jack Dongarra. Performance of various computers using standard linear equations software. University of Tennessee Computer Science, America, Tech Rep: CS-89-85, 2003
  • 4A.B. Yoo, B. R. de Supinski, F. Mueller, et al. Memory benchmarks for SMP-based high performance parallel computers.Lawrence Livermore National Laboratory, Tech Rep: UCRL-JC-146246, 2001
  • 5罗水华,杨广文,张林波,石威,郑纬民.并行集群系统的Linpack性能测试分析[J].数值计算与计算机应用,2003,24(4):285-292. 被引量:10
  • 6都志辉,吴博,刘鹏,陈渝,王小鸽,李三立.LINPACK与机群系统的LINPACK测试[J].计算机科学,2002,29(5):8-10. 被引量:15
  • 7.TOP500[EB/OL].http: ∥ www. top500. org,2004-10-02.
  • 8.TOP100[EB/OL].http:∥www.samss.org.cn,2004-10-02.
  • 9HPC Challenge Benchmark. http: ∥ icl.cs.utk.edu/hpcc/, 2004-12-21
  • 10NAS Parallel Benchmarks. http: ∥ science.nas.nasa.gov/Software/NPB, 2004-09-15

二级参考文献28

  • 1黄铠 徐志伟.可扩展并行计算技术、结构与编程[M].北京:机械工业出版社,2000..
  • 2胡明昌 胡伟武 唐志敏.MPI并行程序在曙光3000和微机机群上性能的比较[A]..863计划智能计算机主题学术会议[C].,2001.3.
  • 3史岗 胡伟武 韩承德.曙光3000上分布式共享存储系统的实现[A]..863计划智能计算机主题学术会议[C].,2001.3.
  • 4Eicken T, Culler D E, Goldstein S C, Schauser K E. Active messages: a mechanism for integrated communication and computation[C]. In: Proceedings of the 19th International Symposium on Computer Architecture, 1992.
  • 5Culler D E, Karp R M, Patterson D A, Sahay A, Schauser K E,Santos E, Subramonian R and yon T Eicken. LogP: towards a realistic model of parallel computation[C]. In:Fourth ACM SIGPLAN Sym-posium on Principles and Practice of Parallel Programming, 1993. 262-273.
  • 6Alexandrov, M. Ionescu, K. Schauser, and C. Scheiman. Log-GP: Incorporating long messages into the Log P model - one step closer towards a realistic model for parallel computation[C]. In:7th Annual Symposium on Parallel Algorithms and Architectures, May 1995.
  • 7Martin R P, Vahdat A M,Culler D E and Anderson T E. Effects of communication latency, overhead, and bandwidth in a cluster architecture[C].In : Proceedings of the 19th International Symposium on Computer Architecture, 1997.
  • 8Mukherjee S S and Hill M D. A survey of user-level network interfaces for system area networks, computer sciences department[R]. University of Wisconsin-Madison, Technical Repor # 1340, Feb. 1997.
  • 9Araki S,Bilas A, Dubnicki C, Edler J, Konishi K, Philbin J.User-space communication: A quantitative study[C]. In:Proc.of The 1998 SC98 conference. Nov, 1998.
  • 10Barak A,Gilderman I, Metrik I. Performance of the communication layers of TCP/IP with the Myrinet gigabit LAN[J]. Computer Communications,1999. 22, 989-997.

共引文献18

同被引文献119

引证文献13

二级引证文献44

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部