异构HPL算法中CPU端高性能BLAS库优化

doi:10.13328/j.cnki.jos.006002

微信服务号

微信订阅号

2025年5月1日 7:45 星期四

首页 > 过刊浏览>2021年第32卷第8期 >2289-2306. DOI:10.13328/j.cnki.jos.006002

PDF HTML阅读 XML下载导出引用引用提醒

异构HPL算法中CPU端高性能BLAS库优化
DOI:
                        10.13328/j.cnki.jos.006002
                    
CSTR:
                        
                    
作者:
                        蔡雨蔡雨
信息技术有限公司,江苏 苏州 215000
在期刊界中查找
在百度中查找
在本站中查找
孙成国孙成国
信息技术有限公司,江苏 苏州 215000
在期刊界中查找
在百度中查找
在本站中查找
杜朝晖杜朝晖
信息技术有限公司,江苏 苏州 215000
在期刊界中查找
在百度中查找
在本站中查找
刘子行刘子行
信息技术有限公司,江苏 苏州 215000
在期刊界中查找
在百度中查找
在本站中查找
康梦博康梦博
信息技术有限公司,江苏 苏州 215000
在期刊界中查找
在百度中查找
在本站中查找
李双双李双双
信息技术有限公司,江苏 苏州 215000
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:蔡雨(1988-),男,高级主管工程师,主要研究领域为CPU架构,性能优化.
刘子行(1977-),男,高级主管工程师,主要研究领域为安全软件.
孙成国(1985-),男,高级主管工程师,主要研究领域为高性能计算,性能优化.
康梦博(1989-),男,高级工程师,主要研究领域为性能优化.
杜朝晖(1975-),男,主任工程师,主要研究领域为安全软件.
李双双(1984-),男,高级工程师,主要研究领域为数学库.
通讯作者:孙成国,E-mail:sunchengguo@hygon.cn
中图分类号:TP303
基金项目:

CPU-side High Performance BLAS Library Optimization in Heterogeneous HPL Algorithm

Author:

CAI Yu
CAI Yu
Information Technology Co., Ltd., Suzhou 215000, China
在期刊界中查找
在百度中查找
在本站中查找
SUN Cheng-Guo
SUN Cheng-Guo
CPU Architecture Design Department, Chengdu Hygon IC Design Co., Ltd., Suzhou 215000, China
在期刊界中查找
在百度中查找
在本站中查找
DU Zhao-Hui
DU Zhao-Hui
CPU Architecture Design Department, Chengdu Hygon IC Design Co., Ltd., Suzhou 215000, China
在期刊界中查找
在百度中查找
在本站中查找
LIU Zi-Xing
LIU Zi-Xing
CPU Architecture Design Department, Chengdu Hygon IC Design Co., Ltd., Suzhou 215000, China
在期刊界中查找
在百度中查找
在本站中查找
KANG Meng-Bo
KANG Meng-Bo
CPU Architecture Design Department, Chengdu Hygon IC Design Co., Ltd., Suzhou 215000, China
在期刊界中查找
在百度中查找
在本站中查找
LI Shuang-Shuang
LI Shuang-Shuang
CPU Architecture Design Department, Chengdu Hygon IC Design Co., Ltd., Suzhou 215000, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

异构HPL（high-performance Linpack）效率的提高需要充分发挥加速部件和通用CPU计算能力，加速部件集成了更多的计算核心，负责主要的计算，通用CPU负责任务调度的同时也参与计算.在合理划分任务、平衡负载的前提下，优化CPU端计算性能对整体效率的提升尤为重要.针对具体平台体系结构特点对BLAS（basic linear algebra subprograms）函数进行优化往往可以更加充分地利用通用CPU计算能力，提高系统整体效率.BLIS（BLAS-like library instantiation software）算法库是开源的BLAS函数框架，具有易开发、易移植和模块化等优点.基于异构系统平台体系结构以及HPL算法特点，充分利用三级缓存、向量化指令和多线程并行等技术手段优化CPU端调用的各级BLAS函数，应用auto-tuning技术优化矩阵分块参数，从而形成了HygonBLIS算法库.与MKL相比，在异构环境下，HPL算法整体性能提高了11.8%.

关键词:BLAS;遗传算法auto-tuning;向量化指令;数据预取;多线程并行

Abstract:

Improving the efficiency of heterogeneous HPL needs to fully utilize the computing power of acceleration components and CPU, the acceleration components integrate more computing cores and are responsible for the main calculation. The general CPU is responsible for task scheduling and also participates in calculation. Under the premise of reasonable division of tasks and load balancing, optimizing CPU-side computing performance is particularly important to improve overall efficiency. Optimizing the basic linear algebra subprogram (BLAS) functions for specific platform architecture characteristics can often make full use of general-purpose CPU computing capabilities to improve the overall system efficiency. The BLAS-like Library Instantiation Software (BLIS) algorithm library is an open source BLAS function framework, which has the advantages of easy development, portability, and modularity. Based on the heterogeneous system platform architecture and HPL algorithm characteristics, this study uses three-level cache, vectorized instructions, and multi-threaded parallel technology to optimize the BLAS functions called by the CPU, applies auto-tuning technology to optimize the matrix block parameters, and eventually forms the HygonBLIS algorithm library. Compared with MKL, the overall performance of the HPL using HygonBLIS has been improved by 11.8% in the heterogeneous environment.

Key words:BLAS;genetic algorithm auto-tuning;vectorization instruction;data prefetching;multi-threading parallelization

引用本文

蔡雨,孙成国,杜朝晖,刘子行,康梦博,李双双.异构HPL算法中CPU端高性能BLAS库优化.软件学报,2021,32(8):2289-2306

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2019-07-25
最后修改日期:2020-03-19
录用日期:
在线发布日期: 2021-08-05
出版日期: 2021-08-06

微信服务号

微信订阅号

引用本文

分享

文章指标

历史

文章二维码

微信服务号

微信订阅号

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码