A parallel computing method using blocked format with optimal partitioning for SpMV on GPU

Publication type:
Journal article
Authors:
Yang, Wangdong*;Li, Kenli*;Li, Keqin
Corresponding authors:
Yang, Wangdong;Li, Kenli
Author affiliations:
[Li, Keqin; Yang, Wangdong; Yang, WD; Li, KL; Li, Kenli] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China.
[Yang, Wangdong] Hunan City Univ, Coll Informat Sci & Engn, Yiyang 413000, Hunan, Peoples R China.
[Li, Kenli] Natl Supercomp Ctr Changsha, Changsha 410082, Hunan, Peoples R China.
[Li, Keqin] SUNY Coll New Paltz, Dept Comp Sci, New Paltz, NY 12561 USA.
Corresponding affiliations:
[Yang, WD; Li, KL; Yang, Wangdong] H
[Li, Kenli] N
Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China.
Hunan City Univ, Coll Informat Sci & Engn, Yiyang 413000, Hunan, Peoples R China.
Natl Supercomp Ctr Changsha, Changsha 410082, Hunan, Peoples R China.
Language:
English
Keywords:
Blocked format;CPU/GPU;Dynamic programming;Heterogeneous parallel computing;Partitioning;Reordering;Sparse matrix–vector multiplication
Journal:
Journal of Computer and System Sciences
ISSN:
0022-0000
Year:
2018
Volume:
92
Issue:
C
Pages:
152-170
Funding:
National Natural Science Foundation of China (NSFC) [61572175, 61370095, 61472124]; Key Program of National Natural Science Foundation of China (NSFC) [61432005]; National Outstanding Youth Science Program of National Natural Science Foundation of China (NSFC) [61625202]; International (Regional) Cooperation and Exchange Program of National Natural Science Foundation of China [61661146006]
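Institutional attribution:
This institution is the corresponding institution
Department:
College of Information and Electronic Engineering
Abstract: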
For large-scale sparse matrices, SpMV cannot be processed on GPU using the common storage formats because of the memory limitation. In addition, the parallel effect is poor using general formats for the sparse matrices with extremely uneven distribution of non-zero elements, which leads to performance deterioration. This paper presents an optimal partitioning strategy based on the distribution of non-zero elements in a sparse matrix to improve the performance of SpMV, and uses a hybrid format, which mixes CSR and ELL formats, to store the blocks partitioned from the sparse matrix. The hybrid b...
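
For readers unfamiliar with the two storage formats named in the abstract, the following is a minimal sketch of SpMV on a small matrix stored once in CSR and once in ELL. It is an illustration only: the function names, the sample matrix, and the use of -1 as an ELL padding marker are assumptions made here, and the sketch does not reproduce the paper's partitioning strategy, its hybrid blocked format, or its GPU kernels.

```python
# Illustrative sketch only: plain CSR and ELL SpMV on one small matrix.
# Function names, the sample data, and the -1 padding marker are assumptions,
# not taken from the paper.

def spmv_csr(row_ptr, col_idx, values, x):
    """Compute y = A @ x with A stored in CSR (row pointers, column indices, values)."""
    y = [0.0] * (len(row_ptr) - 1)
    for row in range(len(y)):
        # Non-zeros of this row occupy values[row_ptr[row]:row_ptr[row + 1]].
        for k in range(row_ptr[row], row_ptr[row + 1]):
            y[row] += values[k] * x[col_idx[k]]
    return y


def spmv_ell(ell_cols, ell_vals, x):
    """Compute y = A @ x with A stored in ELL: every row padded to the same width.

    Padding slots carry column index -1 and are skipped.
    """
    y = [0.0] * len(ell_cols)
    for row, (cols, vals) in enumerate(zip(ell_cols, ell_vals)):
        for col, val in zip(cols, vals):
            if col >= 0:
                y[row] += val * x[col]
    return y


# A = [[1, 0, 2],
#      [0, 3, 0],
#      [4, 5, 6]]
x = [1.0, 2.0, 3.0]

y_csr = spmv_csr(
    row_ptr=[0, 2, 3, 6],
    col_idx=[0, 2, 1, 0, 1, 2],
    values=[1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    x=x,
)

y_ell = spmv_ell(
    ell_cols=[[0, 2, -1], [1, -1, -1], [0, 1, 2]],
    ell_vals=[[1.0, 2.0, 0.0], [3.0, 0.0, 0.0], [4.0, 5.0, 6.0]],
    x=x,
)

print(y_csr)  # [7.0, 6.0, 32.0]
print(y_ell)  # [7.0, 6.0, 32.0]
```

In this example the ELL layout stores three padding slots because the rows have 2, 1, and 3 non-zeros. CSR stores no padding but has rows of unequal length. That trade-off is the general reason the two formats are sometimes combined.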