Threaded accurate matrix-matrix multiplications with sparse matrix-vector multiplications

Shuntaro Ichimura, Takahiro Katagiri, Katsuhisa Ozaki, Takeshi Ogita, Toru Nagai

研究成果: Conference contribution

3 引用 (Scopus)

抜粋

Basic Linear Algebra Subprograms (BLAS) is a frequently used numerical library for linear algebra computations. However, it places little emphasis on computational accuracy, especially with respect to the accuracy assurance of the results. Although some algorithms for ensuring the computational accuracy of BLAS operations have been studied, there is a need for performance evaluation in advanced computer architectures. In this study, we parallelize high-precision matrix-matrix multiplication using thread-level parallelism. In addition, we conduct a performance evaluation from the viewpoints of execution speed and accuracy. We implement a method to convert dense matrices into sparse matrices by exploiting the nature of the target algorithm and adapting sparse-vector multiplication. Results obtained using the FX100 supercomputer system at Nagoya University indicate that (1) implementation with the ELL format achieves 1.43x speedup and (2) a maximum of 38x speedup compared to conventional implementation for dense matrix operations with dgemm.

元の言語English
ホスト出版物のタイトルProceedings - 2018 IEEE 32nd International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2018
出版者Institute of Electrical and Electronics Engineers Inc.
ページ1093-1102
ページ数10
ISBN(印刷物)9781538655559
DOI
出版物ステータスPublished - 2018 8 3
外部発表Yes
イベント32nd IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2018 - Vancouver, Canada
継続期間: 2018 5 212018 5 25

Other

Other32nd IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2018
Canada
Vancouver
期間18/5/2118/5/25

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Hardware and Architecture
  • Information Systems and Management

フィンガープリント Threaded accurate matrix-matrix multiplications with sparse matrix-vector multiplications' の研究トピックを掘り下げます。これらはともに一意のフィンガープリントを構成します。

  • これを引用

    Ichimura, S., Katagiri, T., Ozaki, K., Ogita, T., & Nagai, T. (2018). Threaded accurate matrix-matrix multiplications with sparse matrix-vector multiplications. : Proceedings - 2018 IEEE 32nd International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2018 (pp. 1093-1102). [8425535] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/IPDPSW.2018.00168