Abstract
Basic Linear Algebra Subprograms (BLAS) is a frequently used numerical library for linear algebra computations. However, it places little emphasis on computational accuracy, especially with respect to the accuracy assurance of the results. Although some algorithms for ensuring the computational accuracy of BLAS operations have been studied, there is a need for performance evaluation in advanced computer architectures. In this study, we parallelize high-precision matrix-matrix multiplication using thread-level parallelism. In addition, we conduct a performance evaluation from the viewpoints of execution speed and accuracy. We implement a method to convert dense matrices into sparse matrices by exploiting the nature of the target algorithm and adapting sparse-vector multiplication. Results obtained using the FX100 supercomputer system at Nagoya University indicate that (1) implementation with the ELL format achieves 1.43x speedup and (2) a maximum of 38x speedup compared to conventional implementation for dense matrix operations with dgemm.
Original language | English |
---|---|
Title of host publication | Proceedings - 2018 IEEE 32nd International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2018 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1093-1102 |
Number of pages | 10 |
ISBN (Print) | 9781538655559 |
DOIs | |
Publication status | Published - 2018 Aug 3 |
Externally published | Yes |
Event | 32nd IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2018 - Vancouver, Canada Duration: 2018 May 21 → 2018 May 25 |
Other
Other | 32nd IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2018 |
---|---|
Country/Territory | Canada |
City | Vancouver |
Period | 18/5/21 → 18/5/25 |
Keywords
- Accuracy Assurance
- Component
- Error-free Transformation
- High-precision Matrix-Matrix Multiplications
- Sparse Matrix-vector Multiplications
- Thread Parallelism
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Networks and Communications
- Hardware and Architecture
- Information Systems and Management