Conjugate Gradient Solvers with High Accuracy and Bit-wise Reproducibility between CPU and GPU using Ozaki scheme

Daichi Mukunoki, Katsuhisa Ozaki, Takeshi Ogita, Roman Iakymchuk

研究成果: Conference contribution

抄録

On Krylov subspace methods such as the Conjugate Gradient (CG) method, the number of iterations until convergence may increase due to the loss of computational accuracy caused by rounding errors in floating-point computations. At the same time, because the order of the computation is nondeterministic on parallel computation, the result and the behavior of the convergence may be nonidentical in different computational environments, even for the same input. In this study, we present an accurate and reproducible implementation of the unpreconditioned CG method on x86 CPUs and NVIDIA GPUs. In our method, while all variables are stored on FP64, all inner product operations (including matrix-vector multiplications) are performed using the Ozaki scheme. The scheme delivers the correctly rounded computation as well as bit-level reproducibility among different computational environments. In this paper, we show some examples where the standard FP64 implementation of CG results in nonidentical results across different CPUs and GPUs. We then demonstrate the applicability and the effectiveness of our approach in terms of accuracy and reproducibility and their performance on both CPUs and GPUs. Furthermore, we compare the performance of our method against an existing accurateand reproducible CG implementation based on the Exact Basic Linear Algebra Subprograms (ExBLAS) on CPUs.

本文言語English
ホスト出版物のタイトルProceedings of International Conference on High Performance Computing in Asia-Pacific Region, HPC Asia 2021
出版社Association for Computing Machinery
ページ100-109
ページ数10
ISBN(電子版)9781450388429
DOI
出版ステータスPublished - 2021 1 20
イベント2021 International Conference on High Performance Computing in Asia-Pacific Region, HPC Asia 2021 - Virtual, Online, Korea, Republic of
継続期間: 2021 1 202021 1 22

出版物シリーズ

名前ACM International Conference Proceeding Series

Conference

Conference2021 International Conference on High Performance Computing in Asia-Pacific Region, HPC Asia 2021
CountryKorea, Republic of
CityVirtual, Online
Period21/1/2021/1/22

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
  • Software

フィンガープリント 「Conjugate Gradient Solvers with High Accuracy and Bit-wise Reproducibility between CPU and GPU using Ozaki scheme」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル