Date of Award
Spring 2016
Document Type
Thesis
Degree Name
Master of Science (MS)
Department
Computer Science
Committee Director
Mohammad Zubair
Committee Member
Desh Ranjan
Committee Member
Balint Joo
Committee Member
Michele Weigle
Abstract
Lattice quantum chromodynamics (LQCD) stands unique as the only computationally tractable, non-perturbative, and model-independent quantum field theory of the strong nuclear force. The computational core of LQCD is the Wilson Dslash operator, a nearest neighbor stencil operator summing matrix-vector multiplications over lattice points, whose performance is bandwidth-bound on most architectures. Reportedly, up to 90\% of LQCD running time may be spent computing Dslash. In recent years, efforts have been made by researchers to optimize LQCD calculations for floating point coprocessor cards such as GPUs and Intel Xeon Phi Knights Corner (KNC), which boast powerful vector processing units. Most of these efforts in the area of Dslash have focused on single right-hand side solvers. This thesis will present two optimized Dslash kernels which simplify vectorization using multiple right-hand sides and traverse lattices using novel methods. The speedups resulting from these approaches will be explored in the context of KNC's architecture.
Rights
In Copyright. URI: http://rightsstatements.org/vocab/InC/1.0/ This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).
DOI
10.25777/sebh-yy07
ISBN
9781339860022
Recommended Citation
Walden, Aaron.
"An Optimized Multiple Right-Hand Side Dslash Kernel for Intel Xeon Phi"
(2016). Master of Science (MS), Thesis, Computer Science, Old Dominion University, DOI: 10.25777/sebh-yy07
https://digitalcommons.odu.edu/computerscience_etds/11