Developed a CUDA version of the FDTD method and achieved a speedup 40x. Implemented on a NVIDIA Quadro FX 3800 GPU, which has 192 SPs, 1GB global memory, and a memory bandwidth of 51.2 GB/s.
SIAM Journal on Numerical Analysis, Vol. 26, No. 6 (Dec., 1989), pp. 1474-1486 (13 pages) An explicit finite difference algorithm is developed to approximate the solution of a nonlinear and nonlocal ...
We analyze finite difference methods for the Gross-Pitaevskii equation with an angular momentum rotation term in two and three dimensions and obtain the optimal convergence rate, for the conservative ...