I spent the past week mostly reading about how to program CUDA-enabled devices effectively, looking through CuPy’s documentation, and studying how QuTiP’s data layer works.

One does not simply optimize GPU code

Even though I will not be working on low-level CUDA code, at least in the coming weeks, as I will be leveraging…