Yes, believe me, it starts at the very basics. It is in C, so you do not have the barrier of C++ with object-oriented programming.
It does not teach you much about parallelized computing as such, but it forces you to write parallelized code for simple examples. And remember: simple things work, complex things never work!
You just have to install the NVIDIA CUDA Toolkit on your rig; it is quite a huge package.
I sent the GRC to your address. I feel like I am funding myself from 22 years ago :-) May the source be with you, and take care @applepiie
Thanks, it's just great! I wrote a comment on the other part.