13.2. GPU Computing#

13.2.1. GPU vs CPU Computing#

The difference between CPU and GPU computing capabilities comes from their design. A CPU is built to execute a single sequence of instructions (a thread) as fast as possible, and a multicore CPU can run several such threads simultaneously; its design goal is primarily low latency, i.e. fast response time. A GPU, in contrast, is designed for high throughput: it devotes most of its hardware to compute resources and executes thousands of threads concurrently, which makes it excel at highly parallel workloads.

../../_images/gpu_vs_cpu.png

Fig. 13.4 GPU design provides vastly greater compute resources particularly for parallel workloads#
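
To make the contrast concrete, here is a minimal sketch (assuming a CUDA-capable GPU and the CuPy package, which is introduced in the next section) comparing a sequential Python loop on the CPU with a single data-parallel array operation on the GPU, where the same element-wise work is spread across thousands of GPU threads.

Element-wise Operation on CPU vs GPU
import numpy as np
import cupy as cp

x_cpu = np.arange(1_000_000, dtype=np.float32)

# CPU: one thread walks the array element by element
y_cpu = np.empty_like(x_cpu)
for i in range(x_cpu.size):
    y_cpu[i] = x_cpu[i] * 2.0 + 1.0

# GPU: the same element-wise work is dispatched as one kernel,
# executed by many lightweight threads in parallel
x_gpu = cp.asarray(x_cpu)          # copy data to GPU memory
y_gpu = x_gpu * 2.0 + 1.0          # runs on the GPU
y_back = cp.asnumpy(y_gpu)         # copy result back to host memory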

13.2.2. How to use GPUs#

Here is how we can use the compute power of a GPU for matrix multiplication with the CuPy package, a GPU-accelerated equivalent of NumPy.

Matrix Multiplication Using CuPy on GPU vs NumPy on CPU
import numpy as np
import cupy as cp
import time

# Matrix size
n = 10000

# Function to perform matrix multiplication on CPU
def matrix_multiplication_cpu():
    # Initialize matrices with random values
    A = np.random.randn(n, n)
    B = np.random.randn(n, n)

    # Measure time for matrix multiplication
    start_time = time.time()
    C = np.dot(A, B)
    end_time = time.time()

    # Return time taken
    return end_time - start_time

# Function to perform matrix multiplication on GPU
def matrix_multiplication_gpu():
    # Initialize matrices with random values
    A = cp.random.randn(n, n)
    B = cp.random.randn(n, n)

    # Make sure the (asynchronous) random initialization has finished before timing
    cp.cuda.Stream.null.synchronize()

    # Measure time for matrix multiplication
    start_time = time.time()
    C = cp.dot(A, B)

    # Synchronize to ensure all GPU operations are complete
    cp.cuda.Stream.null.synchronize()

    end_time = time.time()

    # Return time taken
    return end_time - start_time

# Perform on CPU
cpu_time = matrix_multiplication_cpu()
print(f"Time taken on CPU: {cpu_time:.6f} seconds")

# Perform on GPU (requires a CUDA-capable device)
gpu_time = matrix_multiplication_gpu()
print(f"Time taken on GPU: {gpu_time:.6f} seconds")

Output:

Time taken on CPU: 23.157653 seconds
Time taken on GPU: 7.113699 seconds

Alternatively, PyTorch provides an efficient built-in function for matrix multiplication, torch.matmul.

Matrix Multiplication Using PyTorch on GPU vs CPU
import time
import torch

# Start timer for the whole script
start_time_script = time.time()

# Matrix size
n = 1000

# Initialize matrices with random values
A = torch.randn(n, n)
B = torch.randn(n, n)

# Measure time for CPU operation
start_time_cpu = time.time()
C_cpu = torch.matmul(A, B)
end_time_cpu = time.time()

print(f"Time taken on CPU: {end_time_cpu - start_time_cpu:.6f} seconds")

# Check if a GPU is available
if torch.cuda.is_available():
    # Move matrices to GPU
    A_gpu = A.to('cuda')
    B_gpu = B.to('cuda')
    
    # Warm-up GPU
    torch.matmul(A_gpu, B_gpu)
    
    # Synchronize and measure GPU time
    torch.cuda.synchronize()
    start_time_gpu = time.time()
    C_gpu = torch.matmul(A_gpu, B_gpu)
    torch.cuda.synchronize()
    end_time_gpu = time.time()
    
    print(f"Time taken on GPU: {end_time_gpu - start_time_gpu:.6f} seconds")
else:
    print("No GPU available.")

# End timer for the whole script
end_time_script = time.time()
print(f"Total script execution time: {end_time_script - start_time_script:.6f} seconds")

Output:

Time taken on CPU: 0.379145 seconds
Time taken on GPU: 0.000208 seconds
Total script execution time: 2.112846 seconds
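
Note that the PyTorch example warms up the GPU and calls torch.cuda.synchronize() around the timed region; without this, a host-side timer measures when work was queued rather than when it finished. As an alternative, here is a minimal sketch (assuming a CUDA-capable GPU) that uses CUDA events to time the work on the device itself.

GPU Timing Using CUDA Events in PyTorch
import torch

n = 1000
A = torch.randn(n, n, device='cuda')
B = torch.randn(n, n, device='cuda')

# Warm-up so one-time CUDA initialization is not measured
torch.matmul(A, B)
torch.cuda.synchronize()

# CUDA events record timestamps on the GPU itself
start = torch.cuda.Event(enable_timing=True)
end = torch.cuda.Event(enable_timing=True)

start.record()
C = torch.matmul(A, B)
end.record()

# Wait for the recorded events, then read the elapsed time in milliseconds
torch.cuda.synchronize()
print(f"Time taken on GPU: {start.elapsed_time(end):.3f} ms")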

13.2.3. More Frameworks and Libraries#

In addition to CuPy and PyTorch, many other frameworks and libraries enable applications to run computations on GPUs.

13.2.3.1. Nvidia RAPIDS#

NVIDIA RAPIDS is a suite of open-source, GPU-accelerated data science libraries. Its core component, cuDF, provides a pandas-like DataFrame API whose operations execute on the GPU, and cuML offers GPU-accelerated counterparts of common scikit-learn estimators.
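
As a small illustration (a minimal sketch, assuming RAPIDS/cuDF is installed with a compatible CUDA driver), a cuDF DataFrame can be used much like a pandas DataFrame while the computation runs on the GPU.

Group-By Aggregation Using cuDF on GPU
import cudf

# Build a DataFrame in GPU memory; the API mirrors pandas
df = cudf.DataFrame({
    "key": ["a", "b", "a", "b", "a"],
    "value": [1.0, 2.0, 3.0, 4.0, 5.0],
})

# The group-by and mean are computed on the GPU
result = df.groupby("key").mean()
print(result)

# Convert back to a pandas DataFrame on the host if needed
print(result.to_pandas())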

13.2.3.2. PyTorch Lightning#

PyTorch Lightning is a high-level framework on top of PyTorch that organizes training code into a LightningModule and a Trainer. The Trainer handles device placement, so the same training code can run on CPU, a single GPU, or multiple GPUs by changing its arguments.
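
As a rough sketch (assuming the pytorch_lightning package and a CUDA-capable GPU; the model and data below are illustrative only), training is moved to the GPU by passing the accelerator and devices arguments to the Trainer.

Training on GPU Using PyTorch Lightning
import torch
import pytorch_lightning as pl
from torch.utils.data import DataLoader, TensorDataset

class LinearRegression(pl.LightningModule):
    def __init__(self):
        super().__init__()
        self.layer = torch.nn.Linear(10, 1)

    def training_step(self, batch, batch_idx):
        x, y = batch
        loss = torch.nn.functional.mse_loss(self.layer(x), y)
        return loss

    def configure_optimizers(self):
        return torch.optim.SGD(self.parameters(), lr=0.01)

# Synthetic data; Lightning moves each batch to the GPU automatically
dataset = TensorDataset(torch.randn(256, 10), torch.randn(256, 1))
loader = DataLoader(dataset, batch_size=32)

# accelerator="gpu" with devices=1 runs training on a single GPU
trainer = pl.Trainer(accelerator="gpu", devices=1, max_epochs=1)
trainer.fit(LinearRegression(), loader)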