Fundamentals of Accelerated Computing with CUDA C/C++

Instructor:   Asoc. prof. Dr. sc. ing. Arnis Lektauers (NVIDIA Ambassador in Latvia).

Prerequisites:

  • Basic C/C++ competency, including familiarity with variable types, loops, conditional statements, functions, and array manipulations;
  • No previous knowledge of CUDA programming is assumed.

Duration:   8h

Format:      Training course

Certificate: Upon successful completion of the assessment, participants will receive an NVIDIA DLI certificate to recognize their subject matter competency and support professional career growth.

Hardware Requirements: Desktop or laptop computer capable of running the latest version of Chrome or Firefox. Each participant will be provided with dedicated access to a fully configured, GPU-accelerated server in the cloud.

This workshop teaches the fundamental tools and techniques for accelerating C/C++ applications to run on massively parallel GPUs with CUDA®. You’ll learn how to write code, configure code parallelization with CUDA, optimize memory migration between the CPU and GPU accelerator, and implement the workflow that you’ve learned on a new task—accelerating a fully functional, but CPU-only, particle simulator for observable massive performance gains. At the end of the workshop, you’ll have access to additional resources to create new GPU-accelerated applications on your own.

CUDA is used to accelerate CPU-only applications by making them run on GPUs . These CUDA applications are massively parallel and way faster than their CPU-only counterparts. Experience C/C++ application acceleration by:

  • Parallelizing applications to run on GPUs;
  • Optimizing applications by using CUDA techniques like memory management;
  • Learning techniques like concurrency and CUDA streams;
  • Learning tools like Nsight Systems to profile and identify bottlenecks.

Upon completion, you’ll be able to accelerate and optimize existing C/C++ CPU-only applications using the most essential CUDA techniques and Nsight Systems. You’ll understand an iterative style of CUDA development that will allow you to ship accelerated applications quickly.

 

This course is created by NVIDIA Deep Learning Institute in collaboration with HPC Competence Center in Latvia – SuperS – within the EuroCC project.

Full information about the course is available here.