You've been logged out of GDC Vault since the maximum users allowed for this account has been reached. To access Members Only content on GDC Vault, please log out of GDC Vault from the computer which last accessed this account.

Click here to find out about GDC Vault Membership options for more users.


Session Name:

Optimizing DX12/DXR GPU Workloads using Nsight Graphics: GPU Trace and the Peak-Performance-Percentage (P3) Method (Presented by NVIDIA)


This talk shows how Nsight GPU Trace can be used to determine the performance limiters of any DX12 workload on NVIDIA Turing GPUs, and improve performance by applying architecture-aware optimizations. Because the tool captures all of its metrics in a single pass (no frame replay), it can be used on DX12 frames that use asynchronous compute or copy queues. After recaping what the Peak-Performance-Percentage Method is, the talk shows how it can be applied to unlock perf speedups on various workloads, including: compute shaders with large thread-group sizes, pixel shaders with out-of-order completion, ray-tracing BVH updates and ray-tracing denoisers.

Did you know free users get access to 30% of content from the last 2 years?

Get your team full access to the most up to date GDC content

  • GDC 2019
  • Louis Bavoil
  • free content
  • Programming
  • Programming