Channels ▼
RSS

Web Development

CUDA, Supercomputing for the Masses: Part 20


Summary

As this article has shown, the analysis capabilities of NVIDIA's Parallel Nsight v1.0 are powerful indeed! Still, this tutorial has barely scratched the surface plus it is difficult to demonstrate how interactive Parallel Nsight makes the analysis process. To get a better sense of the capabilities of this software, please view some of the online video demonstrations or better yet, download the trial version and try it for yourself.

This article also provides an example hybrid CPU/GPU code that can concurrently share a computation between both a host system processing core and a GPU. Through the Parallel Nsight traces and reports, it should also be clear why primitive restart is such a fast rendering technique for applications that mix graphics and CUDA.

It is also worth noting that Parallel Nsight is an evolving project, with Parallel Nsight 1.5 coming soon with additional improvements to Analysis and NVTX. You should be aware of the following when using version 1.0 of Parallel Nsight for analysis:

  • Asynchronous data transfers when using mapped pinned memory do not appear in the application traces.
  • It is unclear if OpenMP will be supported.
  • Call cudaThreadExit() at the end of main()to ensure that the Parallel Nsight receives all the trace information from short-lived applications like test programs. If the application exit behavior is complicated, a fallback solution is to use cudaThreadSynchronize() or cudaThreadExit() in an atexit() registered function.


Rob Farber is a senior scientist at Pacific Northwest National Laboratory. He has worked in massively parallel computing at several national laboratories and as co-founder of several startups. He can be reached at rmfarber@gmail.com


Related Reading


More Insights






Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

Dr. Dobb's encourages readers to engage in spirited, healthy debate, including taking us to task. However, Dr. Dobb's moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing or spam. Dr. Dobb's further reserves the right to disable the profile of any commenter participating in said activities.

 
Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.
 
Dr. Dobb's TV