Going Parallel: Part 4 -- Enter Intel Parallel Studio
Previously I examined a Dhrystone app and identified hotspots. Since then Intel Parallel Studio has been released, so I thought I'd convert the project to use it. This time I concentrate on converting my project to Visual Studio, then use Parallel Studio begin implementing parallelism.
Links to previous sections of blog
- Going Parallel: Part 1 -- Doing Two Things at Once. Impossible!
- Going Parallel: Part 2 -- So Who's Really Writing Parallel Applications?
- Going Parallel: Part 3 -- Let's Get started!
Steps to using Parallel Studio
First of all you must already have Visual Studio installed -- either 2005 or 2008. I'm using Visual; Studio 2005.
Parallel Studio can be downloaded from http://software.intel.com/en-us/intel-parallel-studio-home/ by pressing the Evaluate button.
After you have downloaded and installed the product, you might want to consider downloading the Intel Parallel Advisor Lite from http://software.intel.com/en-us/articles/intel-parallel-advisor-lite/. I'll use the Advisor Lite in a subsequent blog, but you might as well get it whilst you are in 'downloading mode'
Creating a Visual Studio Project
First Create new Win32 Console application from the File | New | Project menu
Once you've put a valid path in the Location field, the 'OK' button becomes live. Press OK!
Press the Next> button, and edit the settings as follows...
Finally press Finish.
In the new project, add the existing files from the previous blog (dhry_1.c, dhry_2.c, dhry.h)
After adding all three files, open the project properties and add the TIME to the Preprocessor Definitions
Build the project with the default settings and make sure the program runs.
Using the Intel C++ Compiler
From the solution context menu select 'Use Intel C++...'
First thing you should notice is a Composer icon appearing in the solution browser.
Adding the Parallelism using OpenMP
I like OpenMP. It's an easy way to add parallelism to code.In my code example, I added a single statement to the beginning of the main lop in the program
#pragma
omp parallel forfor (Run_Index = 1; Run_Index <= Number_Of_Runs; ++Run_Index)
To enable OpenMP, you need to modify the compiler options:
I suggest that you also add an extra command-line option to make the compiler spew out any remarks. We can then get better information about any OpenMP directives that the compiler handles.
I then built the code with the Intel compiler ...
The output window shows that the OpenMP code was parallelised
dhry_1.c(145): (col. 1) remark: OpenMP DEFINED LOOP WAS PARALLELIZED.
Error, Errors Errors!
On running the application I got an application error:
In the next blog I'll look at using Static and Dynamic analysis to help correct the errors I have just introduced.

