-ax compiler option
-ipo compiler option
-opt-ra-region-strategy compiler option
example
-opt-report compiler option
-prof-gen-sampling compiler option
using with profrun
using with SSP
-prof-gen compiler option
using with SSP
-prof-genx compiler option
code coverage tool
test prioritization tool
-prof-use compiler option
code coverage tool
profmerge utility
using with profrun
using with SSP
-ssp compiler option
using with SSP
-x compiler option
.dpi file
Code-coverage Tool
Profmerge and Proforder Utilities
Test-prioritization Tool
.dyn file
Code-coverage Tool
Profmerge and Proforder Utilities
Test-prioritization Tool
.dyn files
Profile an Application
Profile-guided Optimization (PGO) Quick Reference
.hpi file
.spi file
Code-coverage Tool
Generating Function Order Lists
Test-prioritization Tool
.tb5 file
/Qax compiler option
/Qipo compiler option
/Qopt-ra-region-strategy compiler option
example
/Qopt-report compiler option
/Qprof-gen-sampling compiler option
using with profrun
using with SSP
/Qprof-gen compiler option
using with SSP
/Qprof-genx compiler option
code coverage tool
test prioritization tool
/Qprof-use compiler option
code coverage tool
profmerge utility
using with profrun
using with SSP
/Qssp compiler option
using with SSP
/Qx compiler option
accessing arrays efficiently
advanced PGO options
aliases
Stacks: Automatic Allocation and Checking
Vectorization Examples
aligning data
Aligning Data
Setting Data Type and Alignment
alignment
Alignment Options
Efficient Compilation
Setting Data Type and Alignment
Understanding Data Alignment
Vectorization Examples
example
options
strategy
Alignment Options
Vectorization Examples
ALLOCATABLE
arrays as arguments
effects of compiler options on allocation
allocating registers
analyzing
code for OpenMP*
data flow
performance
analyzing applications
Performance Enhancement Strategies
Using a Performance Methodology
Using Intel Performance Analysis Tools
Intel(R) Debugger
Intel(R) Threading Tools
VTune™ Performance Analyzer
application
basic block
code coverage
OpenMP*
pipelining
visual presentation
application characteristics
application performance
application tests
architectures
coding guidelines for
argument aliasing
arrays
alignment in vectorization
efficient compilation of
rules for improving I/O performance
using efficiently
assumed-shape arrays
ATOMIC
using
auto-parallelization
Auto-parallelization Overview
Parallelism Overview
diagnostic
enabling
environment variables
guidelines
overview
programming with
threshold
auto-parallelized loops
auto-parallelizer
Auto-parallelization Overview
Parallelism: an Overview
controls
Auto-parallelization Threshold Control and Diagnostics
Parallelism Overview
enabling
auto-vectorization
Coding Guidelines for Intel(R) Architectures
Parallelism Overview
auto-vectorizer
automatic
allocation of stacks
Optimizing the Compilation Process Overview
Stacks: Automatic Allocation and Checking
checking of stacks
optimization for systems based on IA-32 architecture
automatic optimizations
Automatic Optimizations Overview
Enabling Automatic Optimizations
avoid
EQUIVALENCE statements
inefficient data types
mixed arithmetic expressions
slow arithmetic operators
small integer data items
unnecessary operations in DO loops
BARRIER
using
big-endian data
conversion of little-endian data to
buffers
UBC system
cache size intrinsic
capturing IPO output
changing number of threads
checking
floating-point stacks
stacks
chunk size
specifying
clauses
in parallel region directives
in worksharing construct directives
summary table of
Cluster OpenMP*
code coverage tool
color scheme
dynamic counters in
exporting data
syntax of
code design considerations
preparing for OpenMP* programming
code layout
coding
for efficiency in Intel Fortran
guidelines for Intel Architectures
combined parallel and worksharing constructs
compilation
efficient
optimizing
compilation units
compiler optimization
compiler optimizations
compiler reports
High-Level Optimization (HLO)
Interprocedural Optimizations (IPO)
report generation
software pipelining
vectorization
compiler reports quick reference
compiling large programs
compiling samples
compiling with IPO
conditional parallel region execution
controlling
auto-parallelizer diagnostics
Auto-parallelization Threshold Control and Diagnostics
Parallelism Overview
data scope attributes
inline expansion
COPYIN
summary of data scope attribute clauses
correct usage of countable loop
countable loop
correct usage of
counters for dynamic profile
CPU dispatch
automatic
CPU time
create libraries using IPO
creating
DPI lists
multithreaded applications
criteria
for inline function expansion
cross-iteration dependencies
data alignment
data format
alignment
Alignment Options
Vectorization Examples
dependence
Auto-parallelization: Threshold Control and Diagnostics
Pipelining for IA-64 Architecture
options
partitioning
prefetching
HLO Overview
Loop Count and Loop Distribution
scope attribute clauses
sharing
structure
type
Improving Run-time Efficiency
Parallelism Overview
Vectorization Overview
data prefetches
data scope attribute clauses
data types
dataflow analysis
Auto-parallelization Overview
Parallelism Overview
DEFAULT
summary of data scope attribute clauses
using
deferred-shape arrays
denormal exceptions
denormal numbers
flush-to-zero
denormals-are-zero
dequeuing
derived-type components
determining parallelization
device-specific blocksize
diagnostic reports
diagnostics
Parallelism: an Overview
Vectorization Key Programming Guidelines
auto-parallelizer
Auto-parallelization Threshold Control and Diagnostics
Parallelism Overview
difference operators
differential coverage
directives
commons
dcommons
IVDEP
NOVECTOR
records
sequence
structure
VECTOR
VECTOR ALWAYS
VECTOR NONTEMPORAL
directives for OpenMP*
Combined Parallel and Worksharing Constructs
Worksharing Construct Directives
ATOMIC
BARRIER
CRITICAL
DO
END DO
END PARALLEL
END PARALLEL DO
END PARALLEL SECTIONS
END SECTIONS
END SINGLE
FLUSH
MASTER
ORDERED
PARALLEL
PARALLEL DO
PARALLEL SECTIONS
PARALLEL WORKSHARE
SECTION
SECTIONS
SINGLE
WORKSHARE
disabling
function splitting
inlining
disabling optimization
disk I/O
efficient use of
DISTRIBUTE POINT
using
DO constructs
order of
double-precision
numbers
Coding Guidelines for Intel Architectures
Improving Run-time Efficiency
OpenMP* Run-time Library Routines
dual-core
Dual-Core Intel(R) Xeon(R) processors
Dual-Core Itanium(R) 2 Processor
dual core thread affinity
dummy arguments
Improving I/O Performance
Using Arrays Efficiently
dumping profile information
Dumping Profile Information
Interval Profile Dumping
dyn files
Basic PGO Options
Dumping and Resetting Profile Information
Dumping Profile Information
Example of Profile-Guided Optimization
PGO Environment Variables
dynamic-information files
dynamic information
PGO Environment Variables
Profile-guided Optimizations Overview
dumping profile information
files
resetting profile counters
threads