Quad-Core Intel® Xeon® processors
quick reference
Compiler Reports Quick Reference
Enabling Automatic Optimizations
Interprocedural Optimization (IPO) Quick Reference
automatic optimizations
compiler reports
IPO options
READ
using for little-to-big endian conversion
RECL
specifier for OPEN
record buffers
efficient use of
record structures
and alignment
REDUCTION
in the DO directive
summary of data scope attribute clauses
Data Scope Attribute Clauses Overview
OpenMP Directives and Clauses Summary
used in PARALLEL directive
using
variables
reductions in loops
register allocation
report generation
report software pipelining (SWP)
resetting
dynamic profile counters
profile information
restricting optimization
routines
Intel extension
OpenMP* run-time
timing
run-time checking
run-time performance
improving
slowing down
sample of timing
samples
Compiler Optimization Sample
Included Samples
OpenMP* Samples
PGO Sample
Vectorization Sample
code-coverage tool
compiler optimization sample
included samples
IPO samples
location
OpenMP* sample
PGO sample
test-prioritization tool
vectorization samples
scalar clean-up iterations
scalars
allocation of
SCHEDULE
DYNAMIC
GUIDED
RUNTIME
STATIC
using in DO directives
using to specify types and chunk sizes
serial execution
shared scalars
shared scoping
shared variables
single-precision real
Coding Guidelines for Intel(R) Architectures
Improving Run-time Efficiency
SMP systems
software pipelining
Parallelism Overview
SWP Reports
effect of LOOP COUNT on
for IA-64 architecture based applications
optimization
reports
source code
specialized code
Automatic Processor-specific Optimization (IA-32 Architecture Only)
Coding Guidelines for Intel(R) Architectures
Parallelism Overview
specifying symbol visibility
SSE
optimizing
SSE2
SSP
profrun
using with -prof-gen
using with -prof-use
using with -ssp
using with /Qprof-gen
using with /Qprof-use
using with /Qssp
stacks
Optimizing Compilation Process Overview
Stacks: Automatic Allocation and Checking
statement functions
strategies for optimization
Stream_LF records
Streaming SIMD Extensions
Coding Guidelines for Intel(R) Architectures
Key Programming Guidelines for Vectorization
structures
record
subroutines in the OpenMP* run-time library
support
for loop unrolling
for OpenMP*
for optimization
for prefetching
for vectorization
parallel run-time
SWP
SWP reports
using
symbol visibility
specifying
symbol visibility on Linux*
symbol visibility on Mac OS* X
synchronization
Auto-parallelization Overview
Parallelism Overview
constructs
targeting
Targeting a Processor
Targeting Itanium(R) Processors Automatically
Dual-Core Intel(R) Xeon(R) processors
Dual-Core Itanium(R) 2 Processor
IA-32 architecture processors
IA-32 architectures
Intel(R) 64 architecture processors
Intel(R) Core™ Duo processors
Intel(R) Core™ Solo processors
Intel(R) Core™2 Duo processors
Intel(R) Core™2 Extreme processors
Intel(R) Core™2 Quad processors
Intel(R) Pentium(R) 4 processors
Intel(R) Pentium(R) II processors
Intel(R) Pentium(R) III processors
Intel(R) Pentium(R) Pro processors
Intel(R) Pentium(R) processors
Intel(R) Xeon(R) processors
Itanium(R) 2 processors
Itanium(R) processors
Targeting a Processor
Targeting Itanium(R) Processors Automatically
Quad-Core Intel® Xeon® processors
run-time checking
technical applications
test prioritization tool
examples
options
requirements
thread affinity
thread pooling
threads
changing the number of
parallel processing model for
thread-level parallelism
thread sleep time
threshold control for auto-parallelization
timing
OpenMP* routines for
tool options
code coverage tool
profmerge
proforder
profrun
test prioritization
tools
topology maps
transformations
reordering
UBC buffers
unaligned data
unbuffered WRITEs
underflow
Coding Guidelines for Intel(R) Architectures
Stacks: Automatic Allocation and Checking
unformatted files
unvectorizable copy
usage rules
user functions
Controlling Inline Expansion of User Functions
Developer Directed Inline Expansion of User Functions
using
auto-parallelization
Auto-parallelization Overview
Parallelism Overview
dynamic libraries
EQUIVALENCE statements
floating-point conversions
formatted or unformatted files
implied-DO loops
intrinsics
memory
noniterative worksharing SECTIONS
OpenMP*
profile-guided optimization
slow arithmetic operators
timing for an application
unbuffered WRITEs
worksharing
utilities
profmerge
proforder
profrun
variables
automatic
length of
loop assigns for
PGO environment
private scoping for
VECTOR ALWAYS directive
vector copy
VECTOR directive
VECTOR NONTEMPORAL directive
vectorization
examples
options
options for
overview
programming guidelines
Key Programming Guidelines for Vectorization
Vectorization Overview (IA-32 Architecture Only)
reports
support for
vectorizing
loops
Loop Constructs
Profile-guided Optimizations Overview
VOLATILE
using for loop collapsing
VTune™ Performance Analyzer
profrun