next up previous
Next: Cycles, instructions, TLB and Up: HPM event sets and Previous: HPM event sets and


Cycles, instructions, floats including divides, multiply-adds, loads and stores

Event set 60 is the default if no event set is specified. Use it for counts of cycles, total instruction and various floating point operations. The floating point operations include: divides, multiply-adds1, load, stores and completed floating point instructions. Among the derived metrics you can find an estimate of the floating point performance (excluding store operations) of the code section under investigation.

Raw counters
PM_FPU_FDIV Number of floating point divisions (hardware)
PM_FPU_FMA Number of floating point multiply-additions
PM_FPU0_FIN Operations on floating point unit 0 producing a result
PM_FPU1_FIN Operations on floating point unit 1 producing a result
PM_CYC Number of processor cycles
PM_FPU_STF Number of floating point stores (by floating point unit)
PM_INST_CMPL Number of completed instructions
PM_LSU_LDF Number of floating point loads (by load store unit)

Derived Metrics
Utilization rate User time divided by wall-clock time in percent
Load and store operations
Total number of floats loaded and stored in 1000000 operations
Instructions per load/store
Completed instructions divided by result of the previous line
MIPS
Completed instructions divided by wall-clock time in 1000000/s
Instructions per cycle Completed instructions divided by number of cycles
HW Float point instructions per Cycle
Sum of result-producing operations on both FPUs divided by the number of cycles
Floating point instructions + FMAs (flips)
Sum of result-producing operations on both FPUs plus the number of executed floating point multiply-additions minus the stores by the FPUs
Each FMA contains 2 calculations in a single instruction. Store instructions contain no calculation.
Flip rate (flips/WCT)
Result of the previous line divided by the wall-clock time in 1000000/sec
Includes many overheads for HPMCOUNT, more useful for LIBHPM
Flips/user time As above but with user time instead of wall-clock time
FMA percentage
Twice the number of floating point multiply adds divided by the flips in percent
Computation intensity
Number of flips divided by the total number of floats loaded and stored


next up previous
Next: Cycles, instructions, TLB and Up: HPM event sets and Previous: HPM event sets and
Joachim Hein
2003-11-03