RDRAND (R64) - Throughput and Uops
With 1 independent instruction
With unroll_count=10 and no inner loop
Code:
0: 49 0f c7 f0 rdrand r8
Results:
Instructions retired: 1.0
Core cycles: 1373.38
Reference cycles: 1028.8
UOPS_EXECUTED.THREAD: 21.1
RETIRE_SLOTS: 24.0
UOPS_MITE: 0.62
UOPS_MS: 24.75
UOPS_PORT_0: 5.0
UOPS_PORT_1: 7.4
UOPS_PORT_6: 6.6
UOPS_PORT_49: 0.0
UOPS_PORT_78: 0.0
UOPS_PORT_5B: 3.2
UOPS_PORT_5B>=2: 0.0
UOPS_PORT_23A: 1.0
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 0.1
With unroll_count=10, no inner loop, and 1 NOP
Code:
0: 49 0f c7 f0 rdrand r8
4: 90 nop
Results:
Instructions retired: 2.0
Core cycles: 1380.12
Reference cycles: 1035.2
UOPS_EXECUTED.THREAD: 21.2
RETIRE_SLOTS: 25.0
UOPS_MITE: 1.2
UOPS_MS: 22.8
UOPS_PORT_0: 3.2
UOPS_PORT_1: 9.2
UOPS_PORT_6: 5.7
UOPS_PORT_49: 0.0
UOPS_PORT_78: 0.0
UOPS_PORT_5B: 4.1
UOPS_PORT_5B>=2: 0.8
UOPS_PORT_23A: 1.0
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 1.12
With loop_count=10 and unroll_count=1
Code:
0: 49 0f c7 f0 rdrand r8
Results:
Instructions retired: 3.0
Core cycles: 1379.67
Reference cycles: 1036.8
UOPS_EXECUTED.THREAD: 21.87
RETIRE_SLOTS: 28.18
UOPS_MITE: 0.4
UOPS_MS: 7.0
UOPS_PORT_0: 4.07
UOPS_PORT_1: 7.62
UOPS_PORT_6: 5.8
UOPS_PORT_49: 0.0
UOPS_PORT_78: 0.0
UOPS_PORT_5B: 5.17
UOPS_PORT_5B>=2: 0.8
UOPS_PORT_23A: 1.0
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 0.68
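Summary sketch:
The three configurations can be compared with a few lines of Python. This is only an illustrative sketch: the numbers are copied from the results above, while the dictionary layout, the variable names, and the choice of the lowest core-cycle count as the throughput estimate are assumptions made here, not something the measurements themselves state. It also assumes the counter values are normalized per instance of the benchmarked code.

```python
# Core and reference cycle counts copied from the three result blocks above.
# Assumption: values are per instance of the benchmarked rdrand sequence.
measurements = {
    "unroll_count=10, no inner loop": {"core": 1373.38, "ref": 1028.8},
    "unroll_count=10, no inner loop, 1 NOP": {"core": 1380.12, "ref": 1035.2},
    "loop_count=10, unroll_count=1": {"core": 1379.67, "ref": 1036.8},
}

# Assumption: use the smallest core-cycle count as the throughput estimate.
best_config, best_counts = min(measurements.items(), key=lambda kv: kv[1]["core"])
best_cycles = best_counts["core"]

# Ratio of core to reference cycles for each configuration.
ratios = {name: m["core"] / m["ref"] for name, m in measurements.items()}

print(f"Estimated throughput: {best_cycles} core cycles per rdrand ({best_config})")
for name, ratio in ratios.items():
    print(f"{name}: core/ref = {ratio:.2f}")
```

All three configurations land within about 0.5% of each other in core cycles, so the added NOP and the loop overhead have no meaningful effect on the result for this instruction.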