RDSEED (R64) - Throughput and Uops
With 1 independent instruction
With unroll_count=10 and no inner loop
Code:
0: 49 0f c7 f8 rdseed r8
Show nanoBench command
Results:
Instructions retired: 1.0
Core cycles: 619.97
Reference cycles: 365.33
UOPS_EXECUTED.THREAD: 21.0
RETIRE_SLOTS: 23.0
UOPS_MITE: 0.43
UOPS_MS: 22.92
UOPS_PORT_0: 4.0
UOPS_PORT_1: 7.0
UOPS_PORT_6: 6.0
UOPS_PORT_49: 0.0
UOPS_PORT_78: 0.0
UOPS_PORT_5B: 5.2
UOPS_PORT_5B>=2: 0.0
UOPS_PORT_23A: 1.0
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 0.08
With unroll_count=10, no inner loop, and 1 NOP
Code:
0: 49 0f c7 f8 rdseed r8 4: 90 nop
Show nanoBench command
Results:
Instructions retired: 2.0
Core cycles: 625.9
Reference cycles: 369.33
UOPS_EXECUTED.THREAD: 21.4
RETIRE_SLOTS: 24.0
UOPS_MITE: 1.5
UOPS_MS: 22.9
UOPS_PORT_0: 4.9
UOPS_PORT_1: 7.0
UOPS_PORT_6: 6.0
UOPS_PORT_49: 0.0
UOPS_PORT_78: 0.0
UOPS_PORT_5B: 4.3
UOPS_PORT_5B>=2: 0.0
UOPS_PORT_23A: 1.0
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 1.1
With loop_count=10 and unroll_count=1
Code:
0: 49 0f c7 f8 rdseed r8
Show nanoBench command
Results:
Instructions retired: 3.0
Core cycles: 607.83
Reference cycles: 368.0
UOPS_EXECUTED.THREAD: 21.6
RETIRE_SLOTS: 25.07
UOPS_MITE: 1.13
UOPS_MS: 16.73
UOPS_PORT_0: 4.15
UOPS_PORT_1: 7.0
UOPS_PORT_6: 6.7
UOPS_PORT_49: 0.0
UOPS_PORT_78: 0.0
UOPS_PORT_5B: 5.5
UOPS_PORT_5B>=2: 0.3
UOPS_PORT_23A: 1.0
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 0.7