RDSEED (R64) - Throughput and Uops
With 1 independent instruction
With unroll_count=10 and no inner loop
Code:
0: 49 0f c7 f8 rdseed r8
Results:
Instructions retired: 1.0
Core cycles: 1548.67
Reference cycles: 511.5
UOPS_EXECUTED.THREAD: 58.37
RETIRE_SLOTS: 55.62
UOPS_MITE: 0.32
UOPS_MS: 64.1
UOPS_PORT_0: 7.62
UOPS_PORT_1: 19.42
UOPS_PORT_5: 11.1
UOPS_PORT_6: 19.33
UOPS_PORT_23: 0.0
UOPS_PORT_49: 0.0
UOPS_PORT_78: 1.0
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 0.0
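As a quick sanity check on the counters above, the per-port values can be summed and compared with UOPS_EXECUTED.THREAD. The sketch below is only illustrative arithmetic on the listed numbers. It assumes that the port counters together cover all executed uops and that combined names such as UOPS_PORT_23 report the total for the ports they name; neither assumption is stated in the results themselves.

```python
# Per-port uop counts copied from the results above
# (unroll_count=10, no inner loop).
port_uops = {
    "UOPS_PORT_0": 7.62,
    "UOPS_PORT_1": 19.42,
    "UOPS_PORT_5": 11.1,
    "UOPS_PORT_6": 19.33,
    "UOPS_PORT_23": 0.0,
    "UOPS_PORT_49": 0.0,
    "UOPS_PORT_78": 1.0,
}
uops_executed = 58.37  # UOPS_EXECUTED.THREAD from the same run

# Assumption: the port counters jointly account for every executed uop.
port_total = sum(port_uops.values())
difference = port_total - uops_executed

print(f"Sum of port counters: {port_total:.2f}")
print(f"UOPS_EXECUTED.THREAD: {uops_executed:.2f}")
print(f"Difference:           {difference:+.2f}")
```

Under these assumptions the two totals agree to within about 0.1 uops per instruction, which is plausibly measurement noise.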
With unroll_count=10, no inner loop, and 1 NOP
Code:
0: 49 0f c7 f8 rdseed r8
4: 90 nop
Results:
Instructions retired: 2.0
Core cycles: 1548.78
Reference cycles: 512.53
UOPS_EXECUTED.THREAD: 58.35
RETIRE_SLOTS: 57.2
UOPS_MITE: 0.9
UOPS_MS: 64.1
UOPS_PORT_0: 7.5
UOPS_PORT_1: 19.6
UOPS_PORT_5: 10.9
UOPS_PORT_6: 19.13
UOPS_PORT_23: 0.0
UOPS_PORT_49: 0.0
UOPS_PORT_78: 1.0
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 1.0
With loop_count=10 and unroll_count=1
Code:
0: 49 0f c7 f8 rdseed r8
Results:
Instructions retired: 3.0
Core cycles: 1548.98
Reference cycles: 512.53
UOPS_EXECUTED.THREAD: 60.4
RETIRE_SLOTS: 57.78
UOPS_MITE: 1.83
UOPS_MS: 64.1
UOPS_PORT_0: 7.67
UOPS_PORT_1: 20.33
UOPS_PORT_5: 11.1
UOPS_PORT_6: 20.27
UOPS_PORT_23: 0.0
UOPS_PORT_49: 0.0
UOPS_PORT_78: 1.0
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 1.0
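The three configurations report nearly identical core cycle counts, which can be quantified with a short calculation. The sketch below uses only the Core cycles values listed in the three result blocks; the configuration labels are shorthand introduced here for illustration, and reading the small spread as "the extra NOP and the loop overhead do not affect RDSEED throughput" is an interpretation, not something the results state.

```python
# Core cycles per RDSEED, copied from the three result blocks above.
# The dictionary keys are illustrative labels, not nanoBench option names.
core_cycles = {
    "unroll_count=10": 1548.67,
    "unroll_count=10 + 1 NOP": 1548.78,
    "loop_count=10, unroll_count=1": 1548.98,
}

lowest = min(core_cycles.values())
highest = max(core_cycles.values())
spread = highest - lowest            # absolute spread in cycles
relative_spread = spread / lowest    # spread relative to the lowest value

for label, cycles in core_cycles.items():
    print(f"{label:32s} {cycles:8.2f} cycles")
print(f"Spread: {spread:.2f} cycles ({relative_spread:.4%} of the lowest value)")
```

The spread is a fraction of a cycle out of roughly 1549, so the measured cost per RDSEED is effectively the same in all three setups.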