RDTSC - Throughput and Uops
With unroll_count=500 and no inner loop
- Code:
0: 0f 31 rdtsc
- Results:
- Instructions retired: 1.0
- Core cycles: 25.0
- Reference cycles: 21.5
- UOPS_EXECUTED.THREAD: 18.0
- RETIRE_SLOTS: 18.0
- UOPS_MITE: 0.0
- UOPS_MS: 18.0
- UOPS_PORT_0: 3.0
- UOPS_PORT_1: 4.0
- UOPS_PORT_2: 0.0
- UOPS_PORT_3: 0.0
- UOPS_PORT_4: 0.0
- UOPS_PORT_5: 3.5
- UOPS_PORT_6: 7.5
- UOPS_PORT_7: 0.0
- DIV_CYCLES: 0.0
- ILD_STALL.LCP: 0.0
- UOPS_MITE>=1: 0.0
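The counters above can be cross-checked against each other. The following is a minimal sketch, with the values copied by hand from this result block and all variable names chosen only for illustration. It sums the per-port counts and compares the total with UOPS_EXECUTED.THREAD.

```python
import math

# Per-port uop counts copied from the results above
# (unroll_count=500, no inner loop). Names are illustrative only.
port_uops = {0: 3.0, 1: 4.0, 2: 0.0, 3: 0.0, 4: 0.0, 5: 3.5, 6: 7.5, 7: 0.0}
uops_executed_thread = 18.0

# Total uops dispatched across all ports per rdtsc.
port_total = sum(port_uops.values())

# True if the per-port counts account for every executed uop.
ports_match = math.isclose(port_total, uops_executed_thread)

# Ports that saw any work at all.
used_ports = sorted(p for p, n in port_uops.items() if n > 0)

print(f"sum over ports = {port_total}, matches UOPS_EXECUTED.THREAD: {ports_match}")
print(f"ports used: {used_ports}")
```

In this block the port counts add up to the executed-uop count, and only ports 0, 1, 5 and 6 are used.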
With unroll_count=500, no inner loop, and 1 NOP
- Code:
0: 0f 31 rdtsc
2: 90 nop
- Results:
- Instructions retired: 2.0
- Core cycles: 25.0
- Reference cycles: 21.45
- UOPS_EXECUTED.THREAD: 18.0
- RETIRE_SLOTS: 19.0
- UOPS_MITE: 1.0
- UOPS_MS: 18.0
- UOPS_PORT_0: 3.06
- UOPS_PORT_1: 4.31
- UOPS_PORT_2: 0.0
- UOPS_PORT_3: 0.0
- UOPS_PORT_4: 0.0
- UOPS_PORT_5: 3.38
- UOPS_PORT_6: 7.25
- UOPS_PORT_7: 0.0
- DIV_CYCLES: 0.0
- ILD_STALL.LCP: 0.0
- UOPS_MITE>=1: 1.0
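To see what the extra NOP changes, the two result blocks so far can be subtracted counter by counter. This is a small sketch using hand-copied values. The dictionary names are hypothetical, and only a subset of the counters is included.

```python
# Selected counters from the rdtsc-only run (unroll_count=500, no inner loop).
rdtsc_only = {
    "Instructions retired": 1.0,
    "Core cycles": 25.0,
    "UOPS_EXECUTED.THREAD": 18.0,
    "RETIRE_SLOTS": 18.0,
    "UOPS_MITE": 0.0,
    "UOPS_MS": 18.0,
}

# The same counters from the rdtsc + nop run.
rdtsc_nop = {
    "Instructions retired": 2.0,
    "Core cycles": 25.0,
    "UOPS_EXECUTED.THREAD": 18.0,
    "RETIRE_SLOTS": 19.0,
    "UOPS_MITE": 1.0,
    "UOPS_MS": 18.0,
}

# Per-counter difference attributable to the added nop.
delta = {name: round(rdtsc_nop[name] - rdtsc_only[name], 2) for name in rdtsc_only}

for name, value in delta.items():
    print(f"{name}: {value:+.2f}")
```

The differences are +1 for instructions retired, RETIRE_SLOTS and UOPS_MITE, and 0 for core cycles, UOPS_EXECUTED.THREAD and UOPS_MS. That is consistent with the NOP taking a retire slot and arriving through MITE without being executed on a port or adding cycles, although the numbers alone do not prove this reading.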
With loop_count=1000 and unroll_count=10
- Code:
0: 0f 31 rdtsc
- Results:
- Instructions retired: 1.2
- Core cycles: 25.0
- Reference cycles: 21.48
- UOPS_EXECUTED.THREAD: 18.1
- RETIRE_SLOTS: 18.1
- UOPS_MITE: 0.1
- UOPS_MS: 18.0
- UOPS_PORT_0: 3.0
- UOPS_PORT_1: 4.42
- UOPS_PORT_2: 0.0
- UOPS_PORT_3: 0.0
- UOPS_PORT_4: 0.0
- UOPS_PORT_5: 3.48
- UOPS_PORT_6: 7.2
- UOPS_PORT_7: 0.0
- DIV_CYCLES: 0.0
- ILD_STALL.LCP: 0.0
- UOPS_MITE>=1: 0.1
With loop_count=100 and unroll_count=100
- Code:
0: 0f 31 rdtsc
- Results:
- Instructions retired: 1.02
- Core cycles: 25.0
- Reference cycles: 21.48
- UOPS_EXECUTED.THREAD: 18.01
- RETIRE_SLOTS: 18.01
- UOPS_MITE: 0.01
- UOPS_MS: 18.0
- UOPS_PORT_0: 3.0
- UOPS_PORT_1: 4.44
- UOPS_PORT_2: 0.0
- UOPS_PORT_3: 0.0
- UOPS_PORT_4: 0.0
- UOPS_PORT_5: 3.5
- UOPS_PORT_6: 7.08
- UOPS_PORT_7: 0.0
- DIV_CYCLES: 0.0
- ILD_STALL.LCP: 0.0
- UOPS_MITE>=1: 0.01
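The fractional values in the two looped runs (1.2 and 1.02 instructions, 0.1 and 0.01 MITE uops) look like loop overhead spread over the unrolled copies. The sketch below tests that reading under two assumptions that are not stated in the results: each loop iteration adds two instructions (for example a counter update and a branch), and those two are delivered through MITE as a single uop. The constants and function name are hypothetical.

```python
import math

BODY_INSTRUCTIONS = 1   # one rdtsc per unrolled copy
LOOP_INSTRUCTIONS = 2   # assumption: loop overhead instructions per iteration
LOOP_MITE_UOPS = 1      # assumption: overhead reaches the core as one MITE uop


def per_copy(unroll_count):
    """Predicted (instructions retired, UOPS_MITE) per rdtsc copy."""
    instructions = BODY_INSTRUCTIONS + LOOP_INSTRUCTIONS / unroll_count
    mite_uops = LOOP_MITE_UOPS / unroll_count
    return instructions, mite_uops


# Measured (instructions retired, UOPS_MITE) from the two looped runs above.
measured = {10: (1.2, 0.1), 100: (1.02, 0.01)}

# Compare the model with each measurement, allowing for rounding in the report.
model_fits = {}
for unroll_count, (instr, mite) in measured.items():
    pred_instr, pred_mite = per_copy(unroll_count)
    model_fits[unroll_count] = (
        math.isclose(pred_instr, instr, abs_tol=0.005)
        and math.isclose(pred_mite, mite, abs_tol=0.005)
    )
    print(f"unroll_count={unroll_count}: predicted {pred_instr:.2f} instr, "
          f"{pred_mite:.2f} MITE uops, fits: {model_fits[unroll_count]}")
```

Under these assumptions the model reproduces both runs, and the overhead shrinks by a factor of ten when unroll_count goes from 10 to 100, while core cycles and UOPS_MS stay at 25.0 and 18.0.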