RDTSC - Throughput and Uops
With unroll_count=500 and no inner loop
- Code:
0: 0f 31 rdtsc
- Show nanoBench command
- Results:
- Instructions retired: 1.0
- Core cycles: 25.0
- Reference cycles: 17.45
- UOPS_EXECUTED.THREAD: 14.0
- RETIRE_SLOTS: 16.0
- UOPS_MITE: 4.0
- UOPS_MS: 12.0
- UOPS_PORT_0: 2.67
- UOPS_PORT_1: 5.0
- UOPS_PORT_2: 0.0
- UOPS_PORT_3: 0.0
- UOPS_PORT_4: 0.0
- UOPS_PORT_5: 3.0
- UOPS_PORT_6: 3.33
- UOPS_PORT_7: 0.0
- DIV_CYCLES: 0.0
- ILD_STALL.LCP: 0.0
- UOPS_MITE>=1: 1.0
With unroll_count=500, no inner loop, and 1 NOP
- Code:
0: 0f 31 rdtsc
2: 90 nop
- Show nanoBench command
- Results:
- Instructions retired: 2.0
- Core cycles: 25.0
- Reference cycles: 17.23
- UOPS_EXECUTED.THREAD: 14.0
- RETIRE_SLOTS: 17.0
- UOPS_MITE: 5.0
- UOPS_MS: 12.0
- UOPS_PORT_0: 2.75
- UOPS_PORT_1: 4.25
- UOPS_PORT_2: 0.0
- UOPS_PORT_3: 0.0
- UOPS_PORT_4: 0.0
- UOPS_PORT_5: 3.5
- UOPS_PORT_6: 3.5
- UOPS_PORT_7: 0.0
- DIV_CYCLES: 0.0
- ILD_STALL.LCP: 0.0
- UOPS_MITE>=1: 2.0
With loop_count=1000 and unroll_count=10
- Code:
0: 0f 31 rdtsc
- Show nanoBench command
- Results:
- Instructions retired: 1.2
- Core cycles: 25.0
- Reference cycles: 17.29
- UOPS_EXECUTED.THREAD: 14.1
- RETIRE_SLOTS: 16.1
- UOPS_MITE: 4.1
- UOPS_MS: 12.0
- UOPS_PORT_0: 2.55
- UOPS_PORT_1: 4.48
- UOPS_PORT_2: 0.0
- UOPS_PORT_3: 0.0
- UOPS_PORT_4: 0.0
- UOPS_PORT_5: 3.42
- UOPS_PORT_6: 3.65
- UOPS_PORT_7: 0.0
- DIV_CYCLES: 0.0
- ILD_STALL.LCP: 0.0
- UOPS_MITE>=1: 1.1
With loop_count=100 and unroll_count=100
- Code:
0: 0f 31 rdtsc
- Show nanoBench command
- Results:
- Instructions retired: 1.02
- Core cycles: 25.0
- Reference cycles: 17.85
- UOPS_EXECUTED.THREAD: 14.01
- RETIRE_SLOTS: 16.01
- UOPS_MITE: 4.01
- UOPS_MS: 12.0
- UOPS_PORT_0: 2.79
- UOPS_PORT_1: 4.45
- UOPS_PORT_2: 0.0
- UOPS_PORT_3: 0.0
- UOPS_PORT_4: 0.0
- UOPS_PORT_5: 3.27
- UOPS_PORT_6: 3.5
- UOPS_PORT_7: 0.0
- DIV_CYCLES: 0.0
- ILD_STALL.LCP: 0.0
- UOPS_MITE>=1: 1.01