SFENCE - Throughput and Uops
With unroll_count=500 and no inner loop
- Code:
0: 0f ae f8 sfence
- Show nanoBench command
- Results:
- Instructions retired: 1.0
- Core cycles: 5.0
- Reference cycles: 4.62
- UOPS_RETIRED.ANY: 2.0
- RETIRE_SLOTS: 2.0
- UOPS_MS: 0.0
- UOPS_PORT_0: 0.0
- UOPS_PORT_1: 0.0
- UOPS_PORT_2: 0.0
- UOPS_PORT_3: 1.0
- UOPS_PORT_4: 1.0
- UOPS_PORT_5: 0.0
- DIV_CYCLES: 0.0
- ILD_STALL.LCP: 0.0
- INST_DECODED.DEC0: 1.0
With loop_count=1000 and unroll_count=10
- Code:
0: 0f ae f8 sfence
- Show nanoBench command
- Results:
- Instructions retired: 1.2
- Core cycles: 4.99
- Reference cycles: 4.6
- UOPS_RETIRED.ANY: 2.2
- RETIRE_SLOTS: 2.2
- UOPS_MS: 0.0
- UOPS_PORT_0: 0.0
- UOPS_PORT_1: 0.0
- UOPS_PORT_2: 0.0
- UOPS_PORT_3: 1.0
- UOPS_PORT_4: 1.0
- UOPS_PORT_5: 0.0
- DIV_CYCLES: 0.0
- ILD_STALL.LCP: 0.0
- INST_DECODED.DEC0: 0.01
With loop_count=100 and unroll_count=100
- Code:
0: 0f ae f8 sfence
- Show nanoBench command
- Results:
- Instructions retired: 1.02
- Core cycles: 4.99
- Reference cycles: 4.6
- UOPS_RETIRED.ANY: 2.02
- RETIRE_SLOTS: 2.02
- UOPS_MS: 0.0
- UOPS_PORT_0: 0.0
- UOPS_PORT_1: 0.0
- UOPS_PORT_2: 0.0
- UOPS_PORT_3: 1.0
- UOPS_PORT_4: 1.0
- UOPS_PORT_5: 0.0
- DIV_CYCLES: 0.0
- ILD_STALL.LCP: 0.0
- INST_DECODED.DEC0: 1.0