MOVSQ - Throughput and Uops
With unroll_count=500 and no inner loop
Code:
0: 48 a5 movs QWORD PTR es:[rdi],QWORD PTR ds:[rsi]
Results:
Instructions retired: 1.0
Core cycles: 4.0
Reference cycles: 4.0
RS_UOPS_DISPATCHED: 8.0
UOPS_PORT_0: 1.0
UOPS_PORT_1: 1.0
UOPS_PORT_2: 2.0
UOPS_PORT_3: 1.0
UOPS_PORT_4: 1.0
UOPS_PORT_5: 2.0
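As a quick sanity check on the unroll_count=500 measurement, the per-port counts can be summed and compared with RS_UOPS_DISPATCHED. The following is a minimal Python sketch with the values copied from the results above; the variable names are illustrative and not part of nanoBench.

```python
# Per-port uop counts copied from the unroll_count=500 results.
port_uops = {
    "UOPS_PORT_0": 1.0,
    "UOPS_PORT_1": 1.0,
    "UOPS_PORT_2": 2.0,
    "UOPS_PORT_3": 1.0,
    "UOPS_PORT_4": 1.0,
    "UOPS_PORT_5": 2.0,
}
rs_uops_dispatched = 8.0  # reported RS_UOPS_DISPATCHED
core_cycles = 4.0         # reported core cycles per instruction

# Sum of the per-port counts; should match RS_UOPS_DISPATCHED.
total_port_uops = sum(port_uops.values())

# Average number of uops dispatched per core cycle for this instruction.
uops_per_cycle = total_port_uops / core_cycles

print(f"sum of port uops: {total_port_uops}")
print(f"matches RS_UOPS_DISPATCHED: {total_port_uops == rs_uops_dispatched}")
print(f"uops per cycle: {uops_per_cycle}")
```

The same check can be repeated for the looped runs by substituting their values.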
With loop_count=1000 and unroll_count=10
Code:
0: 48 a5 movs QWORD PTR es:[rdi],QWORD PTR ds:[rsi]
Results:
Instructions retired: 1.2
Core cycles: 4.1
Reference cycles: 4.1
RS_UOPS_DISPATCHED: 8.2
UOPS_PORT_0: 1.0
UOPS_PORT_1: 0.9
UOPS_PORT_2: 2.0
UOPS_PORT_3: 1.0
UOPS_PORT_4: 1.0
UOPS_PORT_5: 2.3
With loop_count=100 and unroll_count=100
Code:
0: 48 a5 movs QWORD PTR es:[rdi],QWORD PTR ds:[rsi]
Results:
Instructions retired: 1.02
Core cycles: 4.01
Reference cycles: 4.01
RS_UOPS_DISPATCHED: 8.02
UOPS_PORT_0: 1.01
UOPS_PORT_1: 1.0
UOPS_PORT_2: 2.0
UOPS_PORT_3: 1.0
UOPS_PORT_4: 1.0
UOPS_PORT_5: 2.01
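The fractional "Instructions retired" values in the two looped runs (1.2 and 1.02) are consistent with a fixed loop overhead being spread over the unrolled copies. The sketch below assumes two loop-control instructions per iteration; that figure is an assumption chosen because it reproduces the reported numbers, not something stated in the results, and the function name is hypothetical.

```python
def instructions_per_copy(unroll_count, loop_overhead_instructions=2):
    """Retired instructions per unrolled copy of the benchmarked instruction.

    Assumes each loop iteration retires `unroll_count` copies of the
    instruction plus a fixed number of loop-control instructions
    (2 is an assumption, not a value taken from the results).
    """
    return (unroll_count + loop_overhead_instructions) / unroll_count


# Compare against the reported values: 1.2 (unroll_count=10), 1.02 (unroll_count=100).
for unroll_count, reported in [(10, 1.2), (100, 1.02)]:
    modeled = instructions_per_copy(unroll_count)
    print(unroll_count, modeled, abs(modeled - reported) < 1e-9)
```

Under this assumption the overhead shrinks as unroll_count grows, which would explain why the unroll_count=100 figures sit closer to the fully unrolled measurement than the unroll_count=10 figures do.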