PCMPESTRM (XMM, XMM, I8) - Throughput and Uops
With unroll_count=500 and no inner loop
Code:
0: 66 0f 3a 60 ca 02 pcmpestrm xmm1,xmm2,0x2
Init:
PXOR XMM1, XMM1; PXOR XMM2, XMM2
Show nanoBench command
Results:
Instructions retired: 1.0
Core cycles: 9.0
Reference cycles: 5.48
TOPDOWN_RETIRING.ALL: 11.0
UOPS_MS: 10.0
With loop_count=1000 and unroll_count=10
Code:
0: 66 0f 3a 60 ca 02 pcmpestrm xmm1,xmm2,0x2
Init:
PXOR XMM1, XMM1; PXOR XMM2, XMM2
Show nanoBench command
Results:
Instructions retired: 1.2
Core cycles: 9.1
Reference cycles: 5.52
TOPDOWN_RETIRING.ALL: 11.2
UOPS_MS: 10.0
With loop_count=100 and unroll_count=100
Code:
0: 66 0f 3a 60 ca 02 pcmpestrm xmm1,xmm2,0x2
Init:
PXOR XMM1, XMM1; PXOR XMM2, XMM2
Show nanoBench command
Results:
Instructions retired: 1.02
Core cycles: 9.02
Reference cycles: 5.47
TOPDOWN_RETIRING.ALL: 11.02
UOPS_MS: 10.0