PCMPESTRM (XMM, XMM, I8) - Throughput and Uops
With unroll_count=500 and no inner loop
Code:
0: 66 0f 3a 60 ca 02 pcmpestrm xmm1,xmm2,0x2
Init:
PXOR XMM1, XMM1; PXOR XMM2, XMM2
Show nanoBench command
Results:
Instructions retired: 1.0
Core cycles: 14.0
Reference cycles: 9.05
UOPS_RETIRED.ANY: 9.0
UOPS_MS: 9.0
With loop_count=1000 and unroll_count=10
Code:
0: 66 0f 3a 60 ca 02 pcmpestrm xmm1,xmm2,0x2
Init:
PXOR XMM1, XMM1; PXOR XMM2, XMM2
Show nanoBench command
Results:
Instructions retired: 1.2
Core cycles: 14.1
Reference cycles: 9.2
UOPS_RETIRED.ANY: 9.2
UOPS_MS: 9.0
With loop_count=100 and unroll_count=100
Code:
0: 66 0f 3a 60 ca 02 pcmpestrm xmm1,xmm2,0x2
Init:
PXOR XMM1, XMM1; PXOR XMM2, XMM2
Show nanoBench command
Results:
Instructions retired: 1.02
Core cycles: 14.01
Reference cycles: 9.14
UOPS_RETIRED.ANY: 9.02
UOPS_MS: 9.0