PCMPESTRM (XMM, XMM, I8) - Throughput and Uops
With unroll_count=500 and no inner loop
Code:
0: 66 0f 3a 60 ca 02 pcmpestrm xmm1,xmm2,0x2
Init:
PXOR XMM1, XMM1; PXOR XMM2, XMM2
Show nanoBench command
Results:
Instructions retired: 1.0
Core cycles: 17.0
Reference cycles: 8.51
UOPS_RETIRED.ALL: 8.0
UOPS_MS: 8.0
With loop_count=1000 and unroll_count=10
Code:
0: 66 0f 3a 60 ca 02 pcmpestrm xmm1,xmm2,0x2
Init:
PXOR XMM1, XMM1; PXOR XMM2, XMM2
Show nanoBench command
Results:
Instructions retired: 1.2
Core cycles: 17.0
Reference cycles: 8.5
UOPS_RETIRED.ALL: 8.2
UOPS_MS: 8.0
With loop_count=100 and unroll_count=100
Code:
0: 66 0f 3a 60 ca 02 pcmpestrm xmm1,xmm2,0x2
Init:
PXOR XMM1, XMM1; PXOR XMM2, XMM2
Show nanoBench command
Results:
Instructions retired: 1.02
Core cycles: 17.0
Reference cycles: 8.5
UOPS_RETIRED.ALL: 8.02
UOPS_MS: 8.0