VP2INTERSECTD (R64, YMM, YMM) - Throughput and Uops
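
The configurations listed below differ in the number of independent instructions, unroll_count, and loop_count. As a rough sketch of how these parameters might combine, the following assumes that the benchmark body consists of the independent instructions repeated unroll_count times, and that an inner loop (when present) executes that body loop_count times. This model and the helper name `instructions_per_run` are assumptions for illustration, not taken from the measurement tool; loop-control instructions and any added NOPs are not counted.

```python
def instructions_per_run(n_independent, unroll_count, loop_count=None):
    """Estimate how many benchmarked instructions execute in one run.

    Assumed model (hypothetical, for illustration only):
      - the body is n_independent instructions repeated unroll_count times
      - with an inner loop, the body executes loop_count times
      - with no inner loop (loop_count=None), the body executes once
    """
    body = n_independent * unroll_count
    iterations = 1 if loop_count is None else loop_count
    return body * iterations


# Configurations that appear in the headings of this page
configs = [
    (1, 500, None),   # 1 instruction, unroll_count=500, no inner loop
    (1, 10, 1000),    # 1 instruction, loop_count=1000, unroll_count=10
    (3, 200, None),   # 3 instructions, unroll_count=200, no inner loop
    (3, 30, 100),     # 3 instructions, loop_count=100, unroll_count=30
]

for n, unroll, loop in configs:
    print(n, unroll, loop, instructions_per_run(n, unroll, loop))
```

Under this assumed model, the looped configurations execute far more instructions per run than the fully unrolled ones, which is one reason the two variants can yield different measurements.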


With different registers for different operands

With 1 independent instruction

With unroll_count=500 and no inner loop

With unroll_count=500, no inner loop, and 1 NOP

With loop_count=1000 and unroll_count=10

With loop_count=100 and unroll_count=100

With 3 independent instructions

With unroll_count=200 and no inner loop

With loop_count=1000 and unroll_count=3

With loop_count=100 and unroll_count=30


With the same register for different operands

With 1 independent instruction

With unroll_count=500 and no inner loop

With unroll_count=500, no inner loop, and 1 NOP

With loop_count=1000 and unroll_count=10

With loop_count=100 and unroll_count=100

With 3 independent instructions

With unroll_count=200 and no inner loop

With loop_count=1000 and unroll_count=3

With loop_count=100 and unroll_count=30