VPOPCNTQ (ZMM, ZMM)
Extension:
AVX512EVEX
Category:
AVX512
ISA-Set:
AVX512_VPOPCNTDQ_512
CPL:
3
iform:
VPOPCNTQ_ZMMu64_MASKmskw_ZMMu64_AVX512
iclass:
VPOPCNTQ
ASM:
VPOPCNTQ
Operands
Operand 1 (w): Register (ZMM0, ZMM1, ZMM2, ZMM3, ZMM4, ZMM5, ZMM6, ZMM7, ZMM8, ZMM9, ZMM10, ZMM11, ZMM12, ZMM13, ZMM14, ZMM15, ZMM16, ZMM17, ZMM18, ZMM19, ZMM20, ZMM21, ZMM22, ZMM23, ZMM24, ZMM25, ZMM26, ZMM27, ZMM28, ZMM29, ZMM30, ZMM31)
Operand 2 (r): Register (ZMM0, ZMM1, ZMM2, ZMM3, ZMM4, ZMM5, ZMM6, ZMM7, ZMM8, ZMM9, ZMM10, ZMM11, ZMM12, ZMM13, ZMM14, ZMM15, ZMM16, ZMM17, ZMM18, ZMM19, ZMM20, ZMM21, ZMM22, ZMM23, ZMM24, ZMM25, ZMM26, ZMM27, ZMM28, ZMM29, ZMM30, ZMM31)
Available performance data
Emerald Rapids
Alder Lake-P
Rocket Lake
Tiger Lake
Ice Lake
AMD Zen 5
AMD Zen 4
Emerald Rapids
Measurements
Latencies
Latency operand 2 → 1:
3
Throughput
Computed from the port usage: 1.00
Measured (loop):
1.00
Measured (unrolled):
1.00
Number of μops
Executed: 1
Retire slots: 1
Decoded (MITE): 1
Microcode Sequencer (MS): 0
Port usage:
1*p5
Alder Lake-P
Measurements
Latencies
Latency operand 2 → 1:
3
Throughput
Computed from the port usage: 1.00
Measured (loop):
1.00
Measured (unrolled):
1.00
Number of μops
Executed: 1
Retire slots: 1
Decoded (MITE): 1
Microcode Sequencer (MS): 0
Port usage:
1*p5
Rocket Lake
Measurements
Latencies
Latency operand 2 → 1:
3
Throughput
Computed from the port usage: 1.00
Measured (loop):
1.00
Measured (unrolled):
1.00
Number of μops
Executed: 1
Retire slots: 1
Decoded (MITE): 1
Microcode Sequencer (MS): 0
Port usage:
1*p5
Tiger Lake
Measurements
Latencies
Latency operand 2 → 1:
3
Throughput
Computed from the port usage: 1.00
Measured (loop):
1.00
Measured (unrolled):
1.00
Number of μops
Executed: 1
Retire slots: 1
Decoded (MITE): 1
Microcode Sequencer (MS): 0
Port usage:
1*p5
Ice Lake
Measurements
Latencies
Latency operand 2 → 1:
3
Throughput
Computed from the port usage: 1.00
Measured (loop):
1.00
Measured (unrolled):
1.00
Number of μops
Executed: 1
Retire slots: 1
Decoded (MITE): 1
Microcode Sequencer (MS): 0
Port usage:
1*p5
AMD Zen 5
Measurements
Latencies
Latency operand 2 → 1:
1
Throughput
Computed from the port usage: 0.50
Measured (loop):
0.50
Measured (unrolled):
0.50
Number of μops
Executed: 1
Port usage:
1*FP03
Documentation
Latency: 2
Throughput: 0.50
Number of μops: 1
Port usage: FP0/3
AMD Zen 4
Measurements
Latencies
Latency operand 2 → 1:
2
Throughput
Computed from the port usage: 0.50
Measured (loop):
1.00
Measured (unrolled):
1.00
Number of μops
Executed: 1
Port usage:
1*FP01
Documentation
Latency: 2
Throughput: 1.00
Number of μops: 1
Port usage: FP0/1