INC_LOCK (M32)

Summary:	"Increment by 1"
Reference:	https://www.felixcloutier.com/x86/inc
Extension:	BASE
Category:	BINARY
ISA-Set:	I86
CPL:	3
iform:	INC_LOCK_MEMv
iclass:	INC_LOCK
ASM:	LOCK INC

Operands

Operand 1 (r/w): Memory
Operand 2 (w, suppressed): Flags (AF: w, OF: w, PF: w, SF: w, ZF: w)

Available performance data

Arrow Lake-P
Arrow Lake-E
Meteor Lake-P
Meteor Lake-E
Emerald Rapids
Alder Lake-P
Alder Lake-E
Rocket Lake
Tiger Lake
Ice Lake
Cascade Lake
Cannon Lake
Skylake-X
Coffee Lake
Kaby Lake
Skylake
Broadwell
Haswell
Ivy Bridge
Sandy Bridge
Westmere
Nehalem
Wolfdale
Conroe
Tremont
Goldmont Plus
Goldmont
Airmont
Bonnell
AMD Zen 5
AMD Zen 4
AMD Zen 3
AMD Zen 2
AMD Zen+

Arrow Lake-P

Measurements

Latencies
Throughput
- Computed from the port usage: 1.00
- Measured (loop): 20.00
- Measured (unrolled): 20.00
Number of μops
- Executed: 5
- Retire slots: 7
- Decoded (MITE): 0
- Microcode Sequencer (MS): 7
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 2*ALU+1*JMP+1*LD+1*STA+2*STD

Arrow Lake-E

Measurements

Latencies
Throughput
- Measured (loop): 15.00
- Measured (unrolled): 15.00
Number of μops
- Executed: 1
- Microcode Sequencer (MS): 0

Meteor Lake-P

Measurements

Latencies
Throughput
- Computed from the port usage: 1.00
- Measured (loop): 17.00
- Measured (unrolled): 17.00
Number of μops
- Executed: 6
- Retire slots: 8
- Decoded (MITE): 0
- Microcode Sequencer (MS): 8
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 4*p0156B+1*p06+1*p23A+1*p49+1*p78

Meteor Lake-E

Measurements

Latencies
Throughput
- Measured (loop): 18.00
- Measured (unrolled): 18.00
Number of μops
- Executed: 1
- Microcode Sequencer (MS): 0

Emerald Rapids

Measurements

Latencies
Throughput
- Computed from the port usage: 1.00
- Measured (loop): 18.00
- Measured (unrolled): 18.00
Number of μops
- Executed: 6
- Retire slots: 8
- Decoded (MITE): 0
- Microcode Sequencer (MS): 8
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 4*p0156B+1*p06+1*p23A+1*p49+1*p78

Alder Lake-P

Measurements

Latencies
Throughput
- Computed from the port usage: 1.00
- Measured (loop): 18.00
- Measured (unrolled): 18.00
Number of μops
- Executed: 6
- Retire slots: 8
- Decoded (MITE): 0
- Microcode Sequencer (MS): 8
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 4*p0156B+1*p06+1*p23A+1*p49+1*p78

Alder Lake-E

Measurements

Latencies
Throughput
- Measured (loop): 18.00
- Measured (unrolled): 18.00
Number of μops
- Executed: 1
- Microcode Sequencer (MS): 0

Rocket Lake

Measurements

Latencies
Throughput
- Computed from the port usage: 1.00
- Measured (loop): 18.00
- Measured (unrolled): 18.00
Number of μops
- Executed: 6
- Retire slots: 8
- Decoded (MITE): 0
- Microcode Sequencer (MS): 8
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 3*p0156+1*p06+1*p23+1*p49+1*p78

Tiger Lake

Measurements

Latencies
Throughput
- Computed from the port usage: 1.25
- Measured (loop): 18.00
- Measured (unrolled): 18.00
Number of μops
- Executed: 6
- Retire slots: 8
- Decoded (MITE): 0
- Microcode Sequencer (MS): 8
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 5*p0156+1*p23+1*p49+1*p78

Ice Lake

Measurements

Latencies
Throughput
- Computed from the port usage: 1.25
- Measured (loop): 18.00
- Measured (unrolled): 18.00
Number of μops
- Executed: 6
- Retire slots: 8
- Decoded (MITE): 0
- Microcode Sequencer (MS): 8
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 5*p0156+1*p23+1*p49+1*p78

Cascade Lake

Measurements

Latencies
Throughput
- Computed from the port usage: 1.25
- Measured (loop): 18.00
- Measured (unrolled): 18.00
Number of μops
- Executed: 7
- Retire slots: 8
- Decoded (MITE): 0
- Microcode Sequencer (MS): 8
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 3*p0156+2*p06+2*p23+1*p4

Cannon Lake

Measurements

Latencies
Throughput
- Computed from the port usage: 1.25 (if an indexed addressing mode is used: 1.00)
- Measured (loop): 18.00
- Measured (unrolled): 18.00
Number of μops
- Executed: 6
- Retire slots: 8
- Decoded (MITE): 0
- Microcode Sequencer (MS): 8
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 4*p0156+1*p06+2*p23+1*p4 (if an indexed addressing mode is used: 3*p0156+1*p06+2*p23+1*p4)

Skylake-X

Measurements

Latencies
Throughput
- Computed from the port usage: 1.25 (if an indexed addressing mode is used: 1.50)
- Measured (loop): 18.00
- Measured (unrolled): 18.00
Number of μops
- Executed: 7
- Retire slots: 8
- Decoded (MITE): 0
- Microcode Sequencer (MS): 8
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 3*p0156+2*p06+2*p23+1*p4 (if an indexed addressing mode is used: 4*p0156+2*p06+2*p23+1*p4)

IACA 2.3

Throughput
- Computed from the port usage: 1.25
- IACA: 1.48
Number of μops: 8
Port usage: 3*p0156+2*p06+1*p23+1*p237+1*p4

IACA 3.0

Throughput
- Computed from the port usage: 1.25
- IACA: 1.49
Number of μops: 8
Port usage: 3*p0156+2*p06+1*p23+1*p237+1*p4

Coffee Lake

Measurements

Latencies
Throughput
- Computed from the port usage: 1.25 (if an indexed addressing mode is used: 1.50)
- Measured (loop): 18.00
- Measured (unrolled): 18.00
Number of μops
- Executed: 7
- Retire slots: 8
- Decoded (MITE): 0
- Microcode Sequencer (MS): 8
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 3*p0156+2*p06+2*p23+1*p4 (if an indexed addressing mode is used: 4*p0156+2*p06+2*p23+1*p4)

Kaby Lake

Measurements

Latencies
Throughput
- Computed from the port usage: 1.25 (if an indexed addressing mode is used: 1.50)
- Measured (loop): 18.00
- Measured (unrolled): 18.00
Number of μops
- Executed: 7
- Retire slots: 8
- Decoded (MITE): 0
- Microcode Sequencer (MS): 8
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 3*p0156+2*p06+2*p23+1*p4 (if an indexed addressing mode is used: 4*p0156+2*p06+2*p23+1*p4)

Skylake

Measurements

Latencies
Throughput
- Computed from the port usage: 1.50
- Measured (loop): 18.00
- Measured (unrolled): 18.00
Number of μops
- Executed: 7
- Retire slots: 8
- Decoded (MITE): 0
- Microcode Sequencer (MS): 8
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 4*p0156+2*p06+2*p23+1*p4

IACA 2.3

Throughput
- Computed from the port usage: 1.25
- IACA: 1.48
Number of μops: 8
Port usage: 3*p0156+2*p06+1*p23+1*p237+1*p4

IACA 3.0

Throughput
- Computed from the port usage: 1.25
- IACA: 1.49
Number of μops: 8
Port usage: 3*p0156+2*p06+1*p23+1*p237+1*p4

Broadwell

Measurements

Latencies
Throughput
- Computed from the port usage: 1.50
- Measured (loop): 21.00
- Measured (unrolled): 21.00
Number of μops
- Executed: 7
- Retire slots: 8
- Decoded (MITE): 0
- Microcode Sequencer (MS): 8
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 4*p0156+2*p06+2*p23+1*p4

IACA 2.2

Throughput
- Computed from the port usage: 1.00
- IACA: 1.00 (with the -no_interiteration flag: 1.00)
Number of μops: 6
Port usage: 3*p0156+1*p23+1*p237+1*p4

IACA 2.3

Throughput
- Computed from the port usage: 1.00
- IACA: 1.00
Number of μops: 6
Port usage: 3*p0156+1*p23+1*p237+1*p4

IACA 3.0

Throughput
- Computed from the port usage: 1.00
- IACA: 1.10
Number of μops: 6
Port usage: 3*p0156+1*p23+1*p237+1*p4

Haswell

Measurements

Latencies
Throughput
- Computed from the port usage: 1.50 (if an indexed addressing mode is used: 1.25)
- Measured (loop): 19.00
- Measured (unrolled): 19.00
Number of μops
- Executed: 7
- Retire slots: 8
- Decoded (MITE): 0
- Microcode Sequencer (MS): 8
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 4*p0156+2*p06+2*p23+1*p4 (if an indexed addressing mode is used: 3*p0156+2*p06+2*p23+1*p4)

IACA 2.1

Latency: 7
Throughput
- Computed from the port usage: 1.00
- IACA: 1.00 (with the -no_interiteration flag: 1.00)
Number of μops: 4
Port usage: 1*p0156+1*p23+1*p237+1*p4

IACA 2.2

Throughput
- Computed from the port usage: 1.00
- IACA: 1.00 (with the -no_interiteration flag: 1.00)
Number of μops: 6
Port usage: 3*p0156+1*p23+1*p237+1*p4

IACA 2.3

Throughput
- Computed from the port usage: 1.00
- IACA: 1.00
Number of μops: 6
Port usage: 3*p0156+1*p23+1*p237+1*p4

IACA 3.0

Throughput
- Computed from the port usage: 1.00
- IACA: 1.00
Number of μops: 6
Port usage: 3*p0156+1*p23+1*p237+1*p4

Ivy Bridge

Measurements

Latencies
Throughput
- Computed from the port usage: 2.00
- Measured (loop): 22.00
- Measured (unrolled): 22.00
Number of μops
- Executed: 7
- Retire slots: 7
- Decoded (MITE): 0
- Microcode Sequencer (MS): 7
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 3*p015+2*p23+1*p4+2*p5

IACA 2.1

Latency: 7
Throughput
- Computed from the port usage: 1.00
- IACA: 1.00 (with the -no_interiteration flag: 1.00)
Number of μops: 4
Port usage: 1*p015+2*p23+1*p4

Sandy Bridge

Measurements

Latencies
Throughput
- Computed from the port usage: 2.00
- Measured (loop): 23.00
- Measured (unrolled): 23.00
Number of μops
- Executed: 8
- Retire slots: 8
- Decoded (MITE): 0
- Microcode Sequencer (MS): 8
- Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage: 3*p015+1*p1+2*p23+1*p4+2*p5

IACA 2.1

Latency: 7
Throughput
- Computed from the port usage: 1.00
- IACA: 1.00 (with the -no_interiteration flag: 1.00)
Number of μops: 4
Port usage: 1*p015+2*p23+1*p4

Westmere

Measurements

Latencies
Throughput
- Computed from the port usage: 1.00
- Measured (loop): 19.00
- Measured (unrolled): 19.00
Number of μops
- Executed: 5
- Retire slots: 5
- Microcode Sequencer (MS): 16
- Requires the complex decoder
Port usage: 2*p015+1*p2+1*p3+1*p4

IACA 2.1

Latency: 6
Throughput
- Computed from the port usage: 1.00
- IACA: 1.00 (with the -no_interiteration flag: 1.00)
Number of μops: 4
Port usage: 1*p015+1*p2+1*p3+1*p4

Nehalem

Measurements

Latencies
Throughput
- Computed from the port usage: 1.00
- Measured (loop): 20.00
- Measured (unrolled): 20.00
Number of μops
- Executed: 5
- Retire slots: 5
- Microcode Sequencer (MS): 16
- Requires the complex decoder
Port usage: 2*p015+1*p2+1*p3+1*p4

IACA 2.1

Latency: 6
Throughput
- Computed from the port usage: 1.00
- IACA: 1.00 (with the -no_interiteration flag: 1.00)
Number of μops: 4
Port usage: 1*p015+1*p2+1*p3+1*p4