RDSEED (R64)
Summary:
"Read Random SEED"
Reference:
https://www.felixcloutier.com/x86/rdseed
Extension:
RDSEED
Category:
RDSEED
ISA-Set:
RDSEED
CPL:
3
iform:
RDSEED_GPRv
iclass:
RDSEED
ASM:
RDSEED
Operands
Operand 1 (w): Register (RAX, RCX, RDX, RBX, RSP, RBP, RSI, RDI, R8, R9, R10, R11, R12, R13, R14, R15)
Operand 2 (w, suppressed): Flags (AF: w, CF: w, OF: w, PF: w, SF: w, ZF: w)
Available performance data
Arrow Lake-P
Arrow Lake-E
Meteor Lake-P
Meteor Lake-E
Emerald Rapids
Alder Lake-P
Alder Lake-E
Rocket Lake
Tiger Lake
Ice Lake
Cascade Lake
Cannon Lake
Skylake-X
Coffee Lake
Kaby Lake
Skylake
Broadwell
Tremont
Goldmont Plus
Goldmont
AMD Zen 5
AMD Zen 4
AMD Zen 3
AMD Zen 2
AMD Zen+
Arrow Lake-P
Measurements
Throughput (cycles per instruction; the same unit applies in every section below)
Computed from the port usage: 2.67
Measured (loop):
1572.93
Measured (unrolled):
1573.78
Number of μops
Executed: 9
Retire slots: 13
Decoded (MITE): 0
Microcode Sequencer (MS): 13
Requires the complex decoder (13 other instructions can be decoded with simple decoders in the same cycle)
Port usage:
13*ALU+1*JMP+1*LD+1*SHIFT+1*SLOW
Arrow Lake-E
Measurements
Throughput
Measured (loop):
1566.42
Measured (unrolled):
1464.50
Number of μops
Executed: 13
Microcode Sequencer (MS): 13
Requires the complex decoder
Meteor Lake-P
Measurements
Throughput
Computed from the port usage: 7.00
Measured (loop):
1843.43
Measured (unrolled):
1842.27
Number of μops
Executed: 21
Retire slots: 23
Decoded (MITE): 0
Microcode Sequencer (MS): 21
Requires the complex decoder (1 other instruction can be decoded with simple decoders in the same cycle)
Port usage:
2*p0+3*p015+3*p056+5*p06+7*p1+1*p23A+3*p5
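The "Computed from the port usage" figures can be reproduced from the port-usage strings with a simple bottleneck calculation. The following is a minimal sketch, not necessarily the method used to produce the numbers on this page. It assumes that each character after `p` names one execution port (so `p23A` means ports 2, 3 and A), and that the bound is the maximum, over all port subsets S, of the number of μops that can only run on ports in S divided by the size of S. The function names `parse_port_usage` and `throughput_bound` are hypothetical helpers introduced for illustration. Symbolic port names such as `ALU` or `JMP` (as in the Arrow Lake-P entry) are not handled.

```python
from itertools import combinations


def parse_port_usage(usage):
    """Parse a string like '2*p0+3*p015' into (uop_count, port_set) pairs."""
    groups = []
    for term in usage.split("+"):
        count, ports = term.split("*")
        if not ports.startswith("p"):
            # Symbolic names (ALU, JMP, ...) are outside this sketch's scope.
            raise ValueError(f"unsupported port name: {ports!r}")
        # Assumption: every character after 'p' is a single port identifier.
        groups.append((int(count), frozenset(ports[1:])))
    return groups


def throughput_bound(usage):
    """Lower bound on cycles per instruction implied by port contention."""
    groups = parse_port_usage(usage)
    all_ports = sorted(set().union(*(ports for _, ports in groups)))
    best = 0.0
    for size in range(1, len(all_ports) + 1):
        for subset in combinations(all_ports, size):
            chosen = set(subset)
            # Count the uops whose allowed ports all lie inside this subset.
            pinned = sum(n for n, ports in groups if ports <= chosen)
            best = max(best, pinned / size)
    return best


if __name__ == "__main__":
    meteor_lake_p = "2*p0+3*p015+3*p056+5*p06+7*p1+1*p23A+3*p5"
    print(throughput_bound(meteor_lake_p))  # 7.0
```

Under these assumptions the sketch returns 7.0 for the Meteor Lake-P string above (the seven μops restricted to port 1 are the bottleneck), matching the listed value. The measured throughput is far higher than this bound on every microarchitecture listed, which indicates that execution-port contention is not what limits RDSEED in practice.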
Meteor Lake-E
Measurements
Throughput
Measured (loop):
1457.65
Measured (unrolled):
1440.97
Number of μops
Executed: 12
Microcode Sequencer (MS): 11
Requires the complex decoder
Emerald Rapids
Measurements
Throughput
Computed from the port usage: 7.00
Measured (loop):
607.83
Measured (unrolled):
619.97
Number of μops
Executed: 21
Retire slots: 23
Decoded (MITE): 0
Microcode Sequencer (MS): 23
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
2*p0+3*p015+3*p056+5*p06+7*p1+1*p23A+3*p5
Alder Lake-P
Measurements
Throughput
Computed from the port usage: 7.00
Measured (loop):
1375.22
Measured (unrolled):
1373.93
Number of μops
Executed: 21
Retire slots: 23
Decoded (MITE): 0
Microcode Sequencer (MS): 23
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
2*p0+3*p015+3*p056+5*p06+7*p1+1*p23A+3*p5
Alder Lake-E
Measurements
Throughput
Measured (loop):
1395.67
Measured (unrolled):
1224.33
Number of μops
Executed: 19
Microcode Sequencer (MS): 18
Requires the complex decoder
Rocket Lake
Measurements
Throughput
Computed from the port usage: 8.00
Measured (loop):
1425.43
Measured (unrolled):
1425.35
Number of μops
Executed: 33
Retire slots: 35
Decoded (MITE): 0
Microcode Sequencer (MS): 35
Requires the complex decoder (4 other instructions can be decoded with simple decoders in the same cycle)
Port usage:
2*p0+4*p015+5*p0156+11*p06+7*p1+3*p5+1*p78
Tiger Lake
Measurements
Throughput
Computed from the port usage: 14.50
Measured (loop):
13555.00
Measured (unrolled):
13817.17
Number of μops
Executed: 81
Retire slots: 65
Decoded (MITE): 0
Microcode Sequencer (MS): 92
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
4*p0+6*p015+2*p0156+24*p06+13*p1+1*p23+9*p49+9*p5+12*p78
Ice Lake
Measurements
Throughput
Computed from the port usage: 19.00
Measured (loop):
1548.98
Measured (unrolled):
1548.67
Number of μops
Executed: 58
Retire slots: 55
Decoded (MITE): 0
Microcode Sequencer (MS): 64
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
7*p0+1*p01+19*p06+19*p1+11*p5+1*p78
Cascade Lake
Measurements
Throughput
Computed from the port usage: 4.00
Measured (loop):
606.85
Measured (unrolled):
608.12
Number of μops
Executed: 16
Retire slots: 16
Decoded (MITE): 0
Microcode Sequencer (MS): 16
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
3*p015+1*p0156+8*p06+2*p1+1*p23
Cannon Lake
Measurements
Throughput
Computed from the port usage: 2.75
Measured (loop):
881.20
Measured (unrolled):
878.20
Number of μops
Executed: 12
Retire slots: 14
Decoded (MITE): 0
Microcode Sequencer (MS): 14
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
1*p015+3*p0156+5*p06+2*p1+1*p23
Skylake-X
Measurements
Throughput
Computed from the port usage: 4.50
Measured (loop):
574.23
Measured (unrolled):
574.40
Number of μops
Executed: 16
Retire slots: 16
Decoded (MITE): 0
Microcode Sequencer (MS): 16
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
3*p015+1*p0156+9*p06+2*p1+1*p23
Coffee Lake
Measurements
Throughput
Computed from the port usage: 1868.50
Measured (loop):
5698.85
Measured (unrolled):
5699.20
Number of μops
Executed: 8446
Retire slots: 8416
Decoded (MITE): 0
Microcode Sequencer (MS): 8466
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
1805*p0+1*p01+1932*p06+1783*p1+1160*p23+10*p237+14*p4+1740*p5
Kaby Lake
Measurements
Throughput
Computed from the port usage: 3199.50
Measured (loop):
7225.97
Measured (unrolled):
7221.58
Number of μops
Executed: 14394
Retire slots: 14350
Decoded (MITE): 0
Microcode Sequencer (MS): 14492
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
3116*p0+3283*p06+3006*p1+1995*p23+7*p237+14*p4+2978*p5
Skylake
Measurements
Throughput
Computed from the port usage: 2457.50
Measured (loop):
6138.25
Measured (unrolled):
6138.60
Number of μops
Executed: 11095
Retire slots: 11046
Decoded (MITE): 0
Microcode Sequencer (MS): 11189
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
2377*p0+4*p01+4*p05+2538*p06+2330*p1+3*p15+1529*p23+10*p237+15*p4+2291*p5
Broadwell
Measurements
Throughput
Computed from the port usage: 20.50
Measured (loop):
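9223372036854775808.00 (this is 2^63, most likely an overflow artifact rather than a real measurement; use the unrolled value)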
Measured (unrolled):
2085.80
Number of μops
Executed: 76
Retire slots: 59
Decoded (MITE): 0
Microcode Sequencer (MS): 104
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
3*p0+38*p06+19*p1+7*p4+16*p5
Tremont
Measurements
Throughput
Measured (loop):
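9223372036854775808.00 (this is 2^63, most likely an overflow artifact rather than a real measurement; use the unrolled value)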
Measured (unrolled):
3523.60
Number of μops
Executed: 59
Microcode Sequencer (MS): 59
Requires the complex decoder
Goldmont Plus
Measurements
Throughput
Measured (loop):
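9223372036854775808.00 (this is 2^63, most likely an overflow artifact rather than a real measurement; use the unrolled value)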
Measured (unrolled):
1929.20
Number of μops
Executed: 16
Microcode Sequencer (MS): 16
Requires the complex decoder
Goldmont
Measurements
Throughput
Measured (loop):
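9223372036854775808.00 (this is 2^63, most likely an overflow artifact rather than a real measurement; use the unrolled value)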
Measured (unrolled):
3199.27
Number of μops
Executed: 17
Microcode Sequencer (MS): 17
Requires the complex decoder
AMD Zen 5
Measurements
Throughput
Measured (loop):
91967.92
Measured (unrolled):
92367.70
Number of μops
Executed: 31
Documentation
Latency: NA
Throughput: NA
Number of μops: ucode
Port usage: ucode
AMD Zen 4
Measurements
Throughput
Measured (loop):
84090.00
Measured (unrolled):
83787.45
Number of μops
Executed: 30
Documentation
Latency: variable
Throughput: variable
Number of μops: ucode
AMD Zen 3
Measurements
Throughput
Measured (loop):
5888.45
Measured (unrolled):
5855.48
Number of μops
Executed: 31
Documentation
Latency: variable
Throughput: variable
Number of μops: ucode
AMD Zen 2
Measurements
Throughput
Measured (loop):
3608.90
Measured (unrolled):
3613.97
Number of μops
Executed: 20
Documentation
Latency: variable
Throughput: variable
Number of μops: ucode
AMD Zen+
Measurements
Throughput
Measured (loop):
2536.28
Measured (unrolled):
2535.53
Number of μops
Executed: 19
Documentation
Latency: variable
Throughput: variable
Number of μops: ucode