RDSEED (R64)
Summary:
"Read Random SEED"
Reference:
https://www.felixcloutier.com/x86/RDSEED.html
Extension:
RDSEED
Category:
RDSEED
ISA-Set:
RDSEED
CPL:
3
iform:
RDSEED_GPRv
iclass:
RDSEED
ASM:
RDSEED
Operands
Operand 1 (w): Register (RAX, RCX, RDX, RBX, RSP, RBP, RSI, RDI, R8, R9, R10, R11, R12, R13, R14, R15)
Operand 2 (w, suppressed): Flags (AF: w, CF: w, OF: w, PF: w, SF: w, ZF: w)
Available performance data
Alder Lake-P
Alder Lake-E
Rocket Lake
Tiger Lake
Ice Lake
Cascade Lake
Cannon Lake
Skylake-X
Coffee Lake
Kaby Lake
Skylake
Broadwell
Tremont
Goldmont Plus
Goldmont
AMD Zen 4
AMD Zen 3
AMD Zen 2
AMD Zen+
Alder Lake-P
Measurements
Throughput
Computed from the port usage: 7.00
Measured (loop):
9223372036854775808.00
Measured (unrolled):
1362.33
Number of μops
Executed: 21
Retire slots: 23
Decoded (MITE): 0
Microcode Sequencer (MS): 20
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
2*p0+3*p015+3*p056+5*p06+7*p1+1*p23A+3*p5
Alder Lake-E
Measurements
Throughput
Measured (loop):
9223372036854775808.00
Measured (unrolled):
1102.38
Number of μops
Executed: 19
Microcode Sequencer (MS): 18
Requires the complex decoder
Rocket Lake
Measurements
Throughput
Computed from the port usage: 8.00
Measured (loop):
9223372036854775808.00
Measured (unrolled):
1413.57
Number of μops
Executed: 33
Retire slots: 35
Decoded (MITE): 0
Microcode Sequencer (MS): 35
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
2*p0+4*p015+5*p0156+11*p06+7*p1+3*p5+1*p78
Tiger Lake
Measurements
Throughput
Computed from the port usage: 15.00
Measured (loop):
9223372036854775808.00
Measured (unrolled):
13732.86
Number of μops
Executed: 80
Retire slots: 64
Decoded (MITE): 0
Microcode Sequencer (MS): 91
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
5*p0+5*p015+3*p0156+25*p06+11*p1+1*p23+9*p49+8*p5+12*p78
Ice Lake
Measurements
Throughput
Computed from the port usage: 21.50
Measured (loop):
9223372036854775808.00
Measured (unrolled):
10860.52
Number of μops
Executed: 91
Retire slots: 66
Decoded (MITE): 0
Microcode Sequencer (MS): 116
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
3*p0+1*p0156+40*p06+10*p1+1*p23+10*p49+13*p5+13*p78
Cascade Lake
Measurements
Throughput
Computed from the port usage: 4.50
Measured (loop):
9223372036854775808.00
Measured (unrolled):
581.48
Number of μops
Executed: 16
Retire slots: 16
Decoded (MITE): 0
Microcode Sequencer (MS): 16
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
3*p015+9*p06+2*p1+1*p23
Cannon Lake
Measurements
Throughput
Computed from the port usage: 2.75
Measured (loop):
9223372036854775808.00
Measured (unrolled):
905.85
Number of μops
Executed: 12
Retire slots: 14
Decoded (MITE): 0
Microcode Sequencer (MS): 14
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
1*p015+3*p0156+5*p06+2*p1+1*p23
Skylake-X
Measurements
Throughput
Computed from the port usage: 4.50
Measured (loop):
9223372036854775808.00
Measured (unrolled):
548.98
Number of μops
Executed: 16
Retire slots: 16
Decoded (MITE): 0
Microcode Sequencer (MS): 16
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
3*p015+1*p0156+9*p06+2*p1+1*p23
Coffee Lake
Measurements
Throughput
Computed from the port usage: 1740.00
Measured (loop):
9223372036854775808.00
Measured (unrolled):
5544.50
Number of μops
Executed: 7876
Retire slots: 7846
Decoded (MITE): 0
Microcode Sequencer (MS): 7895
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
1677*p0+2*p0156+1803*p06+1666*p1+1*p15+1080*p23+10*p237+14*p4+1623*p5
Kaby Lake
Measurements
Throughput
Computed from the port usage: 3045.00
Measured (loop):
9223372036854775808.00
Measured (unrolled):
7041.15
Number of μops
Executed: 13711
Retire slots: 13666
Decoded (MITE): 0
Microcode Sequencer (MS): 13809
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
2957*p0+5*p01+6*p05+3133*p06+2860*p1+2*p15+1899*p23+14*p4+2836*p5
Skylake
Measurements
Throughput
Computed from the port usage: 2457.50
Measured (loop):
9223372036854775808.00
Measured (unrolled):
6138.02
Number of μops
Executed: 11095
Retire slots: 11046
Decoded (MITE): 0
Microcode Sequencer (MS): 11189
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
2378*p0+2*p01+4*p015+5*p05+2537*p06+2330*p1+1529*p23+10*p237+15*p4+2292*p5
Broadwell
Measurements
Throughput
Computed from the port usage: 20.50
Measured (loop):
9223372036854775808.00
Measured (unrolled):
2085.80
Number of μops
Executed: 76
Retire slots: 59
Decoded (MITE): 0
Microcode Sequencer (MS): 104
Requires the complex decoder (no other instruction can be decoded with simple decoders in the same cycle)
Port usage:
3*p0+38*p06+19*p1+7*p4+16*p5
Tremont
Measurements
Throughput
Measured (loop):
9223372036854775808.00
Measured (unrolled):
3523.60
Number of μops
Executed: 59
Microcode Sequencer (MS): 59
Requires the complex decoder
Goldmont Plus
Measurements
Throughput
Measured (loop):
9223372036854775808.00
Measured (unrolled):
1929.20
Number of μops
Executed: 16
Microcode Sequencer (MS): 16
Requires the complex decoder
Goldmont
Measurements
Throughput
Measured (loop):
9223372036854775808.00
Measured (unrolled):
3199.27
Number of μops
Executed: 17
Microcode Sequencer (MS): 17
Requires the complex decoder
AMD Zen 4
Measurements
Throughput
Measured (loop):
9223372036854775808.00
Measured (unrolled):
83719.84
Number of μops
Executed: 30
AMD Zen 3
Measurements
Throughput
Measured (loop):
9223372036854775808.00
Measured (unrolled):
3950.23
Number of μops
Executed: 31
Documentation
Latency: variable
Throughput: variable
Number of μops: ucode
AMD Zen 2
Measurements
Throughput
Measured (loop):
9223372036854775808.00
Measured (unrolled):
3619.75
Number of μops
Executed: 20
Documentation
Latency: variable
Throughput: variable
Number of μops: ucode
AMD Zen+
Measurements
Throughput
Measured (loop):
9223372036854775808.00
Measured (unrolled):
2536.00
Number of μops
Executed: 19
Documentation
Latency: variable
Throughput: variable
Number of μops: ucode