NVIDIA ISA Viewer

SHFL

Warp-Wide Register Shuffle

4 variants on SM90 (H100)
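
The four variants differ only in whether operand 3 and operand 4 are registers (R) or immediates (I), and each one starts with its own 12-bit constant pattern. As a rough sketch, that relationship can be captured in Python. The names `VARIANTS`, `variant_key`, and `identify_variant` are hypothetical helpers, not part of the viewer, and the bit strings are copied in the order the viewer displays them, with no assumption about which end is the least significant bit.

```python
# Leading 12-bit constant of each SHFL variant, copied in display order,
# mapped to the operand signature shown under the mnemonic.
VARIANTS = {
    "100100011100": "P,R,R,R,R",
    "100100011010": "P,R,R,R,I",
    "100100011001": "P,R,R,I,R",
    "100100011111": "P,R,R,I,I",
}


def variant_key(mnemonic, signature):
    """Build a viewer-style key, e.g. SHFL_P_R_R_R_I (hypothetical helper)."""
    return "_".join([mnemonic] + signature.split(","))


def identify_variant(row_bits):
    """Return the key of the variant whose constant prefix matches row_bits."""
    prefix = row_bits[:12]
    signature = VARIANTS.get(prefix)
    if signature is None:
        raise ValueError("not a known SHFL variant prefix: " + prefix)
    return variant_key("SHFL", signature)
```

For example, `identify_variant("100100011010" + "0" * 52)` yields the key of the register/immediate variant.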

SHFL
P,R,R,R,R
distilled: @P0 SHFL.IDX P0, R0, R0, R0, R0 ;
key: SHFL_P_R_R_R_R
Encoding (two 64-bit rows, fields in display order):
row 1: 100100011100 | pred | operand 1 | operand 2 | operand 3 | 000000000000000000 | modi 1 | 0000
row 2: operand 4 | 00000000 | operand 0 | 000000000000000000000 | stall | y | r-bar | w-bar | b-mask | reuse | 00

Modifier Group 1

Binary  Value
00      IDX
01      UP
10      DOWN
11      BFLY
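
The Modifier Group 1 table is a plain two-bit lookup. A minimal sketch of decoding and encoding it in Python follows; `decode_mode` and `encode_mode` are hypothetical names, and the bit strings are taken exactly as printed in the table, without assuming where the two bits sit in the instruction word.

```python
# Modifier Group 1: two-bit value -> shuffle mode, as listed in the table.
MODES = {0b00: "IDX", 0b01: "UP", 0b10: "DOWN", 0b11: "BFLY"}
MODE_BITS = {name: value for value, name in MODES.items()}


def decode_mode(bits):
    """Map a two-character binary string such as '10' to its mode name."""
    if len(bits) != 2 or set(bits) - {"0", "1"}:
        raise ValueError("expected two binary digits, got " + repr(bits))
    return MODES[int(bits, 2)]


def encode_mode(name):
    """Map a mode name such as 'BFLY' back to its two-character binary string."""
    return format(MODE_BITS[name.upper()], "02b")
```

Round-tripping every mode through `encode_mode` and `decode_mode` should return the original name.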

SHFL
P,R,R,R,I
distilled: @P0 SHFL.IDX P0, R0, R0, R0, 0x0 ;
key: SHFL_P_R_R_R_I
Encoding (two 64-bit rows, fields in display order):
row 1: 100100011010 | pred | operand 1 | operand 2 | operand 3 | operand 4 | 0000 | modi 1 | 0000
row 2: 0000000000000000 | operand 0 | 000000000000000000000 | stall | y | r-bar | w-bar | b-mask | reuse | 00

Modifier Group 1

Binary  Value
00      IDX
01      UP
10      DOWN
11      BFLY

SHFL
P,R,R,I,R
distilled: @P0 SHFL.IDX P0, R0, R0, 0x0, R0 ;
key: SHFL_P_R_R_I_R
Encoding (two 64-bit rows, fields in display order):
row 1: 100100011001 | pred | operand 1 | operand 2 | 00000000000000000000 | operand 3 | modi 1 | 0000
row 2: operand 4 | 00000000 | operand 0 | 000000000000000000000 | stall | y | r-bar | w-bar | b-mask | reuse | 00

Modifier Group 1

Binary  Value
00      IDX
01      UP
10      DOWN
11      BFLY

SHFL
P,R,R,I,I
distilled: @P0 SHFL.IDX P0, R0, R0, 0x0, 0x0 ;
key: SHFL_P_R_R_I_I
Encoding (two 64-bit rows, fields in display order):
row 1: 100100011111 | pred | operand 1 | operand 2 | 00000000 | operand 4 | operand 3 | operand 4 | operand 3 | modi 1 | 0000
row 2: 0000000000000000 | operand 0 | 000000000000000000000 | stall | y | r-bar | w-bar | b-mask | reuse | 00

Modifier Group 1

Binary  Value
00      IDX
01      UP
10      DOWN
11      BFLY
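
This page gives only the encoding, not the behavior of the four modes. The sketch below simulates them on the CPU for a full 32-lane warp, under two assumptions that are not stated here: that SHFL behaves like the PTX `shfl.sync` instruction (minus the member mask, so every lane is treated as active), and that the operands map as operand 0 = predicate result, operand 1 = destination, operand 2 = source value, operand 3 = lane index, delta, or XOR mask, and operand 4 = clamp and segment mask. The function name `shfl` and its parameters are illustrative only.

```python
WARP_SIZE = 32


def shfl(mode, values, b, c):
    """Simulate one warp-wide shuffle (assumed PTX shfl.sync-like semantics).

    values: 32 per-lane source values (assumed to be operand 2)
    b:      lane index, delta, or XOR mask (assumed to be operand 3)
    c:      clamp in bits 0-4, segment mask in bits 8-12 (assumed operand 4)
    Returns (results, preds): per-lane destination values and predicates.
    """
    if len(values) != WARP_SIZE:
        raise ValueError("expected one value per lane")
    bval = b & 0x1F
    cval = c & 0x1F
    segmask = (c >> 8) & 0x1F
    results, preds = [], []
    for lane in range(WARP_SIZE):
        # Bounds of the segment this lane belongs to.
        max_lane = (lane & segmask) | (cval & ~segmask & 0x1F)
        min_lane = lane & segmask
        if mode == "UP":
            src = lane - bval
            ok = src >= max_lane  # for UP the clamp acts as a lower bound
        elif mode == "DOWN":
            src = lane + bval
            ok = src <= max_lane
        elif mode == "BFLY":
            src = lane ^ bval
            ok = src <= max_lane
        elif mode == "IDX":
            src = min_lane | (bval & ~segmask & 0x1F)
            ok = src <= max_lane
        else:
            raise ValueError("unknown mode: " + mode)
        # An out-of-range source leaves the lane with its own value.
        results.append(values[src] if ok else values[lane])
        preds.append(ok)
    return results, preds
```

With `values = list(range(32))`, calling `shfl("BFLY", values, 1, 0x1F)` swaps each pair of neighboring lanes, and `shfl("UP", values, 1, 0)` shifts values up by one lane while lane 0 keeps its own value with a false predicate.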