UBLKRED
Bulk Data Copy from Shared Memory with Reduction
2 variants on SM90 (H100)
UBLKRED.???0
[UR],[UR],UR
distilled:
@UP0 UBLKRED.G.???0.ADD [UR0], [UR0], UR0 ;
key: UBLKRED_UR_UR_UR
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
| 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 0 | pred | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | operand 1 | 0 | operand 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
| operand 2 | 0 | 0 | modi 1 | 0 | 0 | 0 | 0 | 0 | 0 | modi 2 | 0 | 0 | modi 3 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | stall | y | r-bar | w-bar | b-mask | reuse | 0 | 0 |
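The twelve fixed cells that open the first encoding row can be packed into a single number for quick pattern matching. The sketch below assumes that column 0 is the least significant bit, which the table itself does not state; `FIXED_BITS` and `pack_lsb_first` are illustrative names, not part of any tool.

```python
# Fixed cells from columns 0-11 of the first encoding row, in table order.
FIXED_BITS = [1, 1, 0, 1, 1, 1, 0, 1, 1, 1, 0, 0]

def pack_lsb_first(bits):
    """Pack a list of bits into an int, treating index 0 as bit 0 (assumption)."""
    value = 0
    for position, bit in enumerate(bits):
        value |= bit << position
    return value

opcode = pack_lsb_first(FIXED_BITS)
print(hex(opcode))  # 0x3bb under the LSB-first assumption
```

If the columns were instead most significant bit first, the same cells would pack to a different value, so the result is only as reliable as that ordering assumption.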
Modifier Group 1
| Binary | Value |
|---|---|
| 00 | G |
| 01 | S |
| 10 | G.S |
| 11 | S.S |
Modifier Group 2
| Binary | Value |
|---|---|
| 0000 | (default) |
| 0001 | S32 |
| 0010 | U64 |
| 0011 | S64 |
| 0100 | F16.RN |
| 0101 | F32.RN |
| 0110 | F32.FTZ.RN |
| 0111 | F64.RN |
| 1000 | BF16.RN |
| 1001 | INVALID9 |
| 1010 | INVALID10 |
| 1011 | INVALID11 |
| 1100 | INVALID12 |
| 1101 | INVALID13 |
| 1110 | INVALID14 |
| 1111 | INVALID15 |
Modifier Group 3
| Binary | Value |
|---|---|
| 000 | ADD |
| 001 | MIN |
| 010 | MAX |
| 011 | INC |
| 100 | DEC |
| 101 | AND |
| 110 | OR |
| 111 | XOR |
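The three modifier tables translate directly into lookup tables. The following is a minimal sketch that turns raw field values into the dotted suffix seen in the distilled line. The function and table names are hypothetical, and rendering the unnamed default of group 2 as `???0` simply mirrors the distilled form rather than any confirmed disassembler behaviour.

```python
# Lookup tables transcribed from Modifier Groups 1-3.
GROUP1 = {0b00: "G", 0b01: "S", 0b10: "G.S", 0b11: "S.S"}

GROUP2 = {
    0b0000: "???0",  # "(default)" in the table; placeholder taken from the distilled line
    0b0001: "S32",
    0b0010: "U64",
    0b0011: "S64",
    0b0100: "F16.RN",
    0b0101: "F32.RN",
    0b0110: "F32.FTZ.RN",
    0b0111: "F64.RN",
    0b1000: "BF16.RN",
}
# Values 9-15 are listed as INVALID9 .. INVALID15.
GROUP2.update({n: f"INVALID{n}" for n in range(9, 16)})

GROUP3 = {
    0b000: "ADD", 0b001: "MIN", 0b010: "MAX", 0b011: "INC",
    0b100: "DEC", 0b101: "AND", 0b110: "OR", 0b111: "XOR",
}

def modifier_suffix(g1, g2, g3):
    """Join the decoded modifiers in the order used by the distilled line."""
    return ".".join((GROUP1[g1], GROUP2[g2], GROUP3[g3]))

print("UBLKRED." + modifier_suffix(0b00, 0b0000, 0b000))
```

With all three fields set to zero, the suffix matches the distilled mnemonic `UBLKRED.G.???0.ADD`.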
UBLKRED.???0
[UR],[UR],UR,desc[UR]
distilled:
@UP0 UBLKRED.G.???0.ADD [UR0], [UR0], UR0, desc[UR0] ;
key: UBLKRED_UR_UR_UR_UR
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
| 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 0 | pred | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | operand 1 | 0 | operand 0 | 0 | operand 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
| operand 2 | 0 | 0 | modi 1 | 0 | 1 | 0 | 0 | 0 | 0 | modi 2 | 0 | 0 | modi 3 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | stall | y | r-bar | w-bar | b-mask | reuse | 0 | 0 |
Modifier Group 1
| Binary | Value |
|---|---|
| 00 | G |
| 01 | S |
| 10 | G.S |
| 11 | S.S |
Modifier Group 2
| Binary | Value |
|---|---|
| 0000 | (default) |
| 0001 | S32 |
| 0010 | U64 |
| 0011 | S64 |
| 0100 | F16.RN |
| 0101 | F32.RN |
| 0110 | F32.FTZ.RN |
| 0111 | F64.RN |
| 1000 | BF16.RN |
| 1001 | INVALID9 |
| 1010 | INVALID10 |
| 1011 | INVALID11 |
| 1100 | INVALID12 |
| 1101 | INVALID13 |
| 1110 | INVALID14 |
| 1111 | INVALID15 |
Modifier Group 3
| Binary | Value |
|---|---|
| 000 | ADD |
| 001 | MIN |
| 010 | MAX |
| 011 | INC |
| 100 | DEC |
| 101 | AND |
| 110 | OR |
| 111 | XOR |
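The two variants differ only in the optional trailing `desc[UR]` operand. As a rough illustration, the sketch below rebuilds the distilled text for either form. The function name and parameter names are hypothetical, and the register arguments are named only by their position in the text, since the page does not say what each operand means.

```python
def format_ublkred(suffix, ur_first, ur_second, ur_third, ur_desc=None, pred="UP0"):
    """Render a UBLKRED line in the shape of the distilled examples.

    ur_first/ur_second/ur_third are uniform-register numbers in textual order;
    ur_desc, when given, appends the desc[UR] operand of the second variant.
    """
    operands = f"[UR{ur_first}], [UR{ur_second}], UR{ur_third}"
    if ur_desc is not None:
        operands += f", desc[UR{ur_desc}]"
    return f"@{pred} UBLKRED.{suffix} {operands} ;"

# First variant: three operands.
print(format_ublkred("G.???0.ADD", 0, 0, 0))
# Second variant: same operands plus the descriptor register.
print(format_ublkred("G.???0.ADD", 0, 0, 0, ur_desc=0))
```

Called with register 0 everywhere, the two calls reproduce the distilled lines of the two variants.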