fused_act_dequant
- paddle.incubate.nn.functional.fused_act_dequant(x: Tensor, x_scale: Tensor) -> Tensor
Applies a fused activation and dequantization operation that converts float8 quantized data back to bfloat16.
- Parameters
x (Tensor) – Input quantized tensor with dtype float8_e4m3fn and shape [M, N]. This tensor contains the quantized activations from previous layers.
x_scale (Tensor) – Dequantization scale tensor with dtype float32 and shape [M, (N + 127) // 128]. Each scale value corresponds to a 128-column block in the input tensor.
- Returns
Tensor. Dequantized output tensor with dtype bfloat16 and shape [M, N]. The values are computed as input * scale, where each scale value applies to its corresponding 128-column block.
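
The block-wise scaling described above can be sketched in NumPy to make the shape relationship between the input and the scale tensor concrete. This is a minimal reference sketch, not the Paddle kernel: `dequant_reference` is a hypothetical helper name, NumPy has no float8_e4m3fn or bfloat16 dtype so float32 stands in for both, and only the `input * scale` step stated in the Returns description is modeled.

```python
import numpy as np

BLOCK = 128  # columns covered by one scale value, per the description above


def dequant_reference(x, x_scale):
    """Hypothetical reference: multiply each 128-column block by its scale.

    float32 stands in for float8_e4m3fn (input) and bfloat16 (output),
    since NumPy provides neither dtype.
    """
    m, n = x.shape
    num_blocks = (n + BLOCK - 1) // BLOCK  # same as (N + 127) // 128
    assert x_scale.shape == (m, num_blocks), "x_scale must be [M, (N + 127) // 128]"
    # Repeat each scale across its block, then trim the last block to N columns.
    scale_full = np.repeat(x_scale, BLOCK, axis=1)[:, :n]
    return x.astype(np.float32) * scale_full


# N = 130 needs two scale columns: one for columns 0..127, one for 128..129.
x = np.ones((2, 130), dtype=np.float32)
x_scale = np.array([[2.0, 3.0], [4.0, 5.0]], dtype=np.float32)
out = dequant_reference(x, x_scale)
print(out.shape)  # (2, 130)
```

With N = 130, the second scale column covers only the two trailing columns, which is why the scale shape uses a ceiling division rather than N // 128.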