bitorch_engine.layers.qlinear.nbit.cutlass.q8_layer
Classes
|
Implements an 8-bit quantized linear layer using CUTLASS for efficient computation. |
|
Implements a quantized linear function using 8-bit quantization for both activations and weights. |