bitorch_engine.layers.qlinear.nbit.cuda.mpq_layer
Classes
|
Represents a CUDA-compatible implementation of the mixed-precision quantized (MPQ) linear layer, inheriting from MPQLinearBase. |
|
A custom autograd function for mixed-precision quantized (MPQ) linear operations on CUDA. |
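
The summaries above do not say how an MPQ linear operation is computed. As a rough conceptual illustration only, the following pure-Python sketch stores each weight row as low-bit integer codes with a per-row scale and minimum, then dequantizes on the fly during the matrix-vector product. All names here (`quantize_row`, `dequantize_row`, `mpq_linear`, `bits_per_row`) are hypothetical and are not part of bitorch_engine. The per-row choice of bit width is an assumption about what "mixed precision" could mean, and the sketch leaves out the CUDA kernels and the autograd backward pass entirely.

```python
from typing import List, Tuple

Row = Tuple[List[int], float, float]  # (integer codes, scale, minimum)


def quantize_row(row: List[float], bits: int) -> Row:
    """Map one weight row onto unsigned `bits`-bit integer codes."""
    qmax = (1 << bits) - 1
    lo, hi = min(row), max(row)
    # A constant row would give a zero scale, so fall back to 1.0.
    scale = (hi - lo) / qmax if hi > lo else 1.0
    codes = [min(qmax, max(0, round((w - lo) / scale))) for w in row]
    return codes, scale, lo


def dequantize_row(qrow: Row) -> List[float]:
    """Recover approximate floating-point weights from the codes."""
    codes, scale, lo = qrow
    return [c * scale + lo for c in codes]


def mpq_linear(x: List[float], qweight: List[Row]) -> List[float]:
    """Compute y = W_hat @ x, dequantizing each row on the fly."""
    return [sum(w * xi for w, xi in zip(dequantize_row(q), x)) for q in qweight]


# Hypothetical 2x3 weight matrix; row 0 keeps 4 bits, row 1 only 2 bits.
weights = [[-1.0, 0.25, 1.0], [0.5, -0.5, 0.0]]
bits_per_row = [4, 2]
qweight = [quantize_row(r, b) for r, b in zip(weights, bits_per_row)]
y = mpq_linear([1.0, 2.0, 3.0], qweight)
```

With this rounding scheme each reconstructed weight differs from the original by at most half of its row's scale, so rows given fewer bits carry a larger error.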