bitorch_engine.layers.qlinear.nbit.cuda.mbwq_layer
Classes
Implements a Mixed-BitWidth Quantized (MBWQ) linear layer for CUDA devices.

Custom CUDA function for performing forward and backward passes in MBWQ Linear layers.
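
As a rough illustration of the idea behind a mixed-bitwidth quantized linear layer, the following pure-Python sketch quantizes different groups of one weight row at different bit widths and dequantizes them for the forward pass. It is not the bitorch_engine implementation or API: the function names, the grouping scheme, and the uniform min/max quantizer are all assumptions made for illustration, and no CUDA kernel or backward pass is shown.

```python
# Illustrative sketch only. All names here are hypothetical and are NOT
# part of bitorch_engine; the real layer runs packed weights on CUDA.
from typing import List, Sequence, Tuple

QuantGroup = Tuple[List[int], float, float, int]  # (codes, scale, zero, bits)


def quantize_group(weights: Sequence[float], bits: int) -> QuantGroup:
    """Uniformly quantize one weight group to `bits` bits (assumed scheme)."""
    levels = (1 << bits) - 1              # largest integer code
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((w - lo) / scale) for w in weights]
    return codes, scale, lo, bits


def dequantize_group(group: QuantGroup) -> List[float]:
    """Map integer codes back to approximate floating-point weights."""
    codes, scale, zero, _bits = group
    return [c * scale + zero for c in codes]


def mixed_bitwidth_forward(x: Sequence[float], groups: Sequence[QuantGroup]) -> float:
    """Compute one output feature: dot(x, w), dequantizing w group by group."""
    weights: List[float] = []
    for group in groups:
        weights.extend(dequantize_group(group))
    if len(weights) != len(x):
        raise ValueError("input length must match total number of weights")
    return sum(xi * wi for xi, wi in zip(x, weights))


# One weight row split into two groups stored at different precisions.
groups = [
    quantize_group([0.0, 1.0, 2.0, 3.0], bits=2),  # 4 levels
    quantize_group([-1.0, 1.0], bits=1),           # 2 levels
]
y = mixed_bitwidth_forward([1.0] * 6, groups)
```

Giving each group its own bit width, scale, and zero point is what lets precision vary across one weight matrix in this sketch; a production layer would typically keep the codes packed in integer tensors and dequantize inside the kernel.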