bitorch_engine.layers.qlinear.nbit.cuda.mpq_layer

Classes

MPQLinearCuda(*args, **kwargs)

A CUDA implementation of the mixed-precision quantized (MPQ) linear layer, inheriting from MPQLinearBase.

MPQLinearCudaFunction(*args, **kwargs)

A custom autograd function for mixed-precision quantized (MPQ) linear operations on CUDA.
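
For readers unfamiliar with the pattern, the sketch below shows the general shape of a custom autograd function for a quantized linear operation, using only plain PyTorch on CPU. It is not the bitorch_engine implementation: the class name `ToyQuantLinearFunction`, the arguments `qweight` and `scale`, and the single-scale dequantization scheme are all assumptions made for illustration, and the actual CUDA kernels and signature of MPQLinearCudaFunction may differ.

```python
import torch


class ToyQuantLinearFunction(torch.autograd.Function):
    """Hypothetical stand-in; NOT the bitorch_engine MPQLinearCudaFunction."""

    @staticmethod
    def forward(ctx, x, qweight, scale):
        # Assumption: weights are stored as integers and dequantized
        # with one scalar scale before the matrix multiplication.
        weight = qweight.to(x.dtype) * scale
        ctx.save_for_backward(weight)
        return x @ weight.t()

    @staticmethod
    def backward(ctx, grad_output):
        (weight,) = ctx.saved_tensors
        # Gradient with respect to the input activations only.
        grad_x = grad_output @ weight
        # The integer weights and the scale are treated as frozen here,
        # so no gradient is returned for them.
        return grad_x, None, None


x = torch.randn(2, 4, requires_grad=True)
qweight = torch.randint(-8, 8, (3, 4), dtype=torch.int8)  # 4-bit value range
scale = torch.tensor(0.1)

y = ToyQuantLinearFunction.apply(x, qweight, scale)
y.sum().backward()
```

A function of this kind is invoked through `.apply(...)` rather than instantiated, which is why a layer class typically wraps it and owns the quantized parameters.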