bitorch_engine.layers.qlinear.nbit.mps.mpq_layer
Classes
|
Represents a MPS-compatible implementation of the mixed precision quantized (MPQ) linear layer, inheriting from MPQLinearBase. |
|
A custom autograd function for mixed-precision quantized (MPQ) linear operations on MPS acceleated by mlx. |