bitorch_engine.utils.quant_operators
Functions
Sets a specific bit in an integer variable to a given value.
Binarizes an array column-wise, transforming each element into a binary representation.
Binarizes an input NDArray (nd_row) into a binary representation (binary_row) based on the specified number of bits per binary word (bits_per_binary_word).
Reconstructs the fp16 weight tensor from the input quantized weight parameter in GPTQ style.
Packs the zeros tensor in GPTQ style for efficient storage and computation.
Quantizes the given tensor using specified quantization parameters.
Quantizes an input tensor to 4-bit integers using uniform quantization.
Quantizes an input tensor to 8-bit integers using uniform quantization.
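
To make two of the summaries above concrete (setting a single bit, and uniform quantization to 4 or 8 bits), here is a minimal sketch in plain Python. It is illustrative only: the names `set_bit_sketch`, `uniform_quantize_sketch`, and `dequantize_sketch` are hypothetical and are not the functions of `bitorch_engine.utils.quant_operators`. The sketch works on Python lists rather than tensors, and it assumes an asymmetric min/max scheme with unsigned codes, which may differ from what the module actually does.

```python
def set_bit_sketch(value: int, pos: int, bit: int) -> int:
    """Return `value` with the bit at index `pos` (0 = least significant) set to `bit`."""
    if bit not in (0, 1):
        raise ValueError("bit must be 0 or 1")
    mask = 1 << pos
    # OR with the mask forces the bit to 1; AND with the inverted mask forces it to 0.
    return (value | mask) if bit else (value & ~mask)


def uniform_quantize_sketch(values, num_bits):
    """Map floats to unsigned integer codes in [0, 2**num_bits - 1].

    Assumption: asymmetric min/max quantization with one scale and one
    zero point for the whole input. Returns (codes, scale, zero_point).
    """
    qmax = (1 << num_bits) - 1
    lo, hi = min(values), max(values)
    # Step size between adjacent codes; fall back to 1.0 for constant input.
    scale = (hi - lo) / qmax if hi > lo else 1.0
    # Integer code that represents the real value 0.0 after shifting.
    zero_point = round(-lo / scale)
    # Round to the nearest code, then clamp into the representable range.
    codes = [min(qmax, max(0, round(v / scale) + zero_point)) for v in values]
    return codes, scale, zero_point


def dequantize_sketch(codes, scale, zero_point):
    """Approximate inverse of uniform_quantize_sketch."""
    return [(c - zero_point) * scale for c in codes]


if __name__ == "__main__":
    print(bin(set_bit_sketch(0b1010, 0, 1)))
    codes4, scale4, zp4 = uniform_quantize_sketch([0.0, 1.0, 3.0], num_bits=4)
    print(codes4, dequantize_sketch(codes4, scale4, zp4))
    codes8, _, _ = uniform_quantize_sketch([-2.0, 0.3, 5.0], num_bits=8)
    print(codes8)
```

The same helper covers both the 4-bit and the 8-bit case because only `num_bits` changes the size of the code range. With fewer bits, the step size `scale` grows, and so does the worst-case rounding error of roughly half a step.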