bitorch_engine.utils.quant_operators.gptq_style_zeros_packing

bitorch_engine.utils.quant_operators.gptq_style_zeros_packing(zeros: Tensor, w_bit: int, out_features: int, group_size: int) → Tensor

Packs the zeros (zero-point) tensor in GPTQ style, storing several w_bit-wide values per integer word for compact storage and efficient computation.

Parameters:
  • zeros (torch.Tensor) – Input tensor containing the quantization zero points to pack.

  • w_bit (int) – Number of bits for weight quantization.

  • out_features (int) – Number of output features.

  • group_size (int) – Quantization group size, i.e. the number of weights that share one zero point.

Returns:

Packed tensor with reduced storage.

Return type:

torch.Tensor
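
Example:

The following is a minimal, library-independent sketch of the general idea behind GPTQ-style packing: several w_bit-wide unsigned values are stored side by side in one 32-bit word, lowest bits first. It is not the implementation of gptq_style_zeros_packing. The helper names pack_low_bit_values and unpack_low_bit_values are hypothetical, plain Python lists stand in for tensors, and the bit order is an assumption that may differ from what the library uses.

```python
def pack_low_bit_values(values, w_bit):
    """Pack unsigned w_bit-wide integers into 32-bit words, lowest bits first.

    Hypothetical helper for illustration only; not part of bitorch_engine.
    """
    if 32 % w_bit != 0:
        raise ValueError("this sketch only handles bit widths that divide 32")
    per_word = 32 // w_bit  # how many values fit in one 32-bit word
    if len(values) % per_word != 0:
        raise ValueError("number of values must be a multiple of 32 // w_bit")

    packed = []
    for start in range(0, len(values), per_word):
        word = 0
        for i, value in enumerate(values[start:start + per_word]):
            if not 0 <= value < (1 << w_bit):
                raise ValueError("value does not fit in w_bit bits")
            # Place the i-th value in bits [w_bit * i, w_bit * (i + 1)).
            word |= value << (w_bit * i)
        packed.append(word)
    return packed


def unpack_low_bit_values(packed, w_bit):
    """Inverse of pack_low_bit_values (also hypothetical)."""
    mask = (1 << w_bit) - 1
    per_word = 32 // w_bit
    return [(word >> (w_bit * i)) & mask
            for word in packed
            for i in range(per_word)]


# Sixteen 4-bit zero points fit into two 32-bit words.
zero_points = [8, 7, 8, 8, 9, 8, 7, 8, 0, 15, 1, 14, 2, 13, 3, 12]
packed = pack_low_bit_values(zero_points, w_bit=4)
restored = unpack_low_bit_values(packed, w_bit=4)
```

With w_bit = 4, each word holds 32 // 4 = 8 values, so storage shrinks by a factor of 8 compared with one 32-bit integer per zero point, and unpacking recovers the original values exactly.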