bitorch_engine.functions.cuda.functions
Functions
Converts a 32-bit floating point tensor to a 4-bit integer representation.
Packs a tensor into a 4-bit packed format using CUDA accelerated functions.
Unpacks a tensor that has been previously packed using 4-bit quantization into its original format.
Unpacks a tensor that has been previously packed using 4-bit quantization into its original format.
Packs the given tensor into an 8-bit unsigned integer tensor representation on CUDA.
Unpacks an 8-bit unsigned integer tensor into a floating-point tensor using CUDA, scaling the unpacked values by a provided scale tensor.
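
The 4-bit entries above describe a quantize, pack, unpack round trip. The following is a minimal CPU-only sketch of that idea in plain Python, not the library's CUDA implementation. The function names (`quantize_to_int4`, `pack_int4`, `unpack_int4`, `dequantize_int4`), the offset-by-8 encoding, the single scalar `scale`, and the low-nibble-first byte layout are all assumptions made for illustration; the actual kernels may use a different layout and signature.

```python
# Illustrative only: hypothetical helpers, not part of bitorch_engine.

def quantize_to_int4(values, scale):
    """Map floats to integers in [0, 15]: round(v / scale) + 8, clamped."""
    return [min(15, max(0, round(v / scale) + 8)) for v in values]

def pack_int4(nibbles):
    """Pack pairs of 4-bit values into bytes (assumed layout: low nibble first)."""
    if len(nibbles) % 2:
        nibbles = nibbles + [0]  # pad to an even count
    pairs = zip(nibbles[0::2], nibbles[1::2])
    return bytes(lo | (hi << 4) for lo, hi in pairs)

def unpack_int4(packed, count):
    """Split each byte back into two 4-bit values and drop any padding."""
    out = []
    for byte in packed:
        out.append(byte & 0x0F)  # low nibble
        out.append(byte >> 4)    # high nibble
    return out[:count]

def dequantize_int4(nibbles, scale):
    """Undo the offset and multiply by the scale to recover approximate floats."""
    return [(n - 8) * scale for n in nibbles]

weights = [-0.8, -0.1, 0.0, 0.3, 0.7]
scale = 0.1

q = quantize_to_int4(weights, scale)
packed = pack_int4(q)                  # two values per byte
restored = unpack_int4(packed, len(q))
approx = dequantize_int4(restored, scale)
```

In this sketch five values occupy three bytes, and unpacking recovers the quantized integers exactly. The floating-point values are recovered only up to the quantization error, which is why the final entry in the list takes a scale when unpacking.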