bitorch_engine.layers.qmha.binary.layer
Classes
Implements a binary version of multi-head attention (MHA) where the linear transformations are executed using binary operations to improve efficiency.
A module that introduces a learnable bias term to the input tensor.
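
As a rough illustration of what "linear transformations executed using binary operations" can mean, the sketch below binarizes inputs and weights to {-1, +1}, packs them into integer bitmasks, and computes each dot product with XOR and a population count. A plain list of floats stands in for a learnable bias term. All names here (`binarize`, `binary_dot`, `binary_linear`) are hypothetical and written in pure Python for clarity; this is not the bitorch_engine API, and the module's actual kernels, binarization scheme, and bias placement may differ.

```python
def binarize(values):
    """Pack the signs of `values` into an int: bit i is 1 if values[i] >= 0 (+1), else 0 (-1)."""
    bits = 0
    for i, v in enumerate(values):
        if v >= 0:
            bits |= 1 << i
    return bits


def binary_dot(a_bits, b_bits, n):
    """Dot product of two length-n {-1, +1} vectors stored as bitmasks.

    Positions where the bits differ contribute -1, matching positions +1,
    so the result is n - 2 * popcount(a XOR b).
    """
    mismatches = bin(a_bits ^ b_bits).count("1")
    return n - 2 * mismatches


def binary_linear(x, weight_rows, bias):
    """Binarized linear map: one XOR/popcount dot product per weight row, plus a bias term.

    `bias` stands in for a learnable parameter (assumption: added after the product).
    """
    n = len(x)
    x_bits = binarize(x)
    return [binary_dot(x_bits, binarize(row), n) + b
            for row, b in zip(weight_rows, bias)]


x = [0.5, -1.2, 0.3, -0.1]             # signs: +, -, +, -
weights = [[1.0, -1.0, 1.0, -1.0],     # same sign pattern as x
           [1.0, 1.0, -1.0, -1.0]]     # agrees with x in two of four positions
bias = [0.5, -0.5]
result = binary_linear(x, weights, bias)
print(result)
```

The appeal of this formulation is that a dot product over n elements reduces to one XOR and one popcount per machine word instead of n floating-point multiply-adds, which is where the efficiency gain of binary layers generally comes from.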