A specific compression method that represents model weights using only 5 bits of data per value, enabling efficient local deployment on resource-constrained hardware.