or: Training With Quantization Noise For Extreme Model Compression

Scalar quantization of weights, or PQ of grouped weights, can be performed during training if only a small random selection of the network’s weights are quantized during any given batch.