Mixed Quantization Scaling Law

From GM-RKB
(Redirected from Multi-Precision Scaling Law)
Jump to navigation Jump to search

A Mixed Quantization Scaling Law is a quantization scaling law that optimizes performance-efficiency trade-offs through heterogeneous bit-precision allocation across model components.