I measured 360+ configs — quantization often costs energy below the crossover point

hongpingzhang · May 14, 2026, 12:16pm

A couple of things I’d love community input on:

Has anyone reproduced the LLM.int8() default energy regression on Hopper (H100/H200)? My data stops at Blackwell (RTX 5090).
Any GPTQ / AWQ / GGUF k-quant numbers with wall-power (NVML / RAPL)? My benchmark only covers bitsandbytes NF4 / INT8.
Apple Silicon / Jetson — unified memory likely changes the dequant story; I have no numbers there.

Happy to add submitted hardware rows to the public dataset with attribution.

Topic		Replies	Views
Current State and Future of "Integer-Only" LLM Inference (Non-Floating Point) 🤗Transformers	1	174	April 14, 2026
Qunatized model with LORA takes much more GPU memory than the un-quantized model with LORA for the (E-5-Large Embedding Transformer) 🤗Transformers	4	1918	October 8, 2023
Correct Usage of BitsAndBytesConfig 🤗Transformers	4	31185	March 18, 2023
Loading llama3.21B in quantized config shows no change in size Beginners	1	95	December 10, 2024
Low bf16 performance on TPU, int4 vs int8 quantizatoin 🤗Accelerate	0	461	June 1, 2024