Commit 483ef10
[Inductor][float8] Support qlinear for float8 in inductor (#2565)
* quantize_affine_float8/dequantize_affine_float8 not decomposed on inductor
* remove redundant unittest.skipIf
* fix rebase issue
* change dispatch key to a flag decomposed
* support scaled_mm on inductor
* fix rebase issue
* support dequant promotion for fp8
* add ut
* remove redundant code
* fix lint
* resolve conflict
* change to use qlinear
* add ut
* fix lint
* support fp8 quant_lift_up
* add reshape into _VIEW_METHOD_OPS
* add quant_input_check
* fix lint
* refine ut
* remove fp8 dynamic quant ut
* fix output_scale issue
* add float8_e4m3fn to dtype_list
* refine code
* refine code
* fix bugs
* add comment
* merge main
* change to use non-decomposed q/dq
* fix lint
* add version check
* change version
* fix attention bug; update ut
* add liftup oplist
1 parent 1c55f61 commit 483ef10
2 files changed (+834, -250 lines)
- test/quantization/pt2e
- torchao/quantization/pt2e/inductor_passes