-
Notifications
You must be signed in to change notification settings - Fork 77
Pull requests: intel/auto-round
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Support quant for
meituan-longcat/LongCat-Flash-Lite
#1388
opened Feb 3, 2026 by
Kaihui-intel
Loading…
1 of 9 tasks
Optimize CPU RAM peak memory during quantization
#1386
opened Feb 3, 2026 by
lvliang-intel
Loading…
3 of 9 tasks
Implement MXFP4Handler MXFP8Handler NVFP4Handler
#1385
opened Feb 3, 2026 by
xin3he
Loading…
3 of 9 tasks
fix Qwen3-VL model auto_awq export, add auto_awq vllm ut
#1384
opened Feb 2, 2026 by
WeiweiZhang1
Loading…
3 of 9 tasks
support llm dynamic wint8aint8 export
#1376
opened Jan 30, 2026 by
mengniwang95
Loading…
1 of 6 tasks
Refactor module access to use PyTorch get/set_submodule API
#1365
opened Jan 29, 2026 by
scopophobic
Loading…
refactor init of compressor
engineering
ready
only add when the PR is ready to merge
#1339
opened Jan 26, 2026 by
n1ck-guo
Loading…
1 of 9 tasks
Optimize FP8 layer conversion by skipping weight initialization
#1295
opened Jan 16, 2026 by
Copilot
AI
Loading…
Robust FP8 layer detection for ignore_layers (#1283)
#1289
opened Jan 15, 2026 by
scopophobic
Loading…
Fix ignore_layers not working for FP8 models
#1286
opened Jan 15, 2026 by
Copilot
AI
Loading…
11 tasks done
[WIP][refactor quanizers][step 1] refactor rtn and tuning
#1278
opened Jan 14, 2026 by
n1ck-guo
Loading…
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.