-
Notifications
You must be signed in to change notification settings - Fork 105
Pull requests: vllm-project/llm-compressor
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Tests] Fix oneshot + finetune test by passing splits to oneshot
ready
When a PR is ready for review
#1316
opened Apr 2, 2025 by
kylesayrs
Loading…
Fix Multi-Context Manager Syntax for Python 3.9 Compatibility
ready
When a PR is ready for review
#1313
opened Apr 2, 2025 by
rahul-tuli
Loading…
bugfix kv cache quantization with ignored layers
#1312
opened Apr 1, 2025 by
brian-dellabetta
•
Draft
Update nm-actions/changed-files to v1.16.0
ready
When a PR is ready for review
#1311
opened Apr 1, 2025 by
dbarbuzzi
Loading…
[Tracing] Remove
TraceableWhisperForConditionalGeneration
#1310
opened Apr 1, 2025 by
kylesayrs
Loading…
Reduce SmoothQuant Repr
ready
When a PR is ready for review
#1289
opened Mar 27, 2025 by
kylesayrs
Loading…
Smoothquant typehinting and onloading context
ready
When a PR is ready for review
#1285
opened Mar 26, 2025 by
kylesayrs
Loading…
[BugFix] Directly Convert Modifiers to Recipe Instance
ready
When a PR is ready for review
#1271
opened Mar 20, 2025 by
rahul-tuli
Loading…
[Tests] Add mark skip for GPU
ready
When a PR is ready for review
#1264
opened Mar 18, 2025 by
kylesayrs
Loading…
[Performance] Sequential onloading
ready
When a PR is ready for review
#1263
opened Mar 18, 2025 by
kylesayrs
Loading…
[Quantization] Channel-wise Output Activation Quantization for Attention QKV Modules + KV-cache channel quantization
ready
When a PR is ready for review
[Callbacks][Docs] Add docstrings to saving functions
ready
When a PR is ready for review
#1201
opened Feb 26, 2025 by
kylesayrs
Loading…
[Callbacks] Merge When a PR is ready for review
on_event
with on_update
, remove MagnitudePruningModifier.leave_enabled
ready
#1199
opened Feb 26, 2025 by
kylesayrs
Loading…
[Utils] Replace When a PR is ready for review
preserve_attr
with patch_attr
ready
#1187
opened Feb 24, 2025 by
kylesayrs
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-03-30.