Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Bugfix] Strip Gemma4 string delimiters from dict keys bug Something isn't working tool-calling
#44756 opened Jun 7, 2026 by he-yufeng Contributor Loading…
4 tasks done
[Bugfix] Shut down engine cores on startup handshake failure bug Something isn't working v1
#44751 opened Jun 6, 2026 by fiddleboy Loading…
[Bugfix] Propagate ImportError from load_audio_pyav when vllm[audio] … bug Something isn't working multi-modality Related to multi-modality (#4194)
#44750 opened Jun 6, 2026 by littlecircle0730 Loading…
3 of 4 tasks
[Misc] Remove orphaned env vars and stale env-var references documentation Improvements or additions to documentation
#44749 opened Jun 6, 2026 by DaoyuanLi2816 Contributor Loading…
[Cohere] Fix Cohere2MoE weight loading when using Transformers ≥5.10 ready ONLY add when PR is ready to merge/full CI is needed
#44747 opened Jun 6, 2026 by Terrencezzj Contributor Loading…
4 tasks
[Bugfix] Harden allowed_token_ids metadata for spec-decode bug Something isn't working v1
#44742 opened Jun 6, 2026 by jperezdealgaba Contributor Loading…
[Bugfix] Gemma4 streaming parser for multi-boundary tool deltas bug Something isn't working tool-calling
#44741 opened Jun 6, 2026 by yasu-oh Loading…
4 tasks done
[Bugfix][Model] GraniteMoE: load FP8_DYNAMIC expert weight_scale tensors bug Something isn't working ci/build
#44739 opened Jun 6, 2026 by javierdejesusda Contributor Loading…
[Opt] Optimize rotary embedding cache length
#44738 opened Jun 6, 2026 by labAxiaoming Contributor Loading…
4 tasks
[Bugfix] Canonicalize FP8 weight layout to (K, N) at the source bug Something isn't working quantization ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm
#44735 opened Jun 6, 2026 by mgoin Member Loading…
3 of 4 tasks
[Bugfix][Rust Frontend] Set a structured-output backend so requests do not 500 bug Something isn't working rust
#44729 opened Jun 6, 2026 by Sunt-ing Contributor Loading…
[Bugfix] Fix shape mismatch crash and add logprob_token_ids support in RejectionSampler bug Something isn't working v1
#44727 opened Jun 6, 2026 by skajre Loading…
4 tasks done
[Bugfix][Core] Close underlying iterator in merge_async_iterators single-iterator fast path bug Something isn't working
#44726 opened Jun 6, 2026 by Sunt-ing Contributor Loading…
[Bugfix][Frontend] Fix Anthropic count_tokens decorator order driving server load negative bug Something isn't working frontend
#44725 opened Jun 6, 2026 by Sunt-ing Contributor Loading…
ProTip! Adding no:label will show everything without a label.