Pull requests: vllm-project/vllm
Pull requests list
[benchmark][structured output] Add offline benchmark script for structured output
needs-rebase
structured-output
#17437
opened Apr 30, 2025 by
lk-chen
fix missing _num_cached_tokens in subtract_num_batched_tokens
#17436
opened Apr 30, 2025 by
initzhang
[Bugfix] Fix AttributeError: 'State' object has no attribute 'engine_client'
frontend
#17434
opened Apr 30, 2025 by
chaunceyjiang
Support LoRA for Mistral3
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#17428
opened Apr 30, 2025 by
mgoin
[V1] Allow turning off pickle fallback in vllm.v1.serial_utils
v1
#17427
opened Apr 30, 2025 by
russellb
[Misc][AMD] Add query_platform method to interface.py
#17424
opened Apr 29, 2025 by
rasmith
[Chore] import as annotations on config
needs-rebase
ready
ONLY add when PR is ready to merge/full CI is needed
#17423
opened Apr 29, 2025 by
aarnphm
[Feature][CLI] Unify configuration for structured outputs via --structured-output-config
needs-rebase
structured-output
tool-calling
v1
documentation
Improvements or additions to documentation
#17420
opened Apr 29, 2025 by
aarnphm
[Model] Uses vllm_flash_attn for Qwen 2 VL catalogs of models
#17413
opened Apr 29, 2025 by
aarnphm
[Bugfix] Temporarily disable gptq_bitblas on ROCm
documentation
Improvements or additions to documentation
#17411
opened Apr 29, 2025 by
nlzy
[Frontend] Fix tool_call handling in llama3.1 and llama3.2 chat template to allow zero tool_calls
documentation
Improvements or additions to documentation
tool-calling
#17409
opened Apr 29, 2025 by
CatherineSue
[CI/Build] Fix docker command casing warning
ci/build
#17403
opened Apr 29, 2025 by
Luohaothu
Revert "[NVIDIA] Support Cutlass MLA for Blackwell GPUs (#16032)"
ci/build
#17402
opened Apr 29, 2025 by
Alexei-V-Ivanov-AMD
Remove Zephyr 7B from everywhere possible in CI
ready
ONLY add when PR is ready to merge/full CI is needed
structured-output
#17401
opened Apr 29, 2025 by
hmellor
[v1][Spec Decode] Make sliding window compatible with eagle prefix caching
v1
#17398
opened Apr 29, 2025 by
heheda12345