Pull requests: vllm-project/vllm-ascend
Pull requests list
[WIP]Add Func: npugraph_batch_size auto-adjust to different model
#713
opened Apr 28, 2025 by
chris668899
[WIP][Build][0.7.3] Integrate MindIE Turbo into vLLM Ascend
documentation
Improvements or additions to documentation
#708
opened Apr 28, 2025 by
MengqingCao
[Disaggregated Prefill][WIP] P2P Disaggregated Prefill based on llm_datadist
#694
opened Apr 28, 2025 by
whx-sjtu
[MISC] Clean up torch_npu
module:core
module:ops
module:quantization
module:tests
#688
opened Apr 28, 2025 by
wangxiyuan
[Feature] Impl the connector based on the llmdatadist for v1
module:core
#684
opened Apr 27, 2025 by
jianzs
1 of 5 tasks
feat: performance optimization for deepseek
module:quantization
#683
opened Apr 27, 2025 by
zzzzwwjj
[Bugfix] Fix early return in CustomDeepseekV2MoE.forward during profile_run
#682
opened Apr 27, 2025 by
ApsarasX
update chunk prefill torch
module:ops
module:tests
#679
opened Apr 27, 2025 by
ttanzhiqiang
[WIP] Add support for custom DeepSeek modelling in ACL Graph mode
module:core
module:ops
#677
opened Apr 27, 2025 by
yiz-liu
[Feature] Enable disaggregated prefill functionality for v0
module:core
module:tests
#658
opened Apr 25, 2025 by
jianzs
Adjust KV cache shape for compatibility with updated APIs for graph mode
ci/build
#657
opened Apr 25, 2025 by
linfeng-yuan
[MISC] fix format check error
documentation
Improvements or additions to documentation
module:ops
module:tests
module:tools
#654
opened Apr 25, 2025 by
wangxiyuan
[Doc] Add benchmark guide
documentation
Improvements or additions to documentation
#635
opened Apr 23, 2025 by
Potabk
[Misc] format patch to make the code clear
module:core
module:quantization
#613
opened Apr 22, 2025 by
wangxiyuan
[Platform] format platform to make it more clear
module:core
#610
opened Apr 22, 2025 by
wangxiyuan
[Perf] Deepseekv3 performance optimization for eager mode
module:core
module:ops
#598
opened Apr 21, 2025 by
ganyi1996ppo
[Bugfix] Fix the bug of torch_npu that raising segment fault when enable pin_memory while creating a tensor
module:core
#597
opened Apr 21, 2025 by
shen-shanshan