Pull requests: HabanaAI/vllm-fork

Pull requests list

Added lora manager tests
#670 opened Jan 8, 2025 by rsshaik1 Draft
[SW-197036] - use torch._scaled_mm with hpu
#660 opened Dec 22, 2024 by nirda7
Draft: Delayed prompts
#659 opened Dec 20, 2024 by kamil-kaczor Draft
Chunked Prefill
#656 opened Dec 20, 2024 by hlahkar Draft
Fix: selecting correct backend for MultiHeadAttention (label: habana)
#645 opened Dec 18, 2024 by adobrzyniewicz-habana
Fix model OOM issue in llama-405 and mixtral - 2nd attempt (label: habana)
#644 opened Dec 18, 2024 by afierka-intel
Selective merged prefill
#643 opened Dec 18, 2024 by xuechendi
Multimodality fix for llava (label: habana)
#641 opened Dec 17, 2024 by adobrzyniewicz-habana
Add inc fp8 quantization documentation
#635 opened Dec 16, 2024 by nirda7
[WIP] Add HPU support to vLLM v1 - cont.
#609 opened Dec 10, 2024 by kzawora-intel
21 of 23 tasks
Add in Dockerfile.hpu.ubi
#602 opened Dec 9, 2024 by Xaenalt
Add real BS & seq_len to profiling
#601 opened Dec 9, 2024 by kamil-kaczor
Documentation update for 1.19
#597 opened Dec 5, 2024 by PatrykWo
Multi models support for upstream
#590 opened Dec 4, 2024 by xuechendi
Remove assert for alibi in case of FusedSDPA.
#587 opened Dec 4, 2024 by itaraban
Update documentation
#555 opened Nov 26, 2024 by michalkuligowski Draft
[SW-201504] Trigger Internal Tests
#538 opened Nov 24, 2024 by RonBenMosheHabana
Bump aiohttp from 3.10.10 to 3.10.11 (label: dependencies)
#536 opened Nov 21, 2024 by dependabot bot
Clean-up LoRA flow
#518 opened Nov 18, 2024 by SanjuCSudhakaran Draft
1.19 documentation update
#507 opened Nov 15, 2024 by kzawora-intel Draft