Pull requests: mlc-ai/mlc-llm

Pull requests list

[KVCache] Per Layer Sliding Window
#3248 opened Jun 10, 2025 by joshua-j-hong
Add support for gemma3_instruction to genconfig
#3224 opened May 9, 2025 by akshatsehgal
[Refactor] PagedKVCache spec for MLC-LLM
#3203 opened Apr 14, 2025 by annanyapr
Refactored random.h to have PhiloxRandomGenerator
#3181 opened Mar 18, 2025 by annanyapr
[Model] Qwen-2-VL Support
#3125 opened Feb 10, 2025 by nihalgeorge01 Draft
[Bench] Add support for multiple backend
#3037 opened Nov 20, 2024 by cyx-6 Draft
[Model] Add use_qk_norm option for Cohere model
#2877 opened Sep 2, 2024 by tlopex
[Serving] PagedKVCache Quantization
#2663 opened Jul 16, 2024 by davidpissarra
[Bench] Add bench for GSM8K eval
#2585 opened Jun 16, 2024 by Hzfengsy
[Bench] Add bench for MMLU eval
#2584 opened Jun 16, 2024 by Hzfengsy
Add docker container support
#1271 opened Nov 15, 2023 by Sing-Li