Skip to content

Pull requests: SemiAnalysisAI/InferenceX

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add new feature for kimi k2.5 mtp support
#1115 opened Apr 22, 2026 by haic0 Collaborator Draft
Standardize eval score verification across workflow templates
#1112 opened Apr 22, 2026 by Oseltamivir Collaborator Loading…
3 tasks done
Add haic0 patch for AMD kimi k2.5 MTP support
#1108 opened Apr 21, 2026 by haic0 Collaborator Draft
Bump max-model-len headroom from +200 to +256 (block-size alignment)
#1104 opened Apr 20, 2026 by Ankur-singh Collaborator Loading…
1 of 2 tasks
[WIP] [experimental] agentx integration
#1103 opened Apr 20, 2026 by cquil11 Collaborator Draft
[WIP] [AMD/ROCM] atom glm5.1 fp4 on mi355x AMD
#1043 opened Apr 16, 2026 by seungrokj Collaborator Loading…
[WIP] [AMD/ROCM] atom minimaxm2.5 fp4 on mi355x AMD
#1042 opened Apr 16, 2026 by seungrokj Collaborator Loading…
[WIP] [AMD/ROCM] atom qwen fp8/bf16 on mi355x AMD
#1040 opened Apr 16, 2026 by seungrokj Collaborator Loading…
Add vLLM dynamic scheduler reconfigure for single-server sweeps
#1029 opened Apr 14, 2026 by JordanNanos Collaborator Loading…
3 of 6 tasks
[WIP] Update Qwen3.5 FP8 B200 SGLang sweep-enabled
#1027 opened Apr 13, 2026 by Ankur-singh Collaborator Loading…
[WIP] Update Qwen3.5 FP4 B200 SGLang sweep-enabled
#1018 opened Apr 10, 2026 by Ankur-singh Collaborator Loading…
[experimental] Add multinode profiling workflow experimental github_actions Pull requests that update GitHub Actions code
#1007 opened Apr 6, 2026 by hbarclay Collaborator Loading…
ProTip! Add no:assignee to see everything that’s not assigned.