Pull requests: SemiAnalysisAI/InferenceX
Standardize eval score verification across workflow templates
#1112
opened Apr 22, 2026 by
Oseltamivir
Collaborator
3 tasks done
[AMD/Hyperloom] Tune dsr1-fp8-mi355x-sglang: --num-continuous-decode-steps 4 → 8
#1109
opened Apr 21, 2026 by
lishuoshuo-amd
4 tasks done
Bump max-model-len headroom from +200 to +256 (block-size alignment)
#1104
opened Apr 20, 2026 by
Ankur-singh
Collaborator
1 of 2 tasks
[SGLang broken] Add MI355X config: glm5-fp4-sglang-mtp
vllm/sglang release broken -need to wait
#1091
opened Apr 18, 2026 by
functionstackx
Contributor
Draft
4 of 5 tasks
[sglang broken] Add MI355X config: qwen3.5-fp4-sglang-mtp
vllm/sglang release broken -need to wait
#1078
opened Apr 18, 2026 by
functionstackx
Contributor
3 of 4 tasks
[vllm broken - waiting for 0.20] Add B300 config: kimi-k2.5-int4-vllm
vllm/sglang release broken -need to wait
#1071
opened Apr 17, 2026 by
cquil11
Collaborator
2 tasks
[Do Not Merge] Upgrade Kimi-K2.5-INT4-MI355X-vLLM image to upstream daily image bcc2306cefa4179c548d3e638e7a22a88d281733
sweep-enabled
#1066
opened Apr 17, 2026 by
chunfangamd
Collaborator
[WIP][NV] qwen35 b200 MTP update sglang config
NVIDIA
sweep-enabled
#1065
opened Apr 17, 2026 by
hshrivastava-droid
Collaborator
Add options to override default extra_body and num_prompts when profiler is enabled
#1044
opened Apr 16, 2026 by
devalshahamd
[WIP] [AMD/ROCM] atom glm5.1 fp4 on mi355x
AMD
#1043
opened Apr 16, 2026 by
seungrokj
Collaborator
[WIP] [AMD/ROCM] atom minimaxm2.5 fp4 on mi355x
AMD
#1042
opened Apr 16, 2026 by
seungrokj
Collaborator
[WIP] [AMD/ROCM] atom qwen fp8/bf16 on mi355x
AMD
#1040
opened Apr 16, 2026 by
seungrokj
Collaborator
[Do Not Merge][NV] GLM5 fp8 update sglang container
NVIDIA
sweep-enabled
#1033
opened Apr 15, 2026 by
hshrivastava-droid
Collaborator
[isb1] add converted trace corpus + kv-cache-tester contract helpers
#1032
opened Apr 15, 2026 by
OCWC22
[Do Not Merge] update glm5 fp8 b200 sglang container
NVIDIA
sweep-enabled
#1030
opened Apr 14, 2026 by
hshrivastava-droid
Collaborator
1 task
Add vLLM dynamic scheduler reconfigure for single-server sweeps
#1029
opened Apr 14, 2026 by
JordanNanos
Collaborator
3 of 6 tasks
[WIP] Update Qwen3.5 FP8 B200 SGLang
sweep-enabled
#1027
opened Apr 13, 2026 by
Ankur-singh
Collaborator
[WIP] Update Qwen3.5 FP4 B200 SGLang
sweep-enabled
#1018
opened Apr 10, 2026 by
Ankur-singh
Collaborator
[AMD] Upgrade DeepSeek-R1 MI35x docker to the latest SGLang version 0.5.10
AMD
#1013
opened Apr 8, 2026 by
aarnetalman
Collaborator
Draft
[experimental] Add multinode profiling workflow
experimental
github_actions
Pull requests that update GitHub Actions code
#1007
opened Apr 6, 2026 by
hbarclay
Collaborator
[AMD] feat: MiniMax M2.5 PD Disagg (1P2D) + PIECEWISE cudagraph optimization (+20% throughput)
AMD
vllm/sglang release broken -need to wait
#999
opened Apr 2, 2026 by
ChuanLi1101
Contributor
Draft
6 tasks done