-
Notifications
You must be signed in to change notification settings - Fork 132
Pull requests: NVIDIA-NeMo/Automodel
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: nemotron_nano_9b_squad_peft checkpoint robustness thresholds
#1944
opened Apr 21, 2026 by
adil-a
Collaborator
Loading…
2 tasks done
fix: nemotron_nano_9b_squad checkpoint robustness thresholds
#1943
opened Apr 21, 2026 by
adil-a
Collaborator
Loading…
2 of 3 tasks
fix: widen qwen3_moe_30b_hellaswag ckpt-robustness KL threshold to 3e-2
#1942
opened Apr 21, 2026 by
adil-a
Collaborator
Loading…
2 of 3 tasks
fix: AC silently skipped on all registered VLMs — flatten ModuleList
community-request
#1941
opened Apr 21, 2026 by
khazic
Contributor
Loading…
3 tasks done
fix: widen hf_kl_threshold for customizer_gpt_oss_full_sft_chat
#1940
opened Apr 21, 2026 by
adil-a
Collaborator
Loading…
2 of 3 tasks
fix: bump hf_kl_threshold for customizer_nemotron_nano_full_sft_chat
#1939
opened Apr 21, 2026 by
adil-a
Collaborator
Loading…
2 of 3 tasks
fix: bump hf_kl_threshold for customizer_llama_3_2_1b_full_sft_chat
#1938
opened Apr 21, 2026 by
adil-a
Collaborator
Loading…
2 of 3 tasks
fix: qwen2_5_7b_squad ckpt robustness thresholds for transformers v5.5
#1937
opened Apr 21, 2026 by
adil-a
Collaborator
Loading…
3 tasks done
fix: gemma_3_270m_squad_peft HF KL regression in ckpt robustness
docs-only
With great power comes great responsibility.
r0.4.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#1933
opened Apr 21, 2026 by
adil-a
Collaborator
Loading…
3 tasks done
fix: gemma_3_270m_squad HF KL regression in ckpt robustness
docs-only
With great power comes great responsibility.
r0.4.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#1932
opened Apr 21, 2026 by
adil-a
Collaborator
Loading…
4 tasks done
cp: Trigger Testing CICD
fix: qlora ckpt loading (1549) into r0.4.0
cherry-pick
Run CICD
#1920
opened Apr 20, 2026 by
svcnvidia-nemo-ci
Contributor
Loading…
feat: add Context Parallelism support for Gemma4 dense and MoE VLM
community-request
#1914
opened Apr 20, 2026 by
khazic
Contributor
Loading…
3 tasks done
fix(moe): align EP expert weight dtype with activation dtype
#1913
opened Apr 20, 2026 by
jQizhang
Contributor
Loading…
1 of 3 tasks
fix: fp32 master weights for custom MoE models under FSDP2
#1896
opened Apr 17, 2026 by
zpqiu
Contributor
Loading…
1 of 3 tasks
ci(feat): use AWS ephemeral runners for external contributors
#1892
opened Apr 17, 2026 by
ko3n1g
Contributor
Loading…
3 tasks
feat: Qwen3.5 VLM TP+PP support with per-microbatch grad reduce-scatter knob
r0.4.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#1859
opened Apr 15, 2026 by
akoumpa
Contributor
Loading…
6 tasks done
docs: add embedding + reranker model coverage
docs-only
With great power comes great responsibility.
#1843
opened Apr 14, 2026 by
akoumpa
Contributor
Loading…
3 tasks
feat: add extract_submodel parameter to build_encoder_backbone
#1838
opened Apr 14, 2026 by
oliverholworthy
Contributor
•
Draft
2 of 3 tasks
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.