Skip to content

Pull requests: NVIDIA-NeMo/Automodel

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix: nemotron_nano_9b_squad_peft checkpoint robustness thresholds
#1944 opened Apr 21, 2026 by adil-a Collaborator Loading…
2 tasks done
fix: nemotron_nano_9b_squad checkpoint robustness thresholds
#1943 opened Apr 21, 2026 by adil-a Collaborator Loading…
2 of 3 tasks
fix: widen qwen3_moe_30b_hellaswag ckpt-robustness KL threshold to 3e-2
#1942 opened Apr 21, 2026 by adil-a Collaborator Loading…
2 of 3 tasks
fix: widen hf_kl_threshold for customizer_gpt_oss_full_sft_chat
#1940 opened Apr 21, 2026 by adil-a Collaborator Loading…
2 of 3 tasks
fix: bump hf_kl_threshold for customizer_nemotron_nano_full_sft_chat
#1939 opened Apr 21, 2026 by adil-a Collaborator Loading…
2 of 3 tasks
fix: bump hf_kl_threshold for customizer_llama_3_2_1b_full_sft_chat
#1938 opened Apr 21, 2026 by adil-a Collaborator Loading…
2 of 3 tasks
fix: qwen2_5_7b_squad ckpt robustness thresholds for transformers v5.5
#1937 opened Apr 21, 2026 by adil-a Collaborator Loading…
3 tasks done
fix: gemma_3_270m_squad_peft HF KL regression in ckpt robustness docs-only With great power comes great responsibility. r0.4.0 Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#1933 opened Apr 21, 2026 by adil-a Collaborator Loading…
3 tasks done
fix: gemma_3_270m_squad HF KL regression in ckpt robustness docs-only With great power comes great responsibility. r0.4.0 Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#1932 opened Apr 21, 2026 by adil-a Collaborator Loading…
4 tasks done
fix: fallback to safetensors if using peft
#1924 opened Apr 20, 2026 by akoumpa Contributor Draft
3 tasks
fix: llava onevision recipes
#1922 opened Apr 20, 2026 by akoumpa Contributor Draft
3 tasks
cp: fix: qlora ckpt loading (1549) into r0.4.0 cherry-pick Run CICD Trigger Testing CICD
#1920 opened Apr 20, 2026 by svcnvidia-nemo-ci Contributor Loading…
ci: Add test_recipes for custom test scope docs-only With great power comes great responsibility. r0.4.0 Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#1915 opened Apr 20, 2026 by thomasdhc Contributor Draft
3 tasks
feat: add Context Parallelism support for Gemma4 dense and MoE VLM community-request
#1914 opened Apr 20, 2026 by khazic Contributor Loading…
3 tasks done
fix(moe): align EP expert weight dtype with activation dtype
#1913 opened Apr 20, 2026 by jQizhang Contributor Loading…
1 of 3 tasks
fix: fp32 master weights for custom MoE models under FSDP2
#1896 opened Apr 17, 2026 by zpqiu Contributor Loading…
1 of 3 tasks
ci: onboard GB200 testing
#1893 opened Apr 17, 2026 by ko3n1g Contributor Loading…
3 tasks
ci(feat): use AWS ephemeral runners for external contributors
#1892 opened Apr 17, 2026 by ko3n1g Contributor Loading…
3 tasks
fix: lora with gemma4 large models on Spark single GPU
#1866 opened Apr 15, 2026 by athitten Contributor Draft
3 tasks
feat: Qwen3.5 VLM TP+PP support with per-microbatch grad reduce-scatter knob r0.4.0 Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#1859 opened Apr 15, 2026 by akoumpa Contributor Loading…
6 tasks done
docs: add embedding + reranker model coverage docs-only With great power comes great responsibility.
#1843 opened Apr 14, 2026 by akoumpa Contributor Loading…
3 tasks
ci: add sync-skills workflow
#1841 opened Apr 14, 2026 by ko3n1g Contributor Loading…
2 tasks
feat: add extract_submodel parameter to build_encoder_backbone
#1838 opened Apr 14, 2026 by oliverholworthy Contributor Draft
2 of 3 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.