Skip to content

Pull requests: NVIDIA-NeMo/Automodel

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

add retrieval bi-encoder and cross-encoder nightly tests
#2042 opened Apr 24, 2026 by oliverholworthy Contributor Draft
3 tasks
feat: DeepSeek V4 Flash support community-request
#2039 opened Apr 24, 2026 by khazic Contributor Loading…
4 of 7 tasks
ci: Update transformers to latest version 5.6.2
#2038 opened Apr 24, 2026 by svcnvidia-nemo-ci Contributor Loading…
ci: Update transformers to latest version 5.6.0
#2015 opened Apr 23, 2026 by svcnvidia-nemo-ci Contributor Loading…
feat: lazy dataset preprocessing community-request waiting-on-customer Waiting on the original author to respond
#2007 opened Apr 23, 2026 by edjson Loading…
2 of 3 tasks
fix: llama3_3_nemotron_super_49B_squad checkpoint robustness thresholds
#1950 opened Apr 21, 2026 by adil-a Collaborator Loading…
2 of 4 tasks
fix: lower qwen3_moe_30b_lora local_batch_size to avoid CI OOM
#1948 opened Apr 21, 2026 by adil-a Collaborator Loading…
3 of 4 tasks
fix: fallback to safetensors if using peft
#1924 opened Apr 20, 2026 by akoumpa Contributor Loading…
3 tasks
fix: llava onevision recipes
#1922 opened Apr 20, 2026 by akoumpa Contributor Loading…
3 tasks
feat: add Context Parallelism support for Gemma4 dense and MoE VLM community-request waiting-on-customer Waiting on the original author to respond
#1914 opened Apr 20, 2026 by khazic Contributor Loading…
3 tasks done
fix: fp32 master weights for custom MoE models under FSDP2
#1896 opened Apr 17, 2026 by zpqiu Contributor Loading…
1 of 3 tasks
ci: onboard GB200 testing
#1893 opened Apr 17, 2026 by ko3n1g Contributor Loading…
5 tasks
fix: lora with gemma4 large models on Spark single GPU
#1866 opened Apr 15, 2026 by athitten Contributor Draft
3 tasks
docs: add embedding + reranker model coverage docs-only With great power comes great responsibility.
#1843 opened Apr 14, 2026 by akoumpa Contributor Loading…
3 tasks
ci: add sync-skills workflow
#1841 opened Apr 14, 2026 by ko3n1g Contributor Loading…
2 tasks
feat: add extract_submodel parameter to build_encoder_backbone
#1838 opened Apr 14, 2026 by oliverholworthy Contributor Draft
2 of 3 tasks
refactor: Remove separate moe_mesh references community-request waiting-on-customer Waiting on the original author to respond
#1824 opened Apr 14, 2026 by edjson Loading…
3 tasks
ci: Update transformers to latest version 5.5.4
#1823 opened Apr 14, 2026 by svcnvidia-nemo-ci Contributor Loading…
feat: Add Ministral3-3B bidirectional encoder training scripts
#1809 opened Apr 13, 2026 by rnyak Collaborator Loading…
3 tasks
fix: Set CUDA arch list for UCCL EP build to SM90+ r0.4.0 Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#1808 opened Apr 13, 2026 by thomasdhc Contributor Loading…
3 tasks
ci: add HF doc contract test for pretraining instantiation r0.4.0 Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
#1802 opened Apr 13, 2026 by adil-a Collaborator Draft
3 tasks done
fix(ci): cache heavy CUDA wheels in install-test (#1796) community-request waiting-on-maintainers Waiting on maintainers to respond
#1798 opened Apr 13, 2026 by KevinSailema Loading…
3 tasks done
ci: Update transformers to latest version 5.5.3
#1789 opened Apr 12, 2026 by svcnvidia-nemo-ci Contributor Loading…
ProTip! Filter pull requests by the default branch with base:main.