-
Notifications
You must be signed in to change notification settings - Fork 2.9k
Pull requests: volcengine/verl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[worker, training_utils] fix: Engine Metric Aggregation (Simple)
#4785
opened Jan 4, 2026 by
JacobHelwig
Loading…
[recipe] fix: workaround for making the one-step off-policy recipe compatible with IPv6 environments on Ascend NPU
#4782
opened Jan 4, 2026 by
ji-huazhong
Loading…
7 tasks
[worker, training_utils] fix: Engine Metric Aggregation
#4778
opened Jan 4, 2026 by
JacobHelwig
Loading…
[checkpoint] fix: normalize LoRA state_dict keys when saving HF model
#4770
opened Jan 2, 2026 by
yurekami
Loading…
4 of 5 tasks
[trainer,data,algo] feat: Unify SFT and PPO Pipelines
#4766
opened Jan 2, 2026 by
JacobHelwig
Loading…
[WIP][tool] feature: scheduling analysis based on profiling data
#4746
opened Dec 31, 2025 by
tardis-key
•
Draft
7 tasks
[rollout] fix: remove redundant system message added in ToolAgentLoop._handle_processing_tools_state
#4744
opened Dec 30, 2025 by
m-Just
Loading…
6 of 7 tasks
fix(fully_async): use rollout.total_epochs for total_rollout_steps calculation
#4736
opened Dec 30, 2025 by
FarrenZhang
Loading…
2 tasks done
[hardware] feat: support qwen3vl ON ASCEND NPU
#4734
opened Dec 30, 2025 by
Seren-hao
Loading…
7 tasks
fix(rollout): correct infer_tp calculation for multi-server weight sync
#4728
opened Dec 29, 2025 by
yurekami
Loading…
2 tasks
fix: add support for video data in
Agent Loop and Qwen3 VL
#4727
opened Dec 29, 2025 by
kaln27
Loading…
5 tasks done
fix(fsdp): handle param offloading in get_actor_weights_info
#4726
opened Dec 29, 2025 by
yurekami
Loading…
2 of 3 tasks
fix(registry): make reward manager registration idempotent
#4725
opened Dec 29, 2025 by
yurekami
Loading…
2 tasks done
examples: Add memory optimization examples and documentation
#4722
opened Dec 29, 2025 by
yurekami
Loading…
2 of 3 tasks
tests: Add gradient accumulation tests for all loss aggregation modes
#4721
opened Dec 29, 2025 by
yurekami
Loading…
1 of 2 tasks
[recipe, fsdp] feat: support GPT-OSS-20B DAPO training script on ASCEND NPU
#4716
opened Dec 29, 2025 by
mikequan0425
•
Draft
2 of 7 tasks
[perf] fix: correct torch profiler logic for distributed trace collection
#4707
opened Dec 28, 2025 by
AkiRusProd
Loading…
fix: use model_dump() for proper Pydantic serialization in token2text
#4706
opened Dec 28, 2025 by
yurekami
Loading…
3 tasks done
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.