Skip to content

Pull requests: THUDM/slime

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix(ppo): stop corrupting the logged rollout/kl metric
#2114 opened Jun 21, 2026 by EazyReal Contributor Loading…
Fix(rollout): Fail closed on unknown SGLang model names
#2112 opened Jun 21, 2026 by Baiyu-Su Contributor Loading…
fix: support eval-only mode (--num-rollout 0)
#2109 opened Jun 20, 2026 by EazyReal Contributor Loading…
feat(examples/strands_sglang): update to strands-sglang 0.4.2
#2106 opened Jun 20, 2026 by Lawhy Contributor Loading…
Preserve reloadable process group options
#2095 opened Jun 17, 2026 by EazyReal Contributor Draft
fix(scripts): correct model config source path in FP8 low_precision scripts
#2094 opened Jun 17, 2026 by aoshen02 Contributor Loading…
2 tasks done
Add --loss-aggregation for the four ScaleRL pg_loss aggregation modes
#2090 opened Jun 16, 2026 by EazyReal Contributor Loading…
Disk-level delta weight sync
#2089 opened Jun 16, 2026 by nanjiangwill Collaborator Loading…
fix(opd): score teacher logprobs at rollout temperature, not 0
#2085 opened Jun 15, 2026 by EazyReal Contributor Loading…
feat(rl): add REINFORCE advantage estimator
#2083 opened Jun 15, 2026 by EazyReal Contributor Loading…
feat(coding_agent_rl): add SWE-bench harness evaluation path
#2079 opened Jun 15, 2026 by aoshen02 Contributor Draft
3 tasks
fix(rollout): isolate per-trajectory exceptions in generate_and_rm_group
#2078 opened Jun 15, 2026 by aoshen02 Contributor Loading…
fix(script): correct GLM-4.7 expert_model_parallel_size for single-node 8 GPU
#2077 opened Jun 15, 2026 by aoshen02 Contributor Loading…
1 task
Support Qwen3.5-VL (dense + MoE) via Megatron-Bridge
#2075 opened Jun 14, 2026 by demouo Contributor Loading…
feat(rollouts) external rollouts endpoint with publish-only weight sync
#2071 opened Jun 12, 2026 by jvmncs Loading…
4 tasks done
fix(sglang): authenticate engine control-plane and router calls
#2068 opened Jun 12, 2026 by EazyReal Contributor Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.