Skip to content

feat(loss): add --pg-loss-divisor: first-class constant-divisor pg_loss normalization (Dr.GRPO)#2060

Closed
EazyReal wants to merge 1 commit into
THUDM:mainfrom
EazyReal:upstream-pr/drgrpo-reducer-example
Closed

feat(loss): add --pg-loss-divisor: first-class constant-divisor pg_loss normalization (Dr.GRPO)#2060
EazyReal wants to merge 1 commit into
THUDM:mainfrom
EazyReal:upstream-pr/drgrpo-reducer-example

Add --pg-loss-divisor: first-class constant-divisor pg_loss normaliza…

0baefb5
Select commit
Loading
Failed to load commit list.
Sign in for the full log view